EDITORIAL


A rather tenuous tradition opens the pages of psychological journals to incoming 
editors. At times these editorials are used to signal the high hopes of the virgin editor, 
and at others to announce changes in policy, format, or procedures. I am taking
advantage of the opportunity offered an incoming editor with some trepidation.

Our editorial system and its editorial and publication lag permit the incoming editor
to write this editorial at a time when he has already spent more than six months processing 
manuscripts. The editorial mantle was transferred from Charles N. Cofer, completing 
six highly successful and notable years as editor of the Psychological Review, to the writer 
on February 1, 1970. In looking over the editorial written by my predecessor and pub- 
lished in the January 1965 issue of the Psychological Review, I find there adequate infor- 
mation on a variety of topics that I would not wish to change. 

The editorial policies announced by Dr. Cofer in 1965 seem to be still applicable to the 
Psychological Review in 1971. The prospective contributor is therefore directed to that 
editorial on such issues as the inappropriateness to the Psychological Review of literature 
reviews, statistical articles, methodological articles, and empirical papers. The guidelines 
stated there on those issues will continue to be used. 

At the same time, I would like to make some additional comments, which reflect no 
real change in policy but a possible specification of editorial attitudes, as well as some 
additions and innovations in terms of content and format of the Psychological Review. 

The Psychological Review continues to be the APA journal concerned with psychological 
theory and its development. As far as theory is concerned, the Review will primarily be 
concerned with the encouragement of theories that are unequivocal, determinate, and 
wherever possible, more than speculative. Psychological theories should, at their best, 
also reflect the cumulative nature of the psychological enterprise. At the same time,
different fields vary in the degree to which they are able to generate, at this point in time,
logically or mathematically unequivocal theories. However, as a goal, such lack of
equivocation must always be in the mind of both the author and the editor. In addition, and
in line with Dr. Cofer's initial comments, a manuscript submitted to the Psychological Review
should reflect a detailed working out of some theory or implication thereof. Minor
comments, which may reflect a critical statement or an elaboration, will be acceptable for the
Theoretical Notes, which will be continued, and prospective contributors might keep in
mind the distinction between a regular contribution and a theoretical note.

The development of psychological theory very frequently impinges on historical con- 
cerns. The Psychological Review will consider for possible publication manuscripts that 
deal with historical themes, but only if such historical analysis is primarily concerned with 
theoretical issues. In addition, articles should make some potential contribution to 
ongoing psychological theory and research. The criterion of impact on current theory
and research is also one that will be applied to articles dealing with the philosophy of 
science as it impinges on psychology. Philosophical disquisitions which display neither
intended nor actual implications for current or future psychological theory and research
will be discouraged. On the other hand, theoretical articles dealing with practical issues,
whether they be social, interpersonal, or technological, will be encouraged. There is no
reason to believe that psychological theory need be alienated from the issues facing our
society.

© 1971 by the American Psychological Association, Inc.

There will be three innovations introduced on an experimental basis in 1971. The first
is a column entitled The State of the Art. In The State of the Art we shall invite
commentary on the current state of psychological theory. Those invited will be, on the one
hand, limited in number, since we will, at most, have one such contribution per issue, and
they will be leaders in theoretical thought and empirical research in a broad spectrum of
psychological activities. This column is designed in the first instance to provide published
commentary on current psychological theory and, second, to generate discussion and
dialogue among psychologists. As a second step in that direction, we have introduced a
Letters to the Editor section. Letters to the Editor in the Psychological Review are not
intended to be archival publications. Their format will be letter formats, and their titles
typically will be selected by the Editor in order to provide information rather than
references. Within the body of a letter, again in order to avoid citable or archival
qualities, references may be made to published material as the author sees fit, but never as a
separate list of references at the end of the letter. It is hoped that these letters will
provide brief comments on issues relating to psychological theory, commenting on
published articles, or on thoughts and ideas generated by the State of the Art column.
Typically, letters to the Editor will not be refereed, but decisions will be made directly
by the Editor, with the hope that publication lag for the Letters column can be kept to
less than six months. Both for letters and for theoretical notes, a scheme originally
developed by E. G. Boring in Contemporary Psychology will be applicable, namely, that a
reply to an article, note, or letter should not occupy more than 50% of the space of the
item to which it constitutes a reply. Such a method will, on the one hand, make
discussions possible, but prevent them from becoming interminable.

Finally, the Editor reserves the right to publish editorials in issues other than the first
and last of his editorship. It is my hope that in these editorials some reflections on
the editorial process, the state of psychological theory as reflected in the submissions to
the Journal, and other issues may be discussed. The Editorial is one of the few rewards
of the editorial position, but I do not expect to take advantage of it too often.

Finally, it should be noted that the quality of a journal is a direct reflection of the quality
of psychological thought going on in the field. It does not require any humility to note
that editors do not significantly direct the nature or quality of thinking in their field.
We shall thus rely on the quality of the contributions and on the extensive help of
referees who will, as in the past, be the best available critical reviewers in any
area in psychology. And it is obviously to these reviewers that both my hopes and,
already, my thanks go at this time.

All of these suggestions, thoughts, and innovations are highly tentative. If any of them
work to the benefit of the psychological community, they will be continued. If
not, they will quickly be abandoned. In the meantime, I can only look forward to the
rather unhappy prospect of losing friends, making enemies, and battling editorial lag,
the common fate of all editors.
THE “SUPERSTITION” EXPERIMENT:
A REEXAMINATION OF ITS IMPLICATIONS FOR THE
PRINCIPLES OF ADAPTIVE BEHAVIOR ¹


J. E. R. STADDON ²
Duke University

AND

VIRGINIA L. SIMMELHAG ³
Scarborough College, University of Toronto


Replication and extension of Skinner's "superstition" experiment showed 
the development of two kinds of behavior at asymptote: interim activities 
(related to adjunctive behavior) occurred just after food delivery; the 
terminal response (a discriminated operant) occurred toward the end of 
the interval and continued until food delivery. These data suggest a view 
of operant conditioning (the terminal response) in terms of two sets of 
principles: principles of behavioral variation that describe the origins of 
behavior “appropriate” to a situation, in advance of reinforcement; and
principles of reinforcement that describe the selective elimination of be- 
havior so produced. This approach was supported by (a) an account of 
the parallels between the Law of Effect and evolution by means of natural 
selection, (b) its ability to shed light on persistent problems in learning 
(e.g., continuity vs. noncontinuity, variability associated with extinction, 
the relationship between classical and instrumental conditioning, the con- 
troversy between behaviorist and cognitive approaches to learning), and 
(c) its ability to deal with a number of recent anomalies in the learning 
literature (“instinctive drift,” auto-shaping, and auto-maintenance). The 
interim activities were interpreted in terms of interactions among motiva- 
tional systems, and this view was supported by a review of the literature 
on adjunctive behavior and by comparison with similar phenomena in 
ethology (displacement, redirection, and "vacuum" activities). The pro- 
posed theoretical scheme represents a shift away from hypothetical "laws of 
learning" toward an interpretation of behavioral change in terms of inter- 
action and competition among tendencies to action according to principles 
evolved in phylogeny. 


The field of learning has undergone increasing fractionation in recent
years. Interest in "miniature systems" and exact theories of local effects
has grown to the detriment of any attempt at overall integration.
Consequently, as one perceptive observer has noted:

At times, one senses a widespread feeling of discouragement about the
prospects of ever getting clear on the fundamentals of conditioning.
Attempts to arrive at firm decisions about alternative formulations rarely
produce incisive results. Every finding seems capable of many explanations.
Issues become old, shopworn, and disappear without a proper burial
[Jenkins, 1970, pp. 107-108].

The present article outlines an attempt to redress this imbalance. It is
organized around the problem of “superstitious” behavior, originated by
Skinner some years ago, which plays a crucial part in the empirical and
theoretical foundations of current views of learning. Discussion of a
replication of Skinner's original experiment leads to an account of the
relationships between evolution and learning, and a system of
classification derived therefrom. The paper concludes with a theoretical
account of "superstition" and some related phenomena.

¹ Research was supported by grants from the National Institute of Mental
Health, United States Public Health Service, and the National Science
Foundation to Duke University and the University of Toronto. The authors
thank Nancy K. Innis for advice and assistance. Janice Frank, Irving
Diamond, Carl Erickson, Richard Gilbert, and Peter Klopfer kindly commented
on earlier versions of this paper. A shorter version of the experimental
part of this work was presented in partial fulfillment of the requirements
for the MA degree at the University of Toronto (Simmelhag, 1968).

² Requests for reprints should be sent to J. E. R. Staddon, Department of
Psychology, Duke University, Durham, North Carolina 27706.

³ Now at Duke University.

THE “SUPERSTITION” EXPERIMENT


In his classic experiment on “supersti- 
tious” behavior, Skinner (1948) showed 
that the mere delivery of food to a hungry 
animal is sufficient to produce operant con- 
ditioning. The pigeons in that experiment 
were allowed 5-second access to food every 
15 seconds, and food delivery was independent
of their behavior. Nevertheless, nearly
every pigeon developed a recognizable form 
of stereotyped, superstitious behavior that 
became temporally correlated with food de- 
livery as training progressed. 

Skinner’s (1961) analysis of this phenomenon
is a straightforward application of
the Law of Effect:

The conditioning process is usually obvious. The 
bird happens to be executing some response as 
the hopper appears; as a result it tends to repeat 
this response. If the interval before the next
presentation is not so great that extinction takes 
place, a second "contingency" is probable. This 
strengthens the response still further and subse- 
quent reinforcement becomes more probable 


[p. 405]. 


Skinner's observations were quickly repeated
in a number of laboratories. The
apparent simplicity and reliability of the
phenomenon, coupled with the plausibility
of Skinner’s interpretation of it, and the
more exciting attractions of work on
reinforcement schedules then developing,
effectively stifled further study of this situation.
However, both the experiment and his
explication played a crucial role in advancing
Skinner’s theoretical view of operant
behavior as the strengthening of unpredictably
generated (“emitted”) behavior by the
automatic action of reinforcers.

Two kinds of data obtained in recent years
raise new questions about “superstition”
in this sense. First, experiments with
time-related reinforcement schedules have shown
the development of so-called “mediating”
behavior during the waiting period, when
the animal is not making the reinforced
response. Thus, on schedules which require
the animal to space his responses a few
seconds apart if they are to be effective in
producing reinforcement (spaced-responding
schedules), pigeons often show activities
such as pacing and turning circles.
Similarly, on fixed-interval schedules, in which
the first response t seconds after the
preceding reinforcement is effective in
producing reinforcement, pigeons may show
similar behavior during the postreinforcement
"pause" when they are not making the
reinforced response. Other species show
activities of this sort in the presence of
appropriate environmental stimuli; for
example, schedule-induced polydipsia, in which
rats reinforced with food on temporal
reinforcement schedules show excessive drinking
if water is continuously available (Falk,
1969). None of these activities is
reinforced, in the sense of being contiguous with
food delivery, yet they are reliably produced
in situations similar in many respects to
Skinner's superstition procedure. Possibly,
therefore, some of the activities labeled
superstitious by Skinner, and attributed by
him to accidental reinforcement of
spontaneously occurring behavior, may instead
reflect the same causal factors as these
mediating activities.

Second, a number of experiments have
demonstrated the development of behavior
in operant conditioning situations by a
process more reminiscent of Pavlovian
(classical) conditioning than Law of Effect
learning as commonly understood. Breland and
Breland (1961) reported a series of
observations showing that with continued operant
training, species-specific behavior will often
emerge to disrupt an apparently well-learned
operant response. In the cases they
describe, behavior closely linked to food
(presumably reflecting an instinctive mechanism)
began to occur in advance of food delivery,
in the presence of previously neutral stimuli
(“instinctive drift”). Since these “irrelevant”
activities interfered with food delivery
by delaying the occurrence of the reinforced
response, they cannot be explained by the
Law of Effect. A description in terms of
stimulus substitution, a principle usually
associated with Pavlovian conditioning, is
better, although still not completely
satisfactory. More recently, Brown and Jenkins
(1968) have shown that hungry pigeons can
be trained to peck a lighted response key
simply by illuminating the key for a few
seconds before food delivery. Fewer than a
hundred light-food pairings are usually
sufficient to bring about key pecking. The
relationship between this "auto-shaping"
procedure and Pavlovian conditioning is further
emphasized by an experiment reported by 
Williams and Williams (1969). They found 
that auto-shaped key pecking is maintained 
even if the key peck turns off the light on the 
key and thus prevents food delivery on that 
occasion. All these experiments show the 
occurrence of food-related behaviors, in 
anticipation of food, under conditions more 
or less incompatible with the Law of Effect. 

The auto-shaping procedure is operationally
identical to Pavlovian conditioning
with short delay (the light-food interval in 
these experiments is typically 8 seconds). 
Therefore the eventual emergence of food- 
related behavior, in anticipation of food de- 
livery, is not altogether surprising—although 
the directed nature of key pecking has no 
counterpart in principles of conditioning that 
take salivation as a model response. The 
superstition situation is also equivalent to a 
Pavlovian procedure—in this case temporal 
conditioning, in which the UCS (food) is 
simply presented at regular intervals. Per- 
haps, therefore, prolonged exposure to this 
situation will also lead to the emergence of 
food-related behavior in anticipation of food. 
Possibly the superstitious behavior described 
by Skinner includes activities of this sort, 
that occur in anticipation of food, as well as 
mediating activities that occur just after food
delivery.


The present experiment affords an op- 
portunity to test these ideas. It provides 
comparative data on the effect of fixed versus 
variable interfood intervals on superstitious 
responding (Skinner used only fixed inter- 
vals), as well as allowing a comparison be- 
tween response-dependent and response- 
independent fixed-interval schedules. The 
experiment also extends Skinner's work by 
recording in some detail both the kind and 
time of occurrence of superstitious activities. 
The emphasis is on the steady-state adapta- 
tion to the procedures, but some data on the 
course of development of superstition are 
presented. 

We hope to show that careful study of the 
superstition situation makes necessary a
revision of Skinner's original interpretation
and, by extension, requires a shift of
emphasis in our view of adaptive behavior.


Method 

Subjects 

Six pigeons were used: four white Carneaux, 
two with experimental experience (Birds 31 and 
29) and two experimentally naive (Birds 47 and 
49). Two other pigeons were of a local (Toronto, 
Ontario) breed and were experimentally naive 
(Birds 40 and 91). All the birds were main- 
tained at 80% of their free-feeding weights 
throughout. 


Apparatus 


Two standard Grason-Stadler operant condition- 
ing chambers were used. The response keys were 
covered with white cardboard except during the 
response-dependent condition when one key was ex- 
posed and transilluminated with white light. Data 
were recorded by a clock, digital counters, and an 
event recorder. Food delivery was controlled 
automatically by relays and timers. Behaviors 
were recorded via push buttons operated by an 
observer. A tape recorder was used to record 
comments and corrections. Except for the push 
buttons and the tape recorder, all programming 
and recording apparatus was located in a separate 
room. White noise was present in the experi- 
mental chamber and, together with the noise of 
the ventilating fan, served to mask extraneous 
sounds. 


Procedure 


Three schedules of food delivery were used: 
(a) A response-independent fixed-interval (FI) 
schedule in which the food magazine was pre- 
sented at 12-second intervals. (b) A response- 
independent variable-interval (VI) schedule in 
which the food magazine was presented on the 
average every 8 seconds. The following sequence 
of interreinforcement intervals was used,
programmed by a loop of 16-millimeter film with
holes punched at appropriate intervals: 3, 6, 6,
12, 9, 7, 3, 10, 21, 6, 5, 11, 8, 5, 3, 9, 4, 9, 5, [illegible],
3, 8, 9, 4, 7, 12, 11, 3, 6, 5, and 9 seconds. (c) A
response-dependent FI schedule in which food was 
delivered (reinforcement occurred) for the first 
key peck 12 seconds or more after the preceding 
reinforcement. 
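
The three schedules can be sketched in code as follows. This is only an illustrative sketch under stated assumptions, not the relay-and-timer programming actually used: the function names are hypothetical, the 2-second food access is ignored, the start of the session is treated as if it were a preceding food delivery, and only the first twelve legible values of the VI sequence are included.

```python
from itertools import cycle

FI_SECONDS = 12  # fixed interfood interval given in the Procedure

# Assumption: only the first twelve legible values of the published VI
# sequence are used; the complete loop is not reproduced here.
VI_INTERVALS_LEGIBLE = [3, 6, 6, 12, 9, 7, 3, 10, 21, 6, 5, 11]


def response_independent_times(intervals, n_deliveries):
    """Food times when delivery ignores behavior.

    The interval list is repeated like a punched loop, and each food time
    is the running sum of the intervals so far.
    """
    times, clock = [], 0
    for interval, _ in zip(cycle(intervals), range(n_deliveries)):
        clock += interval
        times.append(clock)
    return times


def response_dependent_fi_times(peck_times, interval=FI_SECONDS):
    """Food times on response-dependent FI.

    The first peck occurring `interval` seconds or more after the preceding
    food delivery is reinforced; earlier pecks have no effect.
    """
    times, last_food = [], 0
    for peck in sorted(peck_times):
        if peck - last_food >= interval:
            times.append(peck)
            last_food = peck
    return times


# Response-independent FI 12: food every 12 seconds regardless of behavior.
fi_times = response_independent_times([FI_SECONDS], 3)
# Response-dependent FI 12 with hypothetical peck times (in seconds).
rd_times = response_dependent_fi_times([5, 12.5, 13, 20, 24.5, 30])
```

The only difference between the two FI procedures in this sketch is whether a peck is needed to trigger delivery once the interval has elapsed, which is the comparison the experiment is designed to make.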

Food delivery involved 2-second access to mixed
grain. Sessions ended after the sixty-fourth food
delivery and the pigeons were run daily.

Habituation sessions. All the birds were given
a number of daily 10-minute sessions when no food
was delivered; the birds were simply placed in the
chamber and their behavior observed and recorded.
The birds received the following numbers of such
sessions: Bird 31, 3; Bird 29, 3; Bird 47, 15;
Bird 49, 15; Bird 40, 7; Bird 91, 7. Note that the
experimentally naive birds received more
habituation exposure.

Response-independent training. Four of the 
pigeons were then given a number of sessions on 
each of the two response-independent procedures, 
either FI followed by VI, or the reverse. The 
birds received the procedures in the indicated 
order (number of sessions in parentheses): Bird 
31: FI 12 (26), VI 8 (111); Bird 29: VI 8 (26),
FI 12 (109); Bird 47: VI 8 (36), FI 12 (36); 
Bird 49: FI 12 (37), VI 8 (36). 

Response-dependent training. Two of the birds 
already trained on the response-independent pro- 
cedures were then switched to the response- 
dependent FI 12 for the following numbers of 
sessions: Bird 31, 37; Bird 47, 52. The two 
naive birds (40 and 91), after their habituation 
sessions, were given one session of response-inde- 
pendent FI 12 when the food magazine operated 
every 12 seconds, but contained no grain. This 
was to habituate these birds to the sound of the 
mechanism. The next day these two birds were 
placed in the experimental box with the food maga- 
zine continuously available for 10 minutes. The 
following day the two birds were introduced to 
the response-dependent FI 12-second schedule. A 
small piece of black tape was placed on the light 
response key, as an inducement to pecking, for this 
first session only. All the birds pecked the key 
during the first session of the response-dependent 
procedure. This rather elaborate key-training pro- 
cedure was designed both to prevent the experi- 
menter from shaping the naive birds’ behavior, and 
to avoid the possibility of “superstitious”
conditioning, which might be entailed by some form
of auto-shaping. Bird 40 received a total of 45
sessions of the response-dependent FI 12-second
schedule; Bird 91 received 38.

Response description and scoring. Response 
categories were arrived at on the basis of initial 
observation during the habituation sessions and 
were altered as necessary to accommodate new 
behaviors. The names, descriptions, and numbers 
of the categories appear in Table 1. Responses 
were scored (by pushing the appropriate button) 
in two ways, either discretely or continuously. 
If a response tended to occur in discrete units
(e.g., pecking), then the appropriate button was
pushed each time an instance of the response
occurred. The observer was the same throughout
(VLS), and the maximum recordable rate for
discrete responses was 3-4 per second. A
continuous response is one which took an indefinite
amount of time (e.g., facing magazine wall); the
appropriate button was pressed throughout the
duration of a continuous response. Discrete
responses were pecking (wall, key, or floor) and
quarter circles; all the rest were continuous
responses. In general, the response categories were
mutually exclusive. The only exception is Facing
magazine wall (R₁), which at various times
occurred with Flapping wings (R₅), Moving along
magazine wall (R₈), and Pecking (R₇).
Results


The data of interest in this experiment are 
the kind and amount of behavior at different 
points in time following the delivery of food.
As training progressed, a systematic pattern
of behavior as a function of postfood time
began to emerge. The properties of this
steady-state pattern for VI and FI schedules
are discussed first, followed by a description
of the changes that took place during acquisi- 
tion. 


Steady-State Behavior 

In the steady state, the behavior developed 
under both the FI and VI procedures fell 
reliably into two classes: (a) The terminal 
response was the behavior that consistently 
occurred just before food delivery. It
began 6-8 seconds after food delivery on the
FI procedures, and about 2 seconds after
food on the VI procedure, and usually
continued until food delivery. (b) A number
of activities usually preceded the terminal
response in the interval. These activities are
probably indistinguishable from what has
been termed mediating behavior, but we
prefer the more descriptive term interim
activities. These activities were rarely
contiguous with food.

Figure 1 shows the performance averaged
across three sessions of steady-state
responding under all conditions for all the pigeons.
The left-hand panels show the
response-dependent FI schedule; the middle panels, the
response-independent (superstitious) FI
schedule; and the right-hand panels, the
response-independent VI procedure, for the
four birds exposed to each. The graphs
show the probability (relative frequency)
with which each of the activities occurred
during each second of postfood time. Each
bird shows the clear division between
terminal and interim activities already alluded to.
Excluding R₁ and the results for Bird 29
on VI, Pecking (R₇) was the terminal
response for all the response-independent
procedures. For Bird 29, the terminal response
Head in magazine (R₁₁) became an interim
activity following the shift from VI to FI
and was replaced as terminal response by
Pecking. Pecking remained the terminal
response for this bird throughout a sequence
TABLE 1
DESCRIPTION OF OBSERVED ACTIVITIES

Response no.  Name                                Description

R₁            Magazine wall                       An orientation response in which the bird's head and body
                                                  are directed toward the wall containing the magazine.
R₂            Pecking key                         Pecking movements directed at the key.
R₃            Pecking floor                       Pecking movements directed at the floor.
R₄            ¼ circle                            A response in which a count of one ¼ circle would be given
                                                  for turning 90° away from facing the magazine wall, a count
                                                  of two for turning 180° away, three for 270°, and four for
                                                  360°.
R₅            Flapping wings                      A vigorous up and down movement of the bird's wings.
R₆            Window wall                         An orientation response in which the bird's head and body
                                                  are directed toward the door of the experimental chamber
                                                  containing the observation window.
R₇            Pecking                             Pecking movements directed toward some point on the
                                                  magazine wall. This point generally varied between birds
                                                  and sometimes within the same bird at different times.
R₈            Moving along magazine wall          A side-stepping motion with breastbone close to the
                                                  magazine wall, a few steps to the left followed by a few
                                                  steps to the right, etc. Sometimes accompanied by (a) beak
                                                  pointed up to ceiling, (b) hopping, (c) flapping wings.
R₉            Preening                            Any movement in which the beak comes into contact with the
                                                  feathers on the bird's body.
R₁₀           Beak to ceiling                     The bird moves around the chamber in no particular
                                                  direction with its beak directed upward touching the
                                                  ceiling.
R₁₁           Head in magazine                    A response in which at least the beak or more of the bird's
                                                  head is inserted into the magazine opening.
R₁₂           Head movements along magazine wall  The bird faces the magazine wall and moves its head from
                                                  left to right and/or up and down.
R₁₃           Dizzy motion                        A response peculiar to Bird 49 in which the head vibrates
                                                  rapidly from side to side. It was apparently related to,
                                                  and alternated with, Pecking (R₇).
R₁₄           Pecking window wall                 Pecking movements directed at the door with the observation
                                                  window in it.
R₁₅           Head to magazine                    The bird turns its head toward the magazine.
R₁₆           Locomotion                          The bird walks about in no particular direction.


of response-independent procedures after the
ones reported here, lasting for a total in
excess of 90 sessions. A curious idiosyncratic
head movement accompanied pecking
by Bird 49 (R₁₃: Dizzy motion) on
response-independent FI, although it disappeared
following the switch to VI. The locus of Pecking
on the magazine wall differed from bird
to bird and varied both across and within
sessions for some birds. The stable features
of this response were its topography and its
restriction to the general area of the
magazine wall. A variety of interim activities
occupied the early parts of the FIs and the
period within 2 or 3 seconds after food on
the VI schedule: Pecking floor (R₃), ¼
circles (R₄), Flapping wings (R₅), Moving
along magazine wall (R₈), and Beak to
ceiling (R₁₀) were the most frequent. The
interim activities were therefore more
variable from bird to bird than was the terminal
response.
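
The relative-frequency measure plotted in Figure 1 can be illustrated with a short sketch. The data layout is an assumption made for illustration only: each interfood interval is represented as a list of sets, one set per second of postfood time, holding the labels of the activities observed in that second. The function name and the sample data are hypothetical. The denominator for each second is taken to be the number of intervals that lasted at least that long, which matters for the VI schedule, where intervals differ in length.

```python
from collections import defaultdict


def activity_probabilities(intervals):
    """Relative frequency of each activity in each second of postfood time.

    intervals: list of interfood intervals; element s of an interval is the
    set of activities observed during second s + 1 after food delivery.
    Returns {activity: [frequency in second 1, frequency in second 2, ...]}.
    """
    n_seconds = max(len(interval) for interval in intervals)
    counts = defaultdict(lambda: [0] * n_seconds)
    reached = [0] * n_seconds  # intervals that lasted at least this long
    for interval in intervals:
        for second, activities in enumerate(interval):
            reached[second] += 1
            for activity in activities:
                counts[activity][second] += 1
    return {
        activity: [c / n for c, n in zip(per_second, reached)]
        for activity, per_second in counts.items()
    }


# Two hypothetical 3-second intervals: an interim activity (R4) early,
# the terminal response (R7) late.
sample = [
    [{"R4"}, {"R7"}, {"R7"}],
    [{"R4"}, {"R4"}, {"R7"}],
]
probabilities = activity_probabilities(sample)
```

Because the activity sets may hold more than one label, the sketch accommodates the one stated exception to mutual exclusivity (facing the magazine wall while pecking, for example).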
The pattern of behavior characteristic of
each interval was little affected by whether
or not food was dependent on key pecking.
The similarity between the patterns during
response-dependent and response-independent
FI is particularly striking for Bird 47
who was switched directly from the
response-independent to the response-dependent
procedure. Bird 31, for whom the
response-independent and response-dependent
FIs were separated by response-independent
VI, shows more variation in the interim
activities under the two conditions. Birds
91 and 40, who were exposed only to the
response-dependent FI, show a similar
pattern of interim activities to the
response-independent birds, although again there is
some variation as to details. This similarity
cannot be attributed to the procedure used
to shape key pecking (see Procedure). After
the imposition of the key-pecking
contingency, Bird 31 retained the old Pecking
response as an interim activity restricted to a
period in the interval just before key
pecking. Bird 47 showed a similar effect of his
response-independent experience in that a
high proportion of his key pecks failed to
depress the key sufficiently to activate the
automatic recording circuitry, although they
were recorded as key pecks by the observer.
Overall, these data provide no evidence for
substantial changes in the pattern of terminal
or interim activities traceable to the
imposition of a key-pecking requirement.
The general pattern of terminal and
interim activities during the
response-independent VI schedule was similar to the FI
procedures. Differences were restriction of
the interim activities to the first 2 or 3
seconds of postfood time rather than to the first
6 or 7 seconds (with one exception, to be
discussed), and the smaller number of interim
activities, in most cases. Under both fixed
and variable procedures, once the terminal
response began in an interval, it continued
until food delivery. The exception is Bird
[illegible], who showed a drop in the probability of
the terminal response (Pecking, R₇)
accompanied by a transient increase in the interim
activity of ¼ circles (R₄) at the 14-15-second
postfood time. A similar slight drop
in the probability of Pecking, although not
accompanied by an interim activity, was also
shown by Bird 31. These differences are
related to the properties of the VI schedule
(see Discussion).


Sequential Structure


Figure 2 indicates something of the
sequential structure of the behavior occupying

each interfood interval for each bird during 
the response-independent procedures. The 
figure summarizes the two to five behavior 
sequences that account for most of the inter- 
vals during the three steady-state sessions. 
For simplicity, no account is taken either of 
interbehavior times or of the duration of 
each activity in this method of representa- 
tion. The most striking characteristics of 
these sequences are: (a) that each bird 
showed only a small number of typical se- 
quences (usually three or four); (b) that 
the sequencing was very rigid, so that al- 
though a given behavior might fail to occur 
during a particular interval, it never occur- 
red out of sequence—this is indicated by the 
absence of return arrows (“loops”) in 
these diagrams ; and (c) that the variability 
of the sequences was greatest early in the 
interval and least at the end, in the period 
just preceding food delivery—this is indi- 
cated by the absence of “forks” (ambiguous 
transitions from one behavior to two or more 
others, as in the diagram for Bird 31 where 
R,—R, or R, —^ R; with approximately 
equal probability) late in the sequence of be- 
haviors shown in the diagrams. This reg- 
ular sequencing did not occur early in train- 
ing, as indicated in Figure 4, discussed 
below. 

An inviting possibility raised by these reg- 
ular sequences is that this behavior may be 
described by some kind of Markov chain 
(cf. Cane, 1961). Although the argument 
cannot be presented in full here, this assump- 
tion cannot be sustained for a number of 
reasons, the most important of which are 
(a) that the duration of a bout of a given 
activity was shorter the later the activity 
began within an interval, and (b) that the 
time between two successive activities was 
shorter the later in the interval the first 
activity ended. These and other considera- 
tions suggest that postfood time was the 
most important factor controlling both the 
onset and offset of each activity in the sequence.
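The transition probabilities shown in sequence diagrams of this kind, and the sort of check that argues against a simple Markov description, can be sketched as follows. This is a minimal illustration only: the behavior labels, the example intervals, and the bout timings are hypothetical and are not data from the experiment, and the function names are introduced here for illustration.

```python
from collections import Counter, defaultdict


def transition_probabilities(sequences):
    """Estimate first-order transition probabilities from behavior sequences.

    Each sequence lists the behaviors seen in one interfood interval,
    in order of occurrence, ending with "F" (food delivery).
    """
    counts = defaultdict(Counter)
    for seq in sequences:
        for current, following in zip(seq, seq[1:]):
            counts[current][following] += 1
    return {
        current: {nxt: n / sum(followers.values()) for nxt, n in followers.items()}
        for current, followers in counts.items()
    }


def mean_bout_duration_by_onset(bouts, cutoff):
    """Mean bout duration for bouts starting before vs. at or after `cutoff`.

    `bouts` is a list of (onset_time, duration) pairs in seconds.
    Assumes at least one bout falls on each side of the cutoff.
    """
    early = [duration for onset, duration in bouts if onset < cutoff]
    late = [duration for onset, duration in bouts if onset >= cutoff]
    return sum(early) / len(early), sum(late) / len(late)


# Hypothetical intervals (illustrative labels, not the birds' records).
intervals = [
    ["R1", "R5", "R7", "F"],
    ["R1", "R5", "R7", "F"],
    ["R1", "R4", "R7", "F"],
    ["R1", "R7", "F"],
]
probs = transition_probabilities(intervals)

# Hypothetical bouts of one activity: (onset in seconds, duration in seconds).
bouts = [(1.0, 3.0), (2.0, 2.5), (5.0, 1.5), (6.0, 1.0)]
early_mean, late_mean = mean_bout_duration_by_onset(bouts, cutoff=4.0)
```

Under a time-homogeneous Markov chain the mean bout duration should not depend on when in the interval the bout began; a systematic shortening of late bouts, as in the made-up numbers here, is the kind of pattern the text cites against that assumption.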


Acquisition 
Figure 3 shows acquisition data for naive Bird 49, through and
beyond the period when his behavior became stable on
response-independent FI. The graphs show (as a function of
sessions) the probability of the various behaviors in each of
the six 2-second periods making up the interval. The most
noteworthy characteristic of acquisition is the relatively
sudden disappearance of the behavior Head in magazine, which
was almost the only behavior to occur during the first few
sessions for Bird 49, in favor of the terminal response Pecking
on magazine wall. For Bird 49 this transition took place
between the seventh and eighth sessions of FI without any prior
history of pecking during earlier sessions. Once pecking became
established as a terminal response, the only further change was
a slow decline in the probability of the response during early
parts of the interval (e.g., between 3 and 8 seconds).

[Figure 2. Sequential relationships among behaviors during the
last three sessions for the four pigeons. Numbers give
probabilities of the indicated transitions; “F” is food
delivery; probabilities at a “fork” do not always sum to 1.0.]

The other birds showed similar results on their first exposure
to the response-independent procedures; Pecking on the magazine
wall
first occurred during the following sessions: for Bird 31, on
the first FI session; Bird 29, twelfth session of FI following
the switch from response-independent VI; Bird 47, twenty-eighth
session of VI; Bird 49, eighth session of FI. Thus, although
one of the experienced birds pecked during the first
response-independent session (31), the other did not until
being switched to a different schedule (29). No bird shifted to
a different terminal response once he began pecking; the total
number of sessions of response-independent experience (FI and
VI) for each of the four birds following the onset of pecking
was as follows: Bird 31, 140; Bird 29, 96; Bird 47, 45; Bird
49, 66. For Bird 29, the terminal response during the VI
procedure (i.e., before pecking) was Head in magazine. As with
the other birds, once pecking appeared it was in full strength
almost immediately, and further experience had the effect
merely of restricting it more to the later parts of the
interval.

Fig. 4. Sequence data, in the same form as Figure 2, for the
first three sessions under the first response-independent
procedure to which each bird was exposed. (Details as in
Figure 2.)

Figure 4 shows sequence data, in the same form as Figure 2, for
the first three sessions of the first response-independent
schedule to which each bird was exposed. Approximately the same
proportion of the total number of sequences observed within
intervals is accounted for by these diagrams as in the diagrams
of Figure 2 for the last three sessions. The data for the
first three sessions are much more variable, 
showing repetitions of a behavior within an 
interval and reversals of sequence; three 
out of four birds show no single terminal 
response. Thus, the regularities apparent 
in Figure 2 are evidently a real effect of 
training and not simply an artifact of this 
method of representation. 


Discussion 


The results of this experiment confirm the 
suggestion that the “superstition” situation 
generally produces two distinct kinds of ac- 
tivity: interim activities that occur at short 
and intermediate postfood times, and the 
terminal response that begins later in the
interval and continues until food delivery. It
is not always clear from Skinner's original 
discussion just which kind of behavior he 
was observing. In one case, he briefly de- 
scribes an experiment in which the interfood 
interval was 60 seconds, and the “superstitious”
response (a “well-defined hopping
step from the right to the left foot”) was
automatically recorded. In this case, the 
behavior was evidently a terminal response, 


since it occurred with increasing frequency 
through the interval: 


The bird does not respond immediately after eating, but when 10
or 15 or even 20 sec. have elapsed it begins to respond rapidly
and continues until the reinforcement is received [Skinner,
1961, p. 406].


On other occasions, however, Skinner may have been observing
interim activities, as, in our experience, they are sometimes
much more striking than the terminal response, especially early
in training when they may include actions like jumping in the
air, vigorous wing flapping, and idiosyncratic head and limb
movements. We have also sometimes observed interim activities
during sessions when there was no obvious terminal response.


Nature of the Terminal Response 


The data from both FI and VI schedules of food presentation
indicate that the probability of the terminal response at
different postfood times was a function of the probability of
food delivery at those times: the probability of the terminal
response increased just in advance of increases in the
probability of food delivery, and decreased just after
decreases in that probability.
Thus, on the FI schedule, the terminal re- 
sponse began at postfood times greater than 
about 8 seconds, corresponding to a prob- 
ability of food delivery of zero at times less 
than 12 seconds, and one thereafter. On 
the VI schedule, probability of food delivery 
was zero at postfood times less than 3 sec- 
onds (the shortest interfood interval in the 
VI sequence), and increased thereafter. 
Correspondingly, the birds began their ter- 
minal response after 2 seconds or so of post- 
food time and continued until food delivery. 
The data for Birds 29 and 47 on the VI 
schedule provide additional confirming evi- 
dence. Both birds showed a small decline 
in the probability of the terminal response 
between Seconds 12 and 17 of postfood time, 
accompanied by a brief reappearance of an 
interim activity (between Seconds 13 and 
16) in the case of Bird 47. This brief de- 
crease in the terminal response, and accom- 
panying reappearance of an interim activity, 
corresponds to the zero probability of food 
in the region of postfood time between 13 
and 21 seconds (because the VI sequence 
used had no interfood interval between 13 
and 21 seconds in length—see Method). 
Catania and Reynolds (1968) showed a 
similar relationship between rate of key 
pecking and probability of reinforcement, as 
a function of postreinforcement time, on a 
variety of conventional (response-depen- 
dent) VI reinforcement schedules. This 
similarity and the similarities between the 
response-independent and response-depen- 
dent conditions of this experiment emphasize 
that the terminal response must be regarded 
as a discriminated operant in Skinner's 
(1958) sense. 
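The relation between postfood time and the probability of food delivery described above can be made concrete with a short calculation. The sketch below is an assumption-laden illustration, not the authors' computation: it treats the probability of food at time t as the proportion of scheduled intervals ending at t among those that have not yet ended, and the VI interval list is hypothetical, chosen only to share the two properties stated in the text (shortest interval of 3 seconds, no interval between 13 and 21 seconds).

```python
def food_probability(intervals, t):
    """Conditional probability that food arrives at postfood time t (seconds),
    given that it has not yet arrived in the current interval.

    `intervals` is the list of scheduled interfood intervals, each assumed
    equally likely. Returns 0.0 once every scheduled interval has ended.
    """
    still_waiting = sum(1 for x in intervals if x >= t)
    delivered_now = sum(1 for x in intervals if x == t)
    return delivered_now / still_waiting if still_waiting else 0.0


fi_schedule = [12]                    # fixed 12-second interfood interval
vi_schedule = [3, 5, 8, 13, 21, 30]   # hypothetical VI sequence (assumption)

fi_profile = {t: food_probability(fi_schedule, t) for t in range(1, 13)}
vi_profile = {t: food_probability(vi_schedule, t) for t in range(1, 31)}
```

With these made-up values the probability is zero before 3 seconds, rises at each scheduled interval, and falls back to zero between 13 and 21 seconds, which is the region where the text reports a dip in the terminal response.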

Herrnstein (1966) reported an experi- 
ment in which the rate of key pecking on 
response-independent FI 11 seconds was 
lower than on response-dependent FI 11. 
However, in the present experiment, con- 
sidering just the terminal response of Peck- 
ing on magazine wall (R;), we found no 
evidence for a difference in either probability
or rate of response favoring the response-dependent
procedure. Indeed, rate of key
pecking (defined as switch operations) 
under the response-dependent FI was gen- 
erally lower than the observer-defined rate 
of pecking under the response-independent 
FI because many pecks failed to break the 
switch contact. On the other hand, because 
the location of pecking was much more 
variable under the response-independent 
condition, a comparison between key peck- 
ing (in the response-dependent condition) 
with pecking on a comparable area of the 
magazine wall (in the response-independent 
condition) would show more responding 
under the response-dependent condition. 
This result raises the possibility that the 
effect of the response-dependency, in 
operant conditioning experiments using in- 
terval reinforcement schedules, may be 
largely one of determining (perhaps imper- 
fectly) the location of pecking, rather than 
either its form or its frequency of occurrence. 


Nature of the Interim Activities 


Falk (1969, 1970) has coined the term 
adjunctive behavior for a variety of activities 
that are induced in a number of different 
animal species (rats, pigeons, monkeys, 
chimpanzees) by intermittent schedules of 
reinforcement. These activities generally 
occur just after reinforcement, when the reinforced (i.e.,
terminal) response is not occurring. However, the important
variable in determining their temporal location seems to be the
low probability of further reinforcement in the immediate
postreinforcement period (rather than the interruption of
eating), since they also occur following brief stimulus
presentations on second-order interval schedules (Rosenblith,
1970), following each response on spaced-responding
(differential reinforcement of low rates) schedules (Segal &
Holloway, 1963), during time-out periods (Wüttke, 1970), only
after the last pellet when a number are delivered consecutively
at the end of each FI (Keehn, 1970), and later in the interval
during long FI schedules (Segal, Oden, & Deadwyler, 1965). They
are not elicited (e.g., by frustration) in any obvious sense,
since they take some time to develop (Reynierse & Spanier,
1968). They are related to motivational systems, since a number
of activities, for example polydipsia (excessive drinking),
pica (eating nonfood material), wheel running, and
schedule-induced aggression, may occur more or less
interchangeably depending on the presence of appropriate
stimuli; and animals will learn an operant response in order to
obtain an opportunity to engage in one of these activities
(Falk, 1970).

The interim activities in the present experiment appear to
reflect the same causal factors as adjunctive behavior: they
occur at times when reinforcement is not available and the
terminal response is not occurring; they occur on both
response-dependent and response-independent schedules (cf.
Azrin, Hutchinson, & Hake, 1966; Burks, 1970; Flory, 1969); and
they occur on both FI and VI schedules. Adjunctive behavior
requires appropriate stimuli (water for polydipsia, wood
shavings for pica, etc.) to which it is directed and by which
it can be modified, within limits. This is unlikely to be a
crucial difference, however, since appropriate stimuli were not
available in the present experiment (had water been available,
schedule-induced drinking would almost certainly have been
observed in lieu of the interim activities; cf. Shanab &
Peterson, 1969), and behavior resembling interim activities has
occasionally been reported under conditions when most animals
show adjunctive behavior, thus:


[this rat] was atypical in that it did not develop polydipsia,
exhibiting instead a number of stereotyped behaviors, like
rearing and running to the corners of the experimental chamber,
between reinforcements [Keehn, 1970, pp. 164-167].


We return to a theoretical account of interim and adjunctive
behavior in the concluding section.

An analogy can be drawn between the terminal and interim
activities here and the classical dichotomy between
consummatory and appetitive behavior (Craig, 1918). Thus,
pecking, the stable terminal response, is a food-elicited
(consummatory) activity in pigeons, and the interim activities
were quite variable, as might be expected of appetitive
behavior. Moreover, the variability of the sequences (as
measured by the number of “forks” in the sequence) was greatest
at the beginning of the sequence and decreased toward the end
(cf. Figure 2). More or less unlearned sequences terminating in
consummatory acts show a similar reduction in variability
toward the end of the sequence (e.g., Morris, 1958).

In a similar vein, Falk (1970) has compared adjunctive behavior
with displacement activities (Tinbergen, 1952):


In both adjunctive behavior and displacement activity
situations, the interruption of a consummatory behavior in an
intensely motivated animal induces the occurrence of another
behavior immediately following the interruption [p. 305].

At the present stage of knowledge, these comparisons do little
more than group together a number of puzzling phenomena that
cannot as yet be convincingly explained, either by ethological
principles or by the Law of Effect. We can choose either to
accept these behaviors as anomalies within our present
conceptual system, hoping that further research will show how
to reconcile them, or revise the system in a way that will
accommodate them more naturally. The first alternative is
becoming increasingly hard to maintain, as it becomes clear
that these behaviors are of wide occurrence, and as they
continue to resist attempts to explain them in conventional
terms. Polydipsia is the most widely studied adjunctive
behavior, and Falk (1969) summarizes research results as
follows:


It is not explicable in terms of any altered state of water
balance initiated by the experimental conditions. It cannot be
attributed to adventitious reinforcing effects. Since it cannot
be related to abnormal water losses, chronic internal
stimulation from unusual states . . . or from injury to the
central nervous system, the overdrinking would be classified
clinically as primary or psychogenic polydipsia [p. 587].


A revision of the conceptual foundations of operant behavior,
which will deal naturally with these behaviors, as well as
guide research into more profitable channels, seems called for.

Development of the Terminal Response


The development of the terminal response provides some clues
toward an alternative conception. Changes in the terminal
response throughout acquisition rule out an unqualified
application of the Law of Effect as a description of the
process. For example, three of the four birds made the response
Head in magazine at a higher frequency and for a larger
fraction of the total time than any other response, for the
first few sessions of exposure to the superstition procedure
(either FI or VI, depending on the bird). This behavior was
also the one most often contiguous with the delivery of food.
Yet during later sessions, it dropped out abruptly and was
replaced by Pecking. Thus the development of Pecking as
terminal response resembles the findings of Williams and
Williams and of Breland and Breland, much more than it does the
strengthening of an emitted response, in Skinner's terms, by
the automatic action of food as a reinforcer. In all these
cases, the presentation of food at a predictable time resulted,
after training, in the regular occurrence of a food-related
behavior in anticipation of food delivery, quite independently
of the demand characteristics of the situation (i.e., the
reinforcement schedule). Given that food is a stimulus that
elicits pecking by pigeons, the appearance of pecking, in
anticipation of food, in the present experiment is an instance
of the principle of stimulus substitution (Hilgard & Marquis,
1940). The stimulus is, of course, a temporal one (postfood
time), and the situation is analogous, therefore, to Pavlovian
temporal conditioning (Pavlov, 1927), both operationally and in
its conformity to the substitution principle. Therefore, it is
tempting to attribute the appearance of pecking to a classical
conditioning mechanism, and leave it at that.

A number of considerations suggest that this explanation will
not suffice, however: (a) The behavior of one bird (29) is a
partial exception. He showed a terminal response (Head in
magazine) different from Pecking, although this response was
metastable (Staddon, 1965), in the sense that it was displaced
by Pecking following a schedule shift. In addition, Skinner's
results and informal observations in a number of laboratories
indicate that the superstition procedure may generate behaviors
other than pecking that persist for considerable periods of
time. Presumably, many of these behaviors are terminal
responses, in our sense, and, metastable or not, they cannot be
dismissed in favor of a Pavlovian account of all terminal
responses in superstition situations. (b) The results of
Rachlin (1969), who was able to obtain spontaneous key pecking
by means of the auto-shaping procedure of Brown and Jenkins,
but using electric shock-reduction rather than food as the
reinforcer, also complicate the picture, since it is not clear
that pecking is elicited by shock, as it is by food. The
results of Sidman and Fletcher (1968), who were able to
auto-shape key pushing in rhesus monkeys, are also not readily
explicable by stimulus substitution. (c) Williams and Williams
(1969) note that the directed nature of the key peck in the
auto-shaping situation is not readily accommodated within a
Pavlovian framework:

the directed quality of the induced pecking does 
not follow naturally from respondent principles 
(see also Brown and Jenkins, 1968). It is un- 
clear, for example, why pecking would be directed 
at the key rather than the feeder, or indeed why 
it would be directed anywhere at all [p. 519]. 


A similar objection can be raised here, since 
the terminal response of pecking was always 
directed at the magazine wall, although it is 
important to notice that this objection is 
more damaging to an interpretation in terms 
of classical conditioning than one couched 
simply in terms of stimulus substitution. 
(d) Finally, there is the problem of the skeletal nature of
pecking. Ever since Skinner's (1938) original suggestion, it
has become increasingly common to restrict the concept of
classical conditioning to autonomically mediated responses.
Some criticisms of this convention are presented below, and
Staddon (1970a) has argued against the operant/respondent and
emitted/elicited dichotomies. For the moment, let it be said
that the objection to the skeletal nature of the response is
only a problem for an interpretation of the development of
pecking in terms of classical conditioning, as traditionally
conceived. It does not conflict with a stimulus substitution
interpretation.

In summary, it is clear that while the principle of stimulus
substitution describes the development of the terminal response
in a majority of cases, and is more successful than an appeal
to Pavlovian principles modeled on the salivation reference
experiment, it is not adequate as a universal account. We turn
now to the possibility of a general scheme to deal with these
anomalous facts.


EVOLUTION AND LEARNING 


Objectively considered, the subject mat- 
ter of the psychology of animal learning— 
the behavior of animals—is a part of biology. 
This commonality does not extend to terms 
and concepts, however. Although the 
ethologists have investigated unlearned be- 
haviors in a variety of species, learning re- 
mains almost exclusively the possession of 
psychologists. Consequently, the theoretical 
foundations of the study of learning, such as 
they are, have evolved almost independently 
of biology. It is now 15 years since Ver- 
planck (1955) wrote: “the structure of the 
theory of unlearned behavior and that of 
learned behavior must prove to be similar 
if not identical [p. 140]," but little progress 
toward a unified set of concepts has been 
made. Yet the facts discussed in the pre- 
vious section seem to demand an interpreta- 
tion within the context of adaptive behavior 
as a whole. 

The principles of evolution by natural se- 
lection provide a unifying framework for 
biology.  Pfaffman (1970) has recently 
commented : 


I am impressed by the extent to which evolution, 
the genetic machinery, and biochemistry provide 
for biologists a common language and unity of 
theory that overrides the molecular versus orga- 
nismic debate within biology. In contrast, it is 
obvious that there is no unified theory of behavior 
for all students of behavior [p. 438]. 


These considerations—the commonality of 
subject matter between biology and animal 
psychology, the probable common basis of 
learned and unlearned behavior, the unifying 
role of evolutionary processes in biology— 
all suggest the application of evolutionary 
principles to the psychology of learning in 
animals. In recent years, several papers have drawn attention
to the similarities between evolution and learning (e.g.,
Breland & Breland, 1966; Broadbent, 1961; Gilbert, 1970;
Herrnstein, 1964; Pringle, 1951; Skinner, 1966a, 1966b, 1969),
but no version of the evolutionary approach has proved
influential as yet. The main reasons for this failure, perhaps,
have been the comparative effectiveness of traditional versions
of the Law of Effect in dealing with the limited phenomena of
laboratory learning experiments and the lack of a substantial
body of facts clearly in conflict with accepted theory. To some
extent, of course, the second factor reflects the first,
although it would be cynical to speculate that the kind of
experiments psychologists do are often such as to preclude data
that might go beyond current theory. In any event, neither of
these reasons now holds true.

The growing number of facts on auto-shaping, instinctive drift,
adjunctive, and superstitious behaviors are not readily
accommodated within traditional views. In addition, the history
of the Law of Effect as a principle of acquisition (as opposed
to the steady state, i.e., asymptote) has not been a
distinguished one. Little remains of the impressive edifice
erected by Hull and his followers on this base. We are no
longer concerned with the production of learning curves, nor
with the measurement of habit strength or reaction potential.
Hullian theory has proven effective neither in the elucidation
of complex cases nor as an aid to the discovery of new
phenomena. Indeed, the opposite has generally been true: new
learning phenomena such as schedules of reinforcement, learning
and reversal sets, etc. have typically been the result of
unaided curiosity rather than the hypothetico-deductive method,
as exemplified by Hull and his students. This represents a
failure of Hullian theory rather than a general indictment of
the hypothetico-deductive method, which has proven every bit as
powerful as Hull believed it to be, when used by others (e.g.,
Darwin; cf. Ghiselin, 1969). A similar, although less sweeping,
verdict must be handed down on stochastic learning theory,
which represents perhaps the most [...] attempt to translate
the Law of Effect into a quantitative principle of acquisition.
With
the exception of predictions about the steady 
state, such as Estes’ ingenious deduction of 
probability matching (Estes, 1957), the 
early promise of this approach as a way of 
understanding the behavior of individual 
animals has not been fulfilled. It seems fair 
to say that the rash of theoretical elaboration 
that has occupied the years since Thorndike 
first stated the Law of Effect has told us 
almost nothing more about the moment-by- 
moment behavior of a single organism in 
a learning situation. Although the alterna- 
tive we will offer may be no better than 
earlier views, it does accommodate the 
anomalies we have discussed—and the need 
for some alternative can hardly be ques- 
tioned. 

The close analogy between the Law of 
Effect and evolution suggests an approach 
that may be a step toward the general frame- 
work which the psychology of learning so 
obviously lacks. At the present stage of 
knowledge, this approach is simply an anal- 
ogy, although a compelling one, and cannot 
yet be called an evolutionary theory of learn- 
ing. However, it provides a beginning and 
may lead to a truly evolutionary account 
which can securely imbed the study of learn- 
ing in the broader field of biology. 


Behavioral Variation and Reinforcement 


The Law of Effect, suitably modified to take account of
advances since Thorndike's original formulation, can be stated
so as to emphasize acquisition, or the steady state, and
learning (e.g., S-R bonds), or performance. We will take as a
point of departure a neutral version of the law that emphasizes
performance in the steady state, as follows: “If, in a given
situation, a positive correlation be imposed between some
aspect of an animal's behavior and the delivery of
reinforcement, that behavior will generally come to predominate
in that situation.” The term “correlation” is intended to
include cases of delay of reinforcement and experiments in
which the response acts on the rate of reinforcement directly
(e.g., Herrnstein & Hineline, 1966; Keehn, 1970). This
formulation does not take account of more complex situations
where more than one behavior and more than one correlation are
involved (i.e., choice situations), as these complications do
not affect the present argument. A discussion of three aspects
of this law follows: (a) the initial behavior in the situation
before reinforcement is introduced, (b) the process whereby
this behavior is transformed into the dominant reinforced
behavior, and (c) reinforcement.

(a) The behavior in a situation before 
the occurrence of reinforcement reflects a 
number of factors, including past experience 
in similar situations (transfer), motivation, 
stimulus factors (e.g., novel or sign stimuli), 
and others. We propose the label “prin- 
ciples of behavioral variation” for all such 
factors that originate behavior. These prin- 
ciples are analogous to Darwin's laws of 
Variation, corresponding to the modern laws 
of heredity and ontogeny that provide the 
phenotypes on which selection (analogous 
to the principles of reinforcement, see be- 
low) can act. Thus, the term “variation” 
is intended to denote not mere variability, 
but the organized production of novelty, in 
the Darwinian sense. 

(b) Transition from initial behavior to 
final behavior is the traditional problem of 
learning theory and, as we have seen, is 
essentially unsolved at the level of the indi- 
vidual organism. However, it is at this 
point that the analogy to the mechanism of 
natural selection becomes most apparent. 
Broadbent (1961) makes the parallel quite 
clear: 


Since individual animals differ, and those with 
useful characteristics [with respect to a particular 
niche] survive and pass them on to their children, 
we can explain the delicate adjustment of each 
animal's shape to its surroundings without requir- 
ing a conscious purpose on the part of the Polar 
bear to grow a white coat. Equally each indi- 
vidual animal [under the Law of Effect] tries 
various actions and those become more common 
which are followed by consummatory acts [i.e., 
reinforcement] [p. 56]. 


Thus, the transition from initial to final behavior can be
viewed as the outcome of two processes: a process that
generates behavior, and a process that selects (i.e.,
selectively eliminates) from the behavior so produced. Since
there is no reason to suppose that the process which generates
behavior following the first and subsequent reinforcements is
different in kind from the process that generated the initial
behavior (although the effects will generally be different, see
Extinction, below), we can include both under the head of
principles of behavioral variation. We propose the label
“principles of reinforcement” for the second, selective
process.
(c) As we have seen, the Darwinian 
principle of selection is analogous to the 
process that transforms initial behavior into 
final behavior—the “principles of reinforce- 
ment.” The notion of reinforcement (more 
exactly, the schedule of reinforcement) itself 
is in fact analogous to an earlier concept, one 
that preceded evolution by natural selection 
and can be derived from it: the Law of Con- 
ditions of Existence, that is, the fact that 
organisms are adapted to a particular niche. 
This is apparent in the form of the state- 
ments: “The Polar bear has a white coat 
because it is adaptive in his environment.” 
“The pigeon pecks the key because he is 
reinforced for doing so.” It is important 
to emphasize this distinction between rein- 
forcement and principles of reinforcement, 
and the analogous distinction between adap- 
tation to a niche and the process of selection 
by which adaptation comes about. In the 
case of adaptation, the process of selection 
involves differential reproduction, either 
through absence of a mate, infertility, or 
death before reproductive maturity. In the 
case of reinforcement, the distinction is less 
obvious, since the principles of reinforce- 
ment refer to the laws by means of which 
behaviors that fail to yield reinforcement are 
eliminated, rather than the simple fact that 
reinforced behaviors generally predominate 
at the expense of unreinforced behaviors, 
Thus, both evolution and learning can be 
regarded as the outcome of two independent 
processes: a process of variation that gener- 
ates either phenotypes, in the case of evolu- 
tion, or behavior, in the case of learning ; 
and a process of selection that acts within 
the limits set by the first process. In both 
cases, the actual outcome of the total proc- 
ess is related to, but not identical with, the 
material acted upon: phenotypes reproduce 
more or less successfully, but a gene pool is 
the outcome of selection; similarly, behaviors
are more or less highly correlated with rein-
forcement, but learning (i.e., an alteration in
memory) results.
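
The two-process scheme can be made concrete with a toy simulation. The sketch below is illustrative only and is not a model proposed in this paper: the behavior names, the food probabilities, and the 5% weakening step are all assumptions chosen for the example. Variation is represented by emitting behaviors in proportion to their current strength; selection is represented as purely eliminative, so that a behavior is weakened when it is not followed by food and is never strengthened.

```python
import random


def run_learning(n_steps=2000, seed=0):
    """Toy variation-plus-selection loop (all names and numbers are hypothetical)."""
    rng = random.Random(seed)
    # Outcome of variation: initial strengths of the candidate behaviors.
    strength = {"peck": 1.0, "turn": 1.0, "preen": 1.0}
    # Assumed environment: probability that each behavior is followed by food.
    p_food = {"peck": 0.9, "turn": 0.2, "preen": 0.1}
    names = list(strength)
    for _ in range(n_steps):
        # Variation: emit one behavior in proportion to current strength.
        weights = [strength[name] for name in names]
        emitted = rng.choices(names, weights=weights)[0]
        # Selection: an unreinforced emission weakens the behavior;
        # a reinforced emission leaves its strength unchanged.
        if rng.random() >= p_food[emitted]:
            strength[emitted] *= 0.95
    return strength


if __name__ == "__main__":
    for name, value in run_learning().items():
        print(name, value)
```

Under these assumed values the behavior most highly correlated with food ends with the greatest strength, although no strength value ever rises above its starting level. Predominance here is produced by elimination alone.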

The three aspects of this process—varia- 
tion, selection, and adaptation—have re- 
ceived differing emphases at different times, 
depending on the prevailing state of knowl- 
edge. Before Darwin, adaptation, in the 
form of the Law of Conditions of Existence, 
was emphasized, since the only explanation 
for it—the design of the Creator—was not
scientifically fruitful. Following Darwin,
selection, both natural and artificial, re-
ceived increased attention, in the absence of
firm knowledge of the mechanism of varia-
tion (i.e., inheritance). With the advent
of Mendelian genetics, variation has been 
most intensively studied (this is one aspect 
of the “molecular vs. organismic” debate re-
ferred to by Pfaffman).

In terms of this development, the study 
of learning is at a relatively primitive level,
since the Law of Effect, although a great ad- 
vance over the level of understanding which 
preceded it, simply represents the identifica- 
tion of environmental events—reinforcers— 
with respect to which behavior is adaptive. 
Lacking is a clear understanding of both se- 
lection (the principles of reinforcement) 
and variation (the principles of variation). 

Space precludes exhaustive elaboration 
of all the implications of the classificatory 
scheme we are suggesting. However, in 
order to provide some context for our 
account of the facts already discussed, it 
seems essential to briefly summarize some 
possible candidates for principles of varia- 
tion and reinforcement. It should be ob-
vious that current knowledge does not per-
mit the categories used to be either exhaus-
tive or clear-cut. 

Principles of behavioral variation. 1.
Transfer processes: One of the main sources
of behavior in a new situation is obviously
past experience in similar situations. Trans-
fer has been most exhaustively studied
under the restricted conditions of verbal
learning (e.g., Tulving & Madigan, 1970),
but the principles of memory thus de-
rived—proactive and retroactive interfer-
ence, primacy and recency, retrieval factors,
etc.—are presumably of general applicability.
With few exceptions (e.g., Gonzales,
Behrend, & Bitterman, 1967), these prin- 
ciples have been little used to interpret 
animal learning experiments. Other prin- 
ciples of transfer are stimulus and response 
generalization (induction) and what might 
be called “compositional transfer,” in which
several past experiences are combined to
generate a novel behavior, as in insight learn-
ing and other forms of subjective organiza-
tion of past input.

2. Stimulus substitution: This principle, 
which is usually identified with Pavlovian
conditioning, has already been discussed as 
a description of the origin of the terminal 
response of Pecking. It may also describe 
the origin of the metastable terminal re- 
sponse Head in magazine, since this response 
is also elicited by food under these condi- 
tions, and the animal which showed this re- 
sponse most persistently (Bird 29) had had 
considerable experimental experience. The 
difference in the persistence of Head in mag- 
azine in the case of this bird, as compared 
with others, cannot be explained in this way 
and might reflect a difference in other trans- 
fer processes. The final dominance of Peck- 
ing, in every case, may reflect a special 
susceptibility of consummatory responses to
the stimulus substitution principle, or the ac- 
tion of other transfer principles in this par- 
ticular situation in ways that are presently 
unclear. The indefinite persistence of key 
pecking following only three peck-contingent 
reinforcements, found by Neuringer
(1970b), tends to support the simpler con-
clusion, as do the data of Wolin (1968),
who reports a similarity between the to- 
pography of operant pecking for food or 
water reinforcers, and the appropriate un- 
conditioned response. We return later to a 
general discussion of this principle in rela- 
tion to these data (pp. 33-34). 

3. Preparatory responses: This principle
is also frequently associated with classical 
conditioning, and in that sense is related 
to, and to some extent overlaps with, the 
stimulus substitution principle. Thus, some
conditioned responses, such as salivation, can 
be equally well described by either principle. 
Others, also respondents (such as heart rate, 
which increases following electric shock, but 
usually decreases in anticipation of it [Zea-
man & Smith, 1965]) may be classified as
preparatory responses. Skeletal responses
observed in classical conditioning situations 
are often preparatory in nature (see discus- 
sion of classical conditioning, below). 

4. Syntactic constraints: There are often 
sequential constraints among behaviors, so 
that a given behavior is determined by some 
property of the sequence of preceding be-
haviors. Examples are spontaneous position
alternation observed in rats and other ro-
dents, stimulus alternation observed in mon-
keys on learning set problems (e.g., Levine,
1965), and sequential dependencies observed
in most species which cause responses to
occur in runs rather than alternating ran-
domly (e.g., position habits and other per-
severative errors). Human language pro-
vides the most developed example of syntac-
tic constraints.

5. Orienting responses: This category in- 
cludes all those transient behaviors, such as 
exploration, play, curiosity, etc., that expose 
the organism to new stimuli and provide 
the possibility of transfer to future situations. 

6. Situation-specific and species-typical 
responses: Certain situations seem to call 
forth specific responses, which are often 
typical of the species rather than the indi- 
vidual and do not seem to depend in any 
obvious way on any of the other principles 
of variation such as transfer, etc. Examples 
are the species-specific defense reactions dis- 
cussed by Bolles (1970), which occur in 
fear-producing situations, the tendency to 
peck bright objects shown by many birds 
(Breland & Breland, 1966), and the digging 
shown by small rodents (Fantino & Cole, 
1968). Other examples are given by Glick- 
man and Sroges (1966). 

Principles of reinforcement. Before the
discovery of the mechanism of inheritance, 
evolution could be explained only in terms 
of a goal—adaptation—and a means suf- 
ficient to reach that goal—variation and 
selection; the generation-by-generation de-
tails of the process were obscure: “Our
ignorance of the laws of variation is pro-
found [Darwin, 1951, p. 170].” We are
at present equally ignorant of the mecha-
nisms of behavioral variation. Since learn-
ing must usually involve constant interplay
between variation and reinforcement, we are 
not yet in a position to suggest anything spe- 
cific about the moment-by-moment details of 
the process, either in its variational or se- 
lective aspects. However, just as Darwin 
was able to say something about selection 
by pointing out the adaptive role of various 
structures, so it is possible to learn some- 
thing about the selective role of reinforce- 
ment by looking at steady-state adaptations 
to fixed conditions of reinforcement, that is, 
reinforcement schedules. On this basis, we 
suggest the following tentative generaliza-
tions about the effects of reinforcement:

1. Reinforcement acts directly only on the 
terminal response; activities which occur at 
other times (interim activities, adjunctive 
behavior, etc.) must be accounted for in 
other ways, to be discussed later. This 
assertion is perhaps closer to a definition 
than an empirical generalization, since it is 
equivalent to the assertion that the terminal 
response may, in general, be identified as 
the activity occurring in closest proximity to 
reinforcement in the steady state. Identifi- 
cation is, of course, no problem in condition- 
ing situations that enforce a contingency be- 
tween some property of behavior and the 
delivery of reinforcement. However, we 
will show later that there is no empirical or 
logical basis for separating situations that 
do impose a contingency between response 
and reinforcement from those that do not 
(see Classical Conditioning, below). 

2. Reinforcement acts only to eliminate 
behaviors that are less directly correlated 
with reinforcement than others. This gener- 
alization, like the first, is also more like a 
definition, since all that is observed (under
consistent conditions of reinforcement) is 
the eventual predominance of one behavior 
over others—which is consistent with either 
a suppressive or a strengthening effect. As 
Skinner (1966a) points out in a summary 
of the Law of Effect: 


Thorndike was closer to the principle of natural 
selection than the [usual] statement of his law. 
He did not need to say that a response which had 
been followed by a certain kind of consequence 
was more likely to occur again but simply that 
it was not less likely. It eventually held the field 
because responses which failed to have such 
effects tended, like less favored species, to dis-
appear [p. 13].

Unfortunately, Skinner, and most other be- 
haviorists, elected to follow Thorndike in 
considering reinforcement to have a positive, 
strengthening, or “stamping-in” effect. One 
of the main purposes of the present paper is 
to suggest that this decision was a mistake 
and has given rise to a number of problems 
and controversies that can be avoided if the 
effects of reinforcement are considered to be 
purely selective or suppressive. 

There are three kinds of argument which 
support a purely selective role for reinforce-
ment. The first, and most important, is that
the overall conceptual scheme which results 
is simpler and more easily related to bio- 
logical accounts of behavior than the alterna- 
tive. 

The second point is that situations such 
as extinction and shaping by successive ap- 
proximations that might seem to require an 
active role for reinforcement can be inter- 
preted in a way that does not require any- 
thing more than a selective effect. This 
point is discussed later (see Extinction, be- 
low). 

The third point is that the superstition 
and related experiments suggest that the re- 
sponse contingency imposed by most rein- 
forcement schedules is not essential for the 
production of some terminal response, but 
only for the selection of one response over 
others, or for directing a response which 
would probably predominate in any case— 
as in key pecking by pigeons. Our failure 
to find a consistent difference in rate of ob- 
server-defined pecking between the response- 
dependent and response-independent condi- 
tions of the present superstition experiment 
supports this view, as does a recent finding 
that the contingency between electric shock 
and responding is not necessary to the gener- 
ation of behavior maintained by intermittent 
shock delivery to squirrel monkeys (Hutch- 
inson, 1970; Stretch, personal communica- 
tion, 1970). 

The usual interpretation of the fact that 
there is a terminal response in the supersti-
tion situation, despite the absence of re-
sponse-contingency, is the notion of acci-
dental strengthening of a response by con-
tiguity with the delivery of reinforcement.
Here we consider the general implications of 
adventitious reinforcement as an explanation, 
and its incompatibility with the notion of 
reinforcement as selection. Other problems 
related to adventitious reinforcement are dis- 
cussed later (see Acquisition, Classical Con- 
ditioning). 

First, adventitious reinforcement implies 
failure of constancy, in the sense that the 
animal is presumed to be unable, because 
of the stamping-in mechanism of reinforce- 
ment, to distinguish between real and acci- 
dental correlations between his behavior and 
the occurrence of reinforcement. This is a 
strong assumption, in view of the adaptive 
utility of the constancy process and its 
ubiquity in perceptual and motor mecha- 
nisms. In perception, a similar failure to 
distinguish changes in sensory input that are 
produced by our own behavior from changes 
that are independent of behavior might cause 
us to perceive the world as rotating every 
time we turn our head. It is of course true 
that on the basis of one or a few instances 
the animal may not be in a position to be 
certain about the reality of a contingent 
relationship between his behavior and rein- 
forcement—and this kind of sampling limita- 
tion might account for a transient supersti- 
tious effect. It is less convincing as an 
account of a long-term effect. 

Second, if reinforcement is considered as 
purely selective, it cannot be invoked as an 
explanation of behavior when no imposed 
contingency exists between reinforcement 
and behavior (i.e., in the absence of selec-
tion). To do otherwise would be like taking
a population of white mice, breeding them 
for 20 generations without further selection 
for color, and then attributing the resulting 
white population to the results of “accidental 
selection.” In this case, as in the case of 
response-independent reinforcement, the out- 
come reflects a characteristic of the initial 
population (i.e., the mice gene pool, the
nature of the organism), and not a non- 
existent selection process. 

In short, the notion of adventitious rein- 
forcement is not a tenable one. The extent 
to which reinforcement can be invoked as 
an explanation for behavior is directly re-
lated both to the degree of imposed con-
tingency between response and reinforce- 
ment and to the opportunities for that con- 
tingency to have some selective effect (i.e., 
the number of contingent reinforcements). 
If there is no contingency, or if few con- 
tingent reinforcements have occurred, the 
resulting behavior must owe more to prin- 
ciples of variation than to the selective action 
of reinforcement. On the other hand, if 
many contingent reinforcements have been 
delivered during a protracted period of shap- 
ing, the final form of behavior is obviously 
much less dependent on particular principles 
of variation, and the role of reinforcement 
(selection) may properly be emphasized.

3. There is considerable evidence for the
generality of a principle implicating rela- 
tive rate or proximity of reinforcement as 
the fundamental independent variable deter- 
mining the spatial and temporal location of 
responding in steady-state conditioning 
situations. Thus, Herrnstein (1970) has 
recently reviewed a number of operant con- 
ditioning experiments involving differential 
reinforcement of simultaneous (concurrent 
schedules) and successive (multiple sched- 
ules) choices which support the idea of rela- 
tive reinforcement rate as the independent 
variable most directly related to the rate of 
key pecking in pigeons. Shimp (1969) has 
presented an analysis which is formally dif- 
ferent from Herrnstein’s, but which also 
implicates differences in reinforcement rate 
as the crucial variable. An extensive series 
of experiments by Catania and Reynolds 
(1968) suggests a similar (although less 
exact) relationship between rate of pecking 
and relative temporal density of reinforce-
ment on interval reinforcement schedules.
Jenkins (1970) summarizes a series of stud- 
ies with a discrete-trials procedure that 
led him to suggest relative proximity to re- 
inforcement as an important determiner of 
the tendency to respond in the presence of 
a stimulus. Staddon (1970a) has suggested 
a similar principle to account for positive 
“goal gradients” that underlie the effects of
reinforcement omission on a variety of inter- 
val schedules.

4. The concept of reinforcement implies a
capacity to be reinforced. The fact that a
given stimulus may be reinforcing at one
time, but not at another, requires the idea 
of a state corresponding to each class of 
reinforcers. The independent variable of 
which the strength of most of these states is 
a function is deprivation with respect to the 
appropriate class of reinforcers (food de- 
privation for the hunger state, water for 
thirst, etc.). This will not do for most nega-
tive reinforcers (e.g., the removal of electric
shock), however, since there is no obvious 
counterpart to deprivation in this case. It 
is also unlikely that deprivation is the only
independent variable sufficient to alter the 
strength of states associated with positive 
reinforcement. For example, evidence is 
discussed later in favor of reciprocal inhibi- 
tory interaction between states as a possible 
factor in polydipsia and other adjunctive be- 
havior. Interactions of this sort may also 
alter the strength of states for which there 
is no deprivation requirement, as in audio 
anaesthesia (Licklider, 1961). 

Thus, one may hope for a set of principles 
of reinforcement that will deal both with 
the proper classification of states and with 
the interactions among them. 
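
The relative-rate principle of generalization 3 above can be illustrated in its simplest form, the matching relation usually associated with Herrnstein's analysis, R1/(R1 + R2) = r1/(r1 + r2), where R denotes response rate and r reinforcement rate. The sketch below assumes strict matching purely for illustration; as noted above, some of the cited studies report a less exact relationship. The function name and the example rates are hypothetical.

```python
def predicted_response_shares(reinforcement_rates):
    """Share of responding predicted for each alternative under strict matching.

    reinforcement_rates: reinforcements per hour obtained on each alternative.
    Strict matching is an assumption of this sketch, not an empirical claim.
    """
    total = sum(reinforcement_rates)
    if total <= 0:
        raise ValueError("at least one alternative must be reinforced")
    return [rate / total for rate in reinforcement_rates]


if __name__ == "__main__":
    # Hypothetical concurrent schedule: 40 and 20 reinforcements per hour.
    print(predicted_response_shares([40, 20]))
```

On this assumption, an alternative yielding two thirds of the reinforcements is predicted to receive two thirds of the responses, whatever the absolute rates.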

The theoretical vocabulary of learning is 
full of terms with an uneasy conceptual sta- 
tus somewhere between explanation, defini- 
tion, and category label. This terminology, 
which is not coherent or internally consist- 
ent, makes it difficult to approach particular 
topics with an open mind. It is too easy 
to dismiss an experimental result as due to 
adventitious reinforcement or respondent 
conditioning without, in fact, having any 
clear understanding of what has been said. 
Simply defining everything operationally is 
of little help in this situation, since a set 
of definitions is not a theory. And a theory,
in the sense of a system of concepts that is 
internally consistent and coherent, is what 
is required if we are to be sure, in particular 
cases, whether we really understand a phe- 
nomenon—or are merely substituting one 
mystery for another, with the assistance of 
an opaque vocabulary.

What we are proposing is too primitive to 
be called a theory in this sense. However,
it does offer a system of classification that 
makes it difficult to have the illusion of 
understanding a phenomenon if compre-
hension is really lacking. In the following
section, some implications of this scheme are 
shown in three major areas: acquisition, ex- 
tinction, and classical conditioning. This is 
followed by a brief discussion of possible 
difficulties of this approach. With the aid 
of this groundwork, it will then be easier to 
return to a general account of the supersti- 
tion and related experiments in the conclud- 
ing section. 

Acquisition 

The number of trials necessary for learn- 
ing is one of those perennial problems that 
seems to defy resolution. Appeal to data is 
not conclusive because learning curves are 
sometimes incremental and sometimes step-
like. Even in particular cases, theory is not
conclusive either, since with sufficient in- 
genuity, theoretical accounts of both kinds 
of curve may be constructed on the basis 
of either one (or a few) trial learning 
assumptions or incremental assumptions in- 
volving thresholds. We turn now to the 
possibility that the issue is a consequence 
of the stamping-in view of reinforcement 
and becomes less urgent once that view is 
challenged. 

A comment by Skinner (1953) on the 
necessary and sufficient conditions for the 
development of superstition provides an 
illustration : 


In superstitious operant behavior . . . the process 
of conditioning has miscarried. Conditioning 
offers tremendous advantages in equipping the 
organism with behavior which is effective in a
novel environment, but there appears to be no
way of preventing the acquisition of non-advanta-
geous behavior through accident. Curiously, this
difficulty must have increased as the process of
conditioning was accelerated in the course of
evolution. If, for example, three reinforcements
were always required in order to change the
probability of a response, superstitious behavior
would be unlikely. It is only because organisms
have reached the point at which a single con-
tingency makes a substantial change that they
are vulnerable to coincidences [pp. 86-87].


Even within the framework of the stamp-
ing-in view, it is clear that the truth of this
statement depends on a tacit assumption
that responses will not generally occur more
than once unless followed by reinforcement.
If a given response can be relied on to occur
at least 20 times in succession, even without 
reinforcement, then 3-trial, or even 10-trial 
learning might well be sufficient to insure its 
acquisition under the conditions of the super- 
stition experiment. The assumption that 
responses will occur only once in the absence 
of reinforcement is a strong assumption 
about syntactic constraints, in our terminol- 
ogy. Moreover, it is contradicted by the re- 
sults of the Williams and Williams study 
(1969), which show indefinite persistence 
of pecking in the absence of any contiguous 
relationship between pecks and reinforce- 
ment. There is no reason to suppose that a 
similar persistence is not characteristic of 
other behaviors (e.g., position habits), al- 
though pecking may be more persistent than 
most. Thus, the finding of superstitious 
terminal responses, or of indefinite pecking 
following just three response-contingent re-
inforcements (Neuringer, 1970b), need
imply nothing about the number of trials 
necessary for learning. 

These considerations suggest, as a mini- 
mum, the need to take variation into account 
in discussions of the "speed of condition- 
ing," since rapid acquisition may either re- 
flect an unpersistent response that is really 
learned rapidly, or a very persistent one 
that may be learned quite slowly. No in- 
ferences about "speed of conditioning" can 
be drawn solely on the basis of speed of 
acquisition without information about the 
frequency and pattern of a given behavior 
to be expected in a given situation (which 
may include predictable delivery of rein- 
forcement) in the absence of contiguity be- 
tween that behavior and reinforcement. In 
practice, since information of the required 
sort is rarely, if ever, available, it seems 
wise to defer the issue of speed of learning 
until behavioral variation has been much 
more thoroughly studied.* 

4 Problems of this sort are not solved by re-
ferring to a hypothetical “operant level” because
(a) this level is often zero in the absence of a
history of reinforcement in the situation; (b) it is
rarely constant, as the term level implies; and
(c) the problem of the origin of this level is
thereby simply evaded.

Thus, the moment-by-moment details con-
cerning the effect of reinforcement remain
uncertain until much more is known about
variation. In the meantime it seems more
parsimonious and less likely to lead to fruit-
less controversies about “speed of condition-
ing,” continuity versus noncontinuity, etc.,
to assume that the appearance of one be-
havior, rather than another, at a certain
time or place, rather than some other time
or place, always requires explanation in
terms of principles of variation, with only
the disappearance of behaviors being attrib-
utable to the effects of reinforcement.

This general approach is not novel. It
resembles both Harlow's (1959) account of 
learning-set acquisition in terms of the pro- 
gressive elimination of error factors, and 
certain versions of stimulus-sampling theory 
(Neimark & Estes, 1967). In Harlow's 
terms, as in ours, one-trial acquisition is a 
phenomenon that depends on the existence 
of factors that make the correct behavior 
much more probable (and persistent) than 
others (i.e., upon principles of variation).
In the learning-set case, these factors are 
embodied in the prior training procedure, 
which progressively selects for an initially 
weak behavior (the “win stay, lose shift”
strategy) at the expense of the initially 
much stronger tendencies to approach par- 
ticular stimuli. A lengthy process may not 
be essential, however, for principles of varia- 
tion involving insight (“compositional trans- 
fer,” see above) may serve the same func- 
tion, if they are available to the animal. 
The important point is the shift of emphasis 
away from the supposed efficacy of some 
stamping-in mechanism, the action of which 
must remain obscure in the absence of 
knowledge about variation, to the principles 
of variation that determine the strength of 
behaviors in advance of contiguity with re-
inforcement.*

5 Memory has not been separately discussed in
this account of acquisition because it is embodied
in most of the variational and selective processes
we have described. The argument of the present
section suggests that a separate account of mem-
ory may have to await advances in our knowledge
of these processes.

Extinction

Extinction is often used as a test for
“what is learned” during a training pro-
cedure, as in generalization testing (Gutt-
man & Kalish, 1956), and testing for con- 
trol by temporal factors (Ferster & Skin- 
ner, 1957; Staddon, 1970b). Under these 
conditions, it is assumed that behavior is 
determined almost entirely by transfer from 
the base-line condition. Providing the dif- 
ference between the extinction and training 
conditions is not too great, either in terms 
of environmental factors (the stimulus situa- 
tion is not too different) or temporal factors 
(the extinction is not prolonged), this 
assumption can be justified by the reliability 
and predictability of the behavior usually 
observed. 

When these conditions are not satisfied 
or when the training preceding extinction 
has not been protracted, this reliability is not 
usually found. On the contrary, extinction 
under these conditions is usually associated 
with an increase in the variability of be- 
havior (Antonitis, 1951; Millenson & Hur- 
witz, 1961). This increase in variability is 
exactly what would be expected if, as we 
have suggested, reinforcement has a purely 
selective effect: in these terms, training 
involves a progressive reduction in vari- 
ability under the selective action of rein- 
forcement (centripetal selection, see be- 
low), so that absence of reinforcement 
(extinction) represents a relaxation of se- 
lection—with an attendant rise in vari- 
ability. We turn now to a brief account 
of the effects of changes in the amount and 
direction of selection in evolution, which may 
shed some further light on the properties of 
behavioral extinction. 
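
The expected rise in variability when selection is relaxed can be illustrated with a toy population model. The sketch is an assumption-laden illustration, not a model from this paper: each variant is a single number, variation is a Gaussian perturbation on every step, and centripetal selection is represented by keeping only the half of the population closest to an optimum of zero. The function name, population size, and step counts are hypothetical, and the sketch does not represent the delayed onset of variability discussed later in this section.

```python
import random
import statistics


def variability_after(training_steps, extinction_steps, seed=0, pop_size=200):
    """Return (spread at end of training, spread at end of extinction)."""
    rng = random.Random(seed)
    population = [rng.gauss(0.0, 1.0) for _ in range(pop_size)]

    def step(pop, select):
        # Variation: every variant is perturbed a little on each step.
        varied = [x + rng.gauss(0.0, 1.0) for x in pop]
        if not select:
            return varied  # relaxed selection: nothing is weeded out
        # Centripetal selection: keep the half closest to the optimum (0),
        # then refill the population by copying survivors.
        survivors = sorted(varied, key=abs)[: len(varied) // 2]
        return [rng.choice(survivors) for _ in varied]

    for _ in range(training_steps):
        population = step(population, select=True)
    trained_spread = statistics.pstdev(population)

    for _ in range(extinction_steps):
        population = step(population, select=False)
    return trained_spread, statistics.pstdev(population)


if __name__ == "__main__":
    print(variability_after(training_steps=50, extinction_steps=30))
```

In this toy model the spread stays small while extremes are being removed and grows once removal stops, which is the qualitative pattern the selective view predicts for extinction.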

Darwin (1896) comments on the effect 
of domestication as follows: 


From a remote period to the present day, under 
climates and circumstances as different as it is 
possible to conceive, organic beings of all kinds, 
when domesticated or cultivated, have varied. . . . 
These facts, and innumerable others which could 
be added, indicate that a change of almost any 
kind in the conditions of life suffices to cause 
variability . . . [Vol. 2, p. 243].


Although Darwin sometimes (erroneously) 
interpreted this observation as reflecting a 
direct effect of changed conditions on the 
reproductive system, it can be interpreted 
in modern terms as due to a relaxation of
selection. This is clear from the concept of
centripetal selection (Haldane, 1959; Mayr, 
1963; Simpson, 1953), which refers to the 
fact that selection under unchanging condi- 
tions, if long continued, acts to weed out ex- 
tremes, rather than systematically to shift 
population characteristics in any particular 
direction:


When adaptation is keeping up, selection at any 
one time will be mainly in favor of the existing 
type. . . . In such cases, the intensity of selec- 
tion tends to affect not the rate of change but the 
amount of variation [Simpson, 1953, p. 147]. 


Thus, a change in conditions will generally 
involve a shift away from centripetal selec- 
tion, with its tendency to reduce variability, 
and will often lead, therefore, to increased 
variability. The most obvious example of 
the effects of relaxation of selection in evolu-
tion is degenerating or vestigial structures
that are no longer being selected for:


It is so commonly true that degenerating struc- 
tures are highly variable that this may be advanced 
as an empirical evolutionary generalization [Simp- 
son, 1953, p. 75]. 

We have already noted that the onset of 
variability in extinction is often delayed. A 
similar delay in the effect of changed condi-
tions is often apparent in evolution. Darwin
(1896) notes:


We have good grounds for believing that the in- 
fluence of changed conditions accumulates, so that 
no effect is produced on a species until it has been 
exposed during several generations to continued 
cultivation or domestication. Universal experi- 
ence shows us that when new flowers are first 
introduced into our gardens they do not vary; 
but ultimately all, with the rarest exceptions, vary 
to a greater or less extent [Vol. 2, p. 249]. 


Similar delays have also been reported in 
experiments on artificial selection (Mayr,
1963). These delays seem to reflect what
has been termed “genetic inertia” or “gene-
tic homeostasis” (Mayr, 1963), that is, the
tendency for a gene pool which is the result
of a long period of consistent selection to
resist changes in the direction of selection.
A similar mechanism in behavior might
account for the dependence of variability in
extinction on the duration of the preceding
training period, which was referred to ear-
lier: The amount of variability might be ex-
pected to be greater and its onset sooner
following a brief training period than after 
one of longer duration. Genetic homeostasis 
also seems to be involved in the phenomenon 
of reversion, to be discussed next. 

Not all the variation which occurs either 
in behavioral extinction, or following a 
change in the conditions of life in evolution, 
is wholly novel. A relatively common effect, 
for example, is the reappearance of what 
Darwin terms “ancestral types,” that is, 
phenotypes which predominated earlier in 
phylogeny but which have been selected 
against more recently. This is the phenome- 
non of reversion which, because of his 
ignorance concerning heredity, Darwin 
(1896) found among the most mysterious 
of evolutionary processes: 

But on the doctrine of reversion the germ 
[germ plasm] becomes a far more marvellous ob- 
ject, for, besides the visible changes which it under- 
goes [i.e., phenotypic expressions], we must believe 
that it is crowded with invisible characters, proper 
to . . . ancestors separated by hundreds or even 
thousands of generations from the present time: 
and these characters, like those written on paper 
with invisible ink, lie ready to be evolved whenever 
the organisation is disturbed by certain known 
or unknown conditions [Vol. 2, pp. 35-36].

Thus, one effect of a relaxation of selection
is a more or less transient increase in the
relative influence of the distant past at the
expense of the immediate past. In be-
havioral extinction, this should involve the
reappearance of old (in the sense of pre-
viously extinguished) behavior patterns;
that is, transfer from conditions preceding
the training condition at the expense of
transfer from the training condition.* In
both cases, evolution and behavior, the effect 
of the change in conditions may be expected 
to depend on variables such as the magni- 
tude of the change and the time since the 
preceding change. 

⁸ Other than clinical accounts of regression, we
have been able to find only one published report of
this effect—in an account describing shaping por-
poises to show novel behaviors (Pryor, Haag, &
O'Reilly, 1969). However, we have frequently
observed it while shaping pigeons: if a pigeon has
been trained in the past to perform a variety of
responses, the increase in variability during ex-
tinction of the most recently reinforced response
generally includes the reappearance of earlier re-
sponses.


The analogy from Darwin suggests that 
any considerable change in conditions should 
increase variability, yet a change in rein- 
forcement schedules that includes an in- 
crease in reinforcement rate is not usually 
thought of as producing an increase in vari- 
ability. This apparent contradiction is re- 
solved by noting that an increase in rate 
of reinforcement, in addition to changing 
conditions, also increases the rate of selec- 
tion (since the analogy assumes reinforce- 
ment to have a purely selective effect). 
Thus, variability may be briefly increased, 
but since the rapidity of selection is also
increased, the net effect may be small. An 
analogous (but impossible) phenomenon in 
evolution would be to decrease the time be- 
tween generations at the same time that con- 
ditions are changed. This would speed up 
the attainment of a new equilibrium and 
minimize the increase in variability generally 
associated with changed environment. 

The increase in variability due to extinc- 
tion is most directly put to use in the process 
of shaping by successive approximations. 
Frequently, following the first few reinforce- 
ments delivered during a "shaping" session, 
the effect is simply an increase in the range 
and vigor of behavior. This change can be 
viewed as being due to the interruption of 
eating (cf. Mandler, 1964), however, rather 
than any direct strengthening effect of rein- 
forcement (which we are questioning in any 
case). In terms of the foregoing analysis, 
the conditions following the first reinforce- 
ment should be optimal for an increase in 
variability: the change is large (from con- 
tinuous eating to absence of food) and the 
training procedure is of short duration (the 
34-second eating bout), so that time since 
the preceding change is also short. As food 
continues to be delivered intermittently, se-
lection occurs and variability decreases.

We have been suggesting a purely selec-
tive (rather than strengthening, stamping-
in, or energizing) role for reinforcement.
The present discussion suggests that such an 
essentially passive role is compatible with a 
number of phenomena—extinction, the acti- 
vating effects of isolated reinforcements — 
that may appear to demand a more active
role for reinforcement. This compatibility
was established by drawing attention to
similar phenomena in evolution, where the
purely selective effect of the conditions of
life (analogous to the schedule of reinforce-
ment) is unquestioned. However, no nec-
essary identity between the genetic mecha-
nism, which is responsible for the effect of
changed conditions on variability in struc- 
ture, and whatever process is responsible 
for analogous effects in behavior, is intended. 
Any process for the production of variation
that incorporates some latent memory of
past adaptations is likely to show similar
effects.


Classical Conditioning


Our scheme has strong implications for 
the distinction between classical (Pavlovian,
respondent) and instrumental (operant) 
conditioning, to the extent that the distinc- 
tion goes beyond procedural differences. 
Classical conditioning is often thought of as 
a paradigmatic instance of the primary proc- 
ess of learning: "The [learning] process ap- 
pears to be based entirely on temporal con- 
tiguity and to have classical conditioning 
as its behavioral prototype [Sheffield, 1965, 
p. 321]." The salivation "reference experi-
ment" can be interpreted as prototypical in
at least two ways that are not always kept 
separate. The first (which has some simi- 
larities to our position) is referred to by 
Sheffield—the notion that learning depends 
solely on temporal relationships. Guthrie's 
aphorism that the animal "learns what he 
does" is a related idea. It is not easy to 
find a definitive account of this position, 
but it may perhaps be summarized by saying 
that reinforcement or reward is simply nec- 
essary to ensure that some behavior occurs 
in a conditioning situation. Principles in-
volving temporal relationships (contiguity)
then ensure that whatever occurs will trans-
fer from one occasion to the next.

The second way in which classical con- 
ditioning is discussed as prototypical is in 
terms of the rule that relates the conditioned 
and unconditioned responses. Pavlov
(1927) emphasized stimulus substitution as
the distinctive property of the situation: the
response originally elicited only by the UCS
is later made to the CS. Subsequently, two
kinds of departure from this rule have been
pointed out: (a) Even in the salivation 
experiment, there are other readily identifi- 
able components of the conditioned response 
that do not fit the stimulus substitution rule. 
These preparatory responses (Zener, 1937) 
are largely, but not exclusively, skeletal 
(rather than autonomic). (b) Even in the 
case of salivation and other autonomic re-
sponses, the CR is rarely identical to the
UCR (i.e., a redintegrative response), so
that components of the UCR may be
missing from the CR. More serious are
differences in direction of change be- 
tween CR and UCR, which may not even 
be consistent across individuals, as in heart 
rate and respiratory conditioning (Martin 
& Levey, 1969; Upton, 1929; Zeaman & 
Smith, 1965). 

Partly because of problems involving pre- 
paratory responses, classical conditioning has 
increasingly been restricted to autonomically 
mediated responses. There were two bases 
for this restriction: the apparent difficulty of
conditioning skeletal responses by the opera-
tions of classical conditioning and according
to the stimulus substitution principle (cf.
Skinner, 1938, p. 115), and the supposed
impossibility of conditioning autonomic re-
sponses via the Law of Effect. This is clear
from Kimble's (1961) comment:


Obviously the common expression, "the conditioned
response," is misleading, and probably in impor-
tant ways. At the same time it should be recog-
nized that the behavior described by Zener [pre-
paratory responses] was almost certainly instru-
mentally, rather than classically, conditioned
[p. 54].

The force of this argument is lost once the
susceptibility of autonomic responses such
as salivation to operant conditioning is dem-
onstrated.

The foregoing facts are sufficient to show
the error of continuing to regard classical
conditioning as a unified process, much
less as an explanatory element in accounts
of operant conditioning. If classical con-
ditioning is a single process, then it must be
describable by principles of operation that
apply to every instance. As we have seen,
even stimulus substitution, the most general
such principle, fails to apply in every case.
Many, but not all, the anomalous cases are
skeletal responses—which suggested that 
perhaps the notion of a single process could 
be preserved by restricting the term to auto- 
nomic responses. The only independent 
basis for this is to segregate skeletal and 
autonomic responses on the grounds that 
operant conditioning of autonomic responses 
is impossible. Since this is now known to 
be false (Miller, 1969), the only remaining 
basis for excluding skeletal responses from 
the class of classically conditionable re- 
sponses is their failure to conform to the 
principle of stimulus substitution. But, in 
addition to being tautologous (classical con- 
ditioning simply becomes equivalent to learn- 
ing via stimulus substitution), this fails be- 
cause many autonomic responses do not con- 
form to this principle and, based on the 
work of Brown and Jenkins and of Williams 
and Williams, at least one skeletal response 
—pecking in pigeons—does obey it (other 
possibilities are leg flexion and eyeblink). 
Thus, the class of classically conditionable 
responses can be defined neither in terms of 
the neural mediating system (autonomic vs. 
skeletal) nor in terms of adherence to a par- 
ticular principle of learning. 

The only remaining feature common to 
all the situations labeled as classical condi- 
tioning is the procedure itself. Research in 
this area has tended to focus on the proper- 
ties of the temporal relationship between CS 
and UCS that are necessary and sufficient 
for the CS to acquire the power to elicit the 
conditioned response, and a consensus ap- 
pears to be emerging that the crucial factor 
is the extent to which the CS is a predictor 
of the UCS (Rescorla, 1967). However, 
the notion of predictiveness does not appear 
to differ from relative proximity (of the CS 
to the UCS, or of a stimulus to reinforce-
ment) which, as we have seen (Principle
of Reinforcement 3, above), is a factor of 
wide applicability in operant conditioning. 
Thus, for all practical purposes, classical 
conditioning may be defined operationally 
as a class of reinforcement schedules that 
involve presentation of reinforcement inde- 
pendently of the subject's behavior. 
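
As an illustrative aside, this operational definition can be made concrete by writing each schedule as a rule that maps a record of responses onto reinforcement times. The Python sketch below is a hypothetical formalization, not a procedure taken from any of the experiments cited; the function names, the use of whole-second times, and the particular parameter values are assumptions made only for the example. Under the fixed-time rule the response record is ignored, which is the sense in which the classical procedure presents reinforcement independently of behavior.

```python
def fixed_time(period):
    """Response-independent schedule: reinforce every `period` seconds."""
    def deliveries(response_times, session_length):
        # The response record is deliberately ignored.
        return list(range(period, session_length + 1, period))
    return deliveries


def fixed_ratio(n):
    """Response-dependent schedule: reinforce every n-th response."""
    def deliveries(response_times, session_length):
        responses = sorted(t for t in response_times if t <= session_length)
        return responses[n - 1::n]
    return deliveries


def fixed_interval(period):
    """Response-dependent schedule: reinforce the first response made at
    least `period` seconds after the previous reinforcement (or after the
    start of the session)."""
    def deliveries(response_times, session_length):
        delivered, available_at = [], period
        for t in sorted(response_times):
            if t > session_length:
                break
            if t >= available_at:
                delivered.append(t)
                available_at = t + period
        return delivered
    return deliveries


if __name__ == "__main__":
    busy = list(range(1, 61))   # hypothetical subject responding once per second
    idle = []                   # hypothetical subject that never responds
    for name, rule in [("fixed-time 15 s", fixed_time(15)),
                       ("fixed-ratio 10", fixed_ratio(10)),
                       ("fixed-interval 15 s", fixed_interval(15))]:
        print(name, rule(busy, 60), rule(idle, 60))
```

In this toy formalization the fixed-time rule returns the same reinforcement times for the responding and the non-responding subject, whereas the two response-dependent rules deliver nothing to the subject that never responds.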

We conclude, therefore, that the division 
of the field of learning into two classes— 
classical and instrumental conditioning—
each governed by separate sets of principles, 
has no basis in fact. As an alternative, we 
suggest an analysis based on the principles
of behavioral variation and reinforcement 
we have already discussed. In terms of this 
analysis, all adaptive behavior is subsumed 
under an expanded version of the Law of 
Effect, and a given situation is to be under- 
stood in terms of two factors: (a) the rein- 
forcement schedule, that is, the rule prescrib- 
ing the delivery of reinforcement, or, more 
generally, stimuli, in relation to the behavior 
of the organism, and (b) the nature of the 
response under consideration. In terms of 
such an analysis, the properties normally 
considered as distinctive of classical con- 
ditioning may, once attention is directed to 
the question, be seen as due in part to a
reinforcement schedule that happens to pre-
scribe no correlation between the delivery
of reinforcement and the subject's behavior, 
and in part to the special properties of re- 
sponses such as salivation (see Implications 
2 and 3, below). 

Implications. Several puzzles become 
clearer once classical and instrumental con- 
ditioning are no longer regarded as separate 
processes: 

1. In the earlier discussion of auto-shaped
pecking, Williams and Williams noted that 
"the directed quality of the induced pecking 
does not follow naturally from respondent 
principles." Since we question this frame- 
work as a general description of anything 
outside the salivation experiment, its inabil- 
ity to deal with this particular situation 
poses no problem. The appearance of key 
pecking in the auto-shaping and superstition 
situations is, of course, not fully understood. 
It may reflect a special susceptibility of con- 
summatory responses to the principle of 
stimulus substitution, as suggested earlier, 
or the action of transfer principles in ways 
that reflect something in common among 
the past histories of most pigeons. Simi- 
larly, little can be said about the directedness 
of pecking until the conditions under which 
this response is, and is not, directed have 
been more fully explored; although the re- 
sults so far (with pigeons and monkeys) 
suggest that skeletal responses may always 
be directed. Rather than attempt to explain
(or explain away) these characteristics, the 
best course seems to be simply to observe 
and classify these and other behaviors under 
a variety of response-independent and re- 
sponse-dependent schedules, in the hope that 
hypotheses can be devised that have more 
hope of generality than anything that can be 
inferred from the scanty data presently 
available. We return to these issues in the 
concluding section. 

2. There is little doubt that most auto- 
nomic responses are more easily conditioned 
by response-independent schedules than by 
response-dependent ones. However, this 
need not imply the existence of two different 
kinds of conditioning. It can as well be 
interpreted as reflecting the existence of 
internal controlling factors that are not 
amenable to the principles of reinforcement. 
Because of these fixed factors, these re- 
sponses are not as free to come under the 
control of external stimulus factors as are 
skeletal responses that perform no function 
in the internal economy of the organism. 
Moreover, this principle can be extended 
beyond autonomic responses to deal with 
any response that is strongly connected to 
any stimulus, internal or external. Thus, 
in a recent discussion of operant conditioning 
of drinking, Black (1970) writes: 


This discussion suggests that one dimension along 
which responses might be classified with respect 
to operant conditioning is the degree to which 
they are constrained from being changed by operant 
reinforcement by the properties of the neural sub- 
systems of which they are a part. The regulatory 
systems can vary from very simple reflexes, such 
as the knee-jerk, to complex instinctive ones, 
such as those involved in courtship. The main 
point is not the complexity of the subsystems but 
rather the extent to which they limit the condi- 
tions under which operant reinforcement will 
work [p. 267]. 


Segal (1970) has made a similar suggestion 
in a thoughtful discussion dealing with a
number of the points raised in the present 
paper. 

As an example, the difficulties associated 
with demonstrating operant conditioning of 
heart rate can be viewed as analogous to 
the problem of acquiring control over an 
operant response that is already under the
control of other variables, that is, such sched-
ules are really concurrent rather than simple
schedules. The heart rate problem is per- 
haps analogous to training an animal to 
alter his rate of bar pressing to receive food, 
while pressing the same bar is also necessary 
to obtain another reinforcer (such as oxygen 
or heat). Even without considering the 
problem of interactions among drives, one 
would not be surprised to find rather weak 
control by the food reinforcer. 

3. Williams (1965) recorded salivation 
in dogs while they were bar pressing for
food reinforcement on both fixed-interval 
and fixed-ratio reinforcement schedules. He 
found that the onset of salivation within each 
interreinforcement interval approximately 
coincided with the onset of bar pressing in 
the fixed-interval case, but began later than 
bar pressing on fixed-ratio. This interesting 
result is incompatible with an explanation 
of the operant response in terms of an under- 
lying classical conditioning process. How- 
ever, it may be understood in terms of the 
view we have been proposing by assuming 
that (a) the occurrence of each response 
(bar pressing or salivation) is separately 
and independently determined by the con- 
ditions of reinforcement peculiar to it, and 
(b) both responses tend to occur at times 
of greatest relative proximity to reinforce- 
ment (Principle of Reinforcement 3). In 
the fixed-interval case, these two assump- 
tions predict a similar time of occurrence 
for both behaviors, because time since the 
beginning of a trial is, for both responses,
the best predictor of reinforcement. How- 
ever, in the fixed-ratio case, it is apparent 
that no matter what determines the time 
of onset of bar pressing, once it has stabilized 
its onset provides a better predictor of rein- 
forcement than does trial time (since the 
fixed number of responses making up the
fixed ratio take an approximately fixed
time). Thus, at asymptote our two assump-
tions imply that salivation should be reliably
delayed with respect to bar pressing on fixed-
ratio, but not on fixed-interval, as Williams
reports.
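
The claim about which cue best predicts reinforcement under each schedule can be illustrated with a small simulation. The sketch below is not taken from Williams (1965): the 60-second interval, the 30-response ratio, the response rate, and the spread of press-onset times are all hypothetical values chosen only for illustration. It compares the trial-to-trial variability of the time to reinforcement measured from the start of the trial with the same quantity measured from the onset of bar pressing; the cue with the smaller spread is the better predictor.

```python
import random
import statistics


def simulate(schedule, n_trials=2000, seed=0):
    """Return (sd of reinforcement time measured from trial start,
               sd of reinforcement time measured from press onset)."""
    rng = random.Random(seed)
    from_start, from_onset = [], []
    for _ in range(n_trials):
        # Hypothetical: pressing begins somewhere between 10 and 40 s.
        onset = rng.uniform(10.0, 40.0)
        if schedule == "FI":
            # Hypothetical fixed interval of 60 s: the first press after
            # 60 s is reinforced, so reinforcement follows trial start by
            # 60 s plus a short lag, whenever pressing began.
            t_reinf = 60.0 + rng.uniform(0.0, 1.0)
        elif schedule == "FR":
            # Hypothetical fixed ratio of 30 responses at about one per
            # second: the run takes an approximately fixed time.
            t_reinf = onset + 30.0 + rng.gauss(0.0, 1.0)
        else:
            raise ValueError("schedule must be 'FI' or 'FR'")
        from_start.append(t_reinf)
        from_onset.append(t_reinf - onset)
    return statistics.pstdev(from_start), statistics.pstdev(from_onset)


if __name__ == "__main__":
    for schedule in ("FI", "FR"):
        sd_start, sd_onset = simulate(schedule)
        print(schedule, "sd from trial start:", round(sd_start, 2),
              "sd from press onset:", round(sd_onset, 2))
```

Under these assumptions trial time is the tighter predictor on the fixed-interval schedule and press onset is the tighter predictor on the fixed-ratio schedule, which is the asymmetry the two assumptions require.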

However, there is no reason to expect this
delay early in training, since the animal is
not in a position to learn the cue significance
of bar pressing until it has more or less
stabilized. This expectation is also con-
firmed by Williams (1965), who notes that
the delayed onset of salivation in the fixed-
ratio case "emerged only after repeated ex-
posure to the schedules [pp. 344-345]."

On the basis of a failure to find salivation 
preceding bar pressing on a spaced-respond- 
ing ("controlled latency") procedure, Wil- 
liams (1965) concludes that "the hypothesis 
that the two measures are independent may 
be rejected [p. 347]," which contradicts 
Assumption a, above. However, since both 
responses are assumed (Assumption b) to 
occur at times of greatest relative proximity
to reinforcement, and reinforcement cannot
occur before a bar press if it is always con- 
tingent upon bar pressing, there is no reason 
to expect salivation reliably to precede bar 
pressing under conditions where the com- 
mon reinforcement for both responses de- 
pends on bar pressing alone. Thus, the ap- 
parent asymmetry between salivation and 
bar pressing observed by Williams may 
simply reflect an asymmetry between the 
conditions of reinforcement for each re- 
sponse, and need not imply any fixed internal 
linkage between them. 

4. Recently, considerable attention has
been devoted to avoidance of particular foods
conditioned by a nauseous experience (in-
duced by insulin or X rays) taking place
several hours after ingestion (Garcia, Er-
vin, & Koelling, 1966; Kalat & Rozin,
1970; Revusky & Bedarf, 1967; Rozin,
1969). Since the CS-UCS interval in these
experiments is considerably longer than is 
customary in classical conditioning experi- 
ments, these data are even less congenial to 
a Pavlovian analysis than the auto-shaping 
results. They hint at the existence of a 
number of unsuspected built-in linkages be- 
tween response systems and various salient 
stimuli. Such linkages are not unexpected 
from a broad evolutionary point of view that
sees principles of variation and reinforce-
ment as behavioral characteristics that are
separately selected for, and bear as much
(or as little) relationship to one another as
the morphological characteristics.

Science is conservative and, quite cor-
rectly, resists most attempts to alter an
established theoretical framework. We have
already tried to show that the number of 
anomalies facing current learning theories 
is sufficient to justify a search for alterna- 
tives. Nevertheless, the radical appearance 
of the scheme we suggest is a substantial 
obstacle to its consideration. It is im-
portant, therefore, to point out that it is
little more than an extension and reorgan- 
ization of familiar concepts, that is, rein- 
forcement, S-R behavior units, learning 
principles such as transfer, and ethological 
observations on species-related behaviors. 
The difference is therefore largely one of 
emphasis and selection rather than the in- 
troduction of wholly novel ideas. 

Any discussion of evolution and learning 
naturally brings to mind the learning-instinct 
issue. There is no simple parallel between 
this dichotomy and anything in the scheme 
we propose. The origin of every behavior is 
supposed traceable to principles of variation; 
if, for example, in a particular case a prin- 
ciple of transfer is involved, one might want 
to say that the behavior is learned. However, 
the question must then simply be asked 
again about the previous situation from 
which transfer has supposedly occurred. In
this way, almost any question about the 
relative roles of heredity and environment 
will involve unraveling the whole of ontog- 
eny. This conclusion will not be unfamiliar 
to ethologists (cf. Beach, 1955). 

It is also important to emphasize that we 
have not been directly concerned with the 
evolution of the capacity to learn; although 
it may be that increasing knowledge of varia-
tion will shed light on this issue.

One objection that may be raised to
the proposed scheme is that it is derived 
from and deals explicitly only with positive 
reinforcement. However, a recent account 
of behavior sustained by negative reinforce-
ment (Bolles, 1970) is in perfect agreement
with our position. Bolles points out that
some activities are much more easily con-
ditioned than others in avoidance situations,
and these are the unconditioned activities
that normally occur in a variety of poten-
tially dangerous circumstances. These spe-
cies-specific defense reactions, in Bolles’
terminology, occur in advance of reinforce- 
ment (i.e., the avoidance of electric shock) 
—in our terms, they are determined initially 
by one of the principles of behavioral varia- 
tion. The lack of arbitrariness of the re- 
sponse is perhaps more obvious in avoidance 
than in any other situation because of the 
complexity of the schedules involved: the 
animal must usually learn something about 
the pattern of occurrence of an intermittent 
aversive event, in the absence of responding, 
before he is in a position to detect altera- 
tions in that pattern correlated with his 
own behavior. Although a similar situation 
prevails in all reinforcement schedules, the 
change to be discriminated seems consider- 
ably easier both in appetitive conditioning, 
where the shift is from zero reinforcement 
in the absence of responding to reinforce- 
ment following every response, and in es- 
cape, where it is from continuous presence 
of the aversive stimulus in the absence of 
responding to complete absence following 
each response. Bolles suggests other rea- 
sons, related to the limited opportunities for 
avoidance (in the schedule sense) in the 
wild life of small mammals, and thus the 
limited opportunities for the capacity to 
avoid to be selected in phylogeny. 

The strongest point in favor of our pro-
posal is its promise of parsimony. Conse-
quently, the most damaging criticism that
can be directed against it is the absence of
firm specification of the principles of rein-
forcement and variation. This appears to
allow the creation of such principles at will,
enabling us to explain everything—and
nothing. There are two defenses against
this criticism. First, we again emphasize
the tentative nature of the principles we
have suggested. The overlap among the
principles of variation, particularly, sug-
gests that our list is provisional. Second,
there is the strong possibility that clear
recognition of the distinction between varia-
tion and reinforcement may be essential to
further advance. In defense of this proposi-
tion, we first briefly discuss some examples
from synoptic accounts of current learning
theory, which show it to incorporate few
safeguards against multiple explanations for 
phenomena. Since our scheme of classifica- 
tion is at least internally consistent and 
forces one to relate each new principle of 
variation to others that already exist, it has 
some advantages in this respect. Second, 
we discuss the controversy between cogni- 
tive and behavioristic theorists regarding 
the role of structure in behavior, in relation 
to a similar controversy in the history of 
evolutionary thought. The persistence of 
this controversy and its amenability to anal- 
ysis in terms of variation and reinforcement 
suggest that our classification may be of 
some value despite its incompleteness. 

1. Hilgard and Marquis (1940) list as
principles of reinforcement: stimulus sub- 
stitution, expectancy, and the principle 
(law) of effect. It should be apparent from 
the earlier arguments that the Law of Effect 
is the result of the combined effect of both 
variation and reinforcement, stimulus sub- 
stitution is a principle of variation, and ex- 
pectancy refers to a general characteristic 
which can be imputed to most learning. 
Consequently this set of terms allows for 
considerable uncertainty in application to 
particular situations. For example, our 
analysis of the Williams and Williams ex- 
periment (see p. 33) makes use of both 
stimulus substitution and a principle of rein- 
forcement analogous to what Hilgard and 
Marquis mean by the Law of Effect. Yet 
the same situation could also be analyzed 
in terms of expectancy; and it appears to be
incompatible with the Law of Effect as 
traditionally understood. Progress since 
1940 has not been dramatic, as illustrated 
by a list of "elementary conditioning proc- 
esses" inventoried by Jenkins (1970) in 
connection with his work on cyclic reinforce-
ment schedules: generalization, delay of re-
inforcement, conditioned reinforcement, un-
conditioned effects of eating, frustration
effects, effects related to "behavioral con-
trast." Despite the number of these proc-
esses and the lack of any obvious relation-
ship among them, Jenkins finds that they
are unable to account for some rather sim-
ple features of his data, which require a
description in terms of the relative proximity
of a stimulus or a response to reinforcement
as a major determiner (see Principles of 
Reinforcement, above). 

2. There is a history of fruitless contro-
versy between many behaviorists, who place
little emphasis on the structural properties
of behavior, and students of cognitive proc- 
esses, who see structure as the most interest- 
ing and important behavioral attribute (cf. 
Staddon, 1967, 1969, for a discussion in re- 
lation to operant conditioning). The dis- 
tinction between variation and reinforcement 
can shed some light on this issue, which can 
be illustrated by briefly considering the 
contrasting views of Skinner and Chomsky 
(Chomsky, 1959; MacCorquodale, 1969) 
on the causation of learned behavior. 

Chomsky’s major concern is with prin- 
ciples of variation, in our terms, as is clear 
from his emphasis on the rule-governed 
nature of language (see Principle of Be- 
havioral Variation 4, above). 

Skinner’s position is less obvious, but 
becomes clear from his account of the shap-
ing of behavior (Skinner, 1953); he writes:


Operant conditioning shapes behavior as a sculp- 
tor shapes a lump of clay. Although at some point 
the sculptor seems to have produced an entirely 
novel object, we can always follow the process 
back to the original undifferentiated lump, and 
we can make the successive stages by which we 
return to this condition as small as we wish. At 
no point does anything emerge which is very dif- 
ferent from what preceded it. The final product 
seems to have a special unity or integrity of de- 
sign, but we cannot find a point at which this 
suddenly appears. In the same sense, an operant 
is not something which appears full grown in the 
behavior of the organism. It is the result of a 
continuous shaping process [p. 91]. 


For Skinner, apparently, moment-to-moment 
variation in behavior is small in magnitude, 
and essentially random (in the sense that 
it is unrelated to the final goal) in direc- 
tion. Behavior is the result of the “accumula- 
tion . . . of indefinite variations which have 
proved serviceable” in Darwin’s phrase. 
The similarity to natural selection is fur- 
ther emphasized by Darwin’s (1951) 
account of the evolution of complex struc- 
tures: 

If it could be demonstrated that any complex
organ existed, which could not possibly have been
formed by numerous, successive, slight modifica-
tions, my theory would absolutely break down
[p. 191].

In the history of evolution after Darwin, 
the rediscovery of Mendel’s laws led to a 
retreat from gradualism in favor of a salta-
tionism that traced evolutionary progress
(especially evolutionary novelty) to large 
changes (mutations) of a more purposive 
sort (cf. Mayr, 1960). This position is 
closer to the view of Chomsky and other 
cognitive theorists, who tend to stress the 
importance of insight and other rules of 
composition that can produce sudden jumps 
in behavior. 

The history of evolution has not supported 
the saltationist view. Fisher (1930) showed 
that large changes are much less likely to be 
adaptive than small ones, and Haldane and 
others have shown by a variety of arguments 
that the time available for evolution by the 
selection of small variations is more than 
sufficient to account for the observed dif- 
ferences among taxa: “The saltationism of 
the early Mendelians has been refuted in 
all its aspects [Mayr, 1960, p. 350]."

At a superficial level, therefore, these 
comparisons might appear to favor Skinner's 
gradualism and emphasis on reinforcement 
(selection), to the detriment of the cognitive 
position. This is probably unjustifiable for 
two main reasons. First, detailed analysis 
of complex problem solving clearly indicates 
the insufficiency of random variation as an 
account of the process (e.g., Neisser, 1967). 
The heuristics that are employed may, of 
course, be attributed to past learning based 
entirely on random variation. However,
this suggestion meets with quantitative dif- 
ficulties when applied to the development of 
language—the best-studied example of rule- 
governed behavior. Although calculations 
in this area are of limited validity in the 
absence of established principles of variation 
(analogous to Mendelian genetics), the 
attempts that have been made seem to indi- 
cate that the time available in ontogeny for 
the development of language is incompatible 
with any kind of learning by random varia-
tion (Chomsky, 1962; McNeill, 1968). This
negative result is the opposite of Haldane’s 
affirmative conclusion on the sufficiency of 
small mutations as a basis for phylogenetic 
changes. It suggests that neither the heuris-
tics employed in complex problem solving,
nor the rules of syntax, need to be built up 
entirely de novo during ontogeny. 

Second, the relationship between evolu- 
tion and learning is such as to allow greater 
flexibility to learning. This is because
natural selection can only be a response to 
small differences in "fitness": the most fit 
genotype will tend to prevail, although the 
species as a whole may thereby be led into 
an evolutionary blind alley. In terms of 
contemporary accounts of goal-directed 
mechanisms, natural selection represents a 
hill-climbing process (Minsky, 1961) and 
has no provision for prediction. This is 
clear in a familiar analogy due to Sewall 
Wright (1931) which shows the relation- 
ships among selection, structure, and varia- 
tion. He pictures the field of possible struc- 
tural variation as a landscape with hills and 
valleys. The range of variation present in 
a population of organisms is represented by 
a closed area on this landscape. Selection 
pressure is represented by the gradient (up- 
ward slope) of the landscape, so that each 
peak is an adaptive optimum for a given 
constellation of characters; valleys represent
unstable equilibria yielding so-called centrifugal
selection. If the area representing a
given species includes a single adaptive peak, 
selection will be centripetal, so that the 
species will tend to cluster more and more 
closely around the peak. Thus, small mutations
are more likely to lead to improvements
in fitness than large ones (with a limiting
probability of .5, as Fisher, 1930, has
shown), and the consequent predominant
role of such small differences in fitness in
the evolutionary process becomes obvious.

Learning is not so limited, however, because
the principles of variation can be
weighted to take account of regularities in
the past history of the species (these are
Skinner’s, 1966b, “phylogenic contingencies”);
that is, behavior need not occur at
random in advance of reinforcement, but can
reflect a priori probabilities that have been
selected for during phylogeny. Other more
complex strategies of this sort may also be
built up by natural selection, giving learning
a predictive capacity largely denied to
evolution itself. Thus, although learned behavior
reflects differences in reinforcement
rate, just as evolution reflects selection on 
the basis of relative fitness, it need not be 
generally true either that small changes in 
behavior are more likely to be adaptive than 
large ones, or that the direction of change 
is unrelated to the final goal—as Skinner's 
account implies, and as is usually (although 
not invariably) the case in evolution. How- 
ever, since the more elaborate principles of 
variation must themselves be built up step- 
by-step by natural selection, it is to be 
expected that the pattern and range of be- 
havioral variation must bear some relation- 
ship to the phylogenetic status of the orga- 
nism: “higher” organisms, such as man, are 
likely to have developed more complex prin- 
ciples of variation than “lower” organisms, 
such as the pigeon. 
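
The blind-alley property of a hill-climbing process can be illustrated with a short sketch. The landscape below is purely hypothetical (one dimension, two peaks of arbitrary height and position) and is not drawn from Wright (1931) or Minsky (1961); the function names and the step size are likewise assumptions made for illustration. A climber that accepts only small local improvements settles on whichever peak it happens to start near, even when a higher peak lies across a valley.

```python
import math


def fitness(x):
    """Hypothetical one-dimensional adaptive landscape with two peaks.

    A low peak sits near x = 1 and a higher peak near x = 4; the region
    between them is a valley of lower fitness.
    """
    low_peak = 1.0 * math.exp(-((x - 1.0) ** 2))
    high_peak = 2.0 * math.exp(-((x - 4.0) ** 2))
    return low_peak + high_peak


def hill_climb(start, step=0.05, max_iters=10_000):
    """Move to the fitter immediate neighbour; stop when neither is fitter."""
    x = start
    for _ in range(max_iters):
        best = max((x - step, x, x + step), key=fitness)
        if best == x:
            break  # no small change improves fitness: a local optimum
        x = best
    return x


# Starting near the low peak, the climber is trapped there.
trapped = hill_climb(start=0.0)
# Starting on the slope of the high peak, it reaches the higher optimum.
reached = hill_climb(start=3.0)

print(f"start 0.0 -> x = {trapped:.2f}, fitness = {fitness(trapped):.3f}")
print(f"start 3.0 -> x = {reached:.2f}, fitness = {fitness(reached):.3f}")
```

Because each step is evaluated only against its immediate neighbours, nothing in the procedure anticipates the higher peak; a process with weighted principles of variation would correspond, in this sketch, to a climber permitted to propose larger or directed jumps.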

Thus, the major focus of the argument 
between Skinner and Chomsky is not on 
the importance of reinforcement, but about 
the complexity of the principles of varia- 
tion that determine the nature of behavior 
in advance of reinforcement. Since Skin- 
ner derives his ideas from work on rats 
and pigeons, and Chomsky from the study 
of human language, there are considerable 
grounds for a disagreement, especially if 
its basis is not clearly perceived by either 
party. Clear conceptual separation of variation
from reinforcement makes this kind of
confusion much less likely.


CONCLUSION 


The argument so far has served to draw 
attention to a number of generalizations 
about steady-state conditioning situations: 

1. Most such situations involve some
times and stimuli associated with relatively
high reinforcement probability (e.g., the
period at the end of the interval on fixed-interval
schedules), and others associated
with relatively low reinforcement probability
(e.g., the period at the beginning of the
interval).

2. The terminal response (a discriminated
operant in Skinner's terminology) is restricted
to periods of relatively high reinforcement
probability. This distribution of
the terminal response with respect to time
and stimuli corresponds to a principle of
reinforcement that relates the strength of a 
response to the relative frequency, density, 
or proximity of reinforcement associated 
with that response (Catania & Reynolds, 
1968; Herrnstein, 1970; Jenkins, 1970). 

3. The type, as opposed to the temporal 
and stimulus location, of the terminal re- 
sponse in situations involving both response- 
dependent and response-independent rein- 
forcement is determined by the interaction 
between principles of variation (e.g., trans- 
fer, stimulus substitution) that describe the 
occurrence of the response, in advance of 
reinforcement, and principles of reinforce- 
ment that determine whether it will persist 
or not (selective function of reinforcement). 

4. Periods of low reinforcement prob- 
ability are generally associated with interim 
activities, resembling appetitive behavior. 
If appropriate stimuli (goal objects) are 
provided, stereotyped adjunctive behavior 
(e.g., polydipsia, pica; Falk, 1969) takes 
the place of the more variable and relatively 
undirected interim activities. 

5. Both terminal response and interim 
activities are more correctly labeled as pre- 
disposing conditions or states rather than be- 
haviors, since in the absence of response de- 
pendency, the type of activity falling into 
these categories is not fixed. Thus, drink- 
ing, wheel running, fighting, pecking, and a 
number of other activities may be either 
terminal or adjunctive behaviors, depending 
on historical and stimulus factors (Segal, 
1969b; Skinner, 1959; Skinner & Morse, 
1958). Directing factors for adjunctive 
behavior are the availability of appropriate 
goal objects (see above), and factors related 
to reinforcement that render some kinds of 
activity more probable than others: for 
example, polydipsia appears to partially displace
both adjunctive wheel-running (Segal,
1969a) and chewing-manipulatory behavior
(Freed & Hymowitz, 1969) in rats (in 
situations with food as terminal reinforcer), 
even when both supporting stimuli are con- 
currently available. 


The Terminal Response 


We have already discussed the probable 
role of principles of variation such as stimulus
substitution in describing the origin of
terminal responses such as Pecking and 
Head in magazine in the superstition situa- 
tion. Before turning to the more complex 
matter of the interim activities, a word 
should be said about the paradoxical results 
of the Williams and Williams (1969) study, 
in which they found persistent pecking at 
a brief stimulus, ending in reinforcement, 
despite the fact that a key peck terminated 
the stimulus and thus prevented reinforce- 
ment. In terms of our analysis, this situa- 
tion pits reinforcement and variation against 
one another; thus, (a) the predictable de- 
livery of food at the end of each presenta- 
tion of a brief key stimulus may be a suf- 
ficient condition for the occurrence of key 
pecking in that stimulus, by the principle 
of stimulus substitution. But, (b) because 
of the response-contingency, the occurrence 
of a peck turns off the stimulus, omitting re- 
inforcement on that occasion, and thus re- 
ducing the overall reinforcement rate. In 
turn, this reduction in reinforcement rate 
will, via Principle of Reinforcement 3, tend 
to reduce the tendency to make any terminal 
response, including pecking, in that situa- 
tion. This process will continue until the 
tendency to peck has been sufficiently re- 
duced to allow the key stimulus to continue 
unpecked until the delivery of reinforcement, 
which will again provide the occasion for 
the operation of the stimulus substitution 
principle, making pecking likely once again. 
Thus, an equilibrium will be established at 
a rate of pecking higher than zero, but less 
than the rate which would obtain if pecking 
had no effect on reinforcement rate. Ex- 
tinction of pecking takes place if key pecking 
prevents reinforcement, but does not turn 
off the key stimulus, because the predictability
of reinforcement, and thus the necessary
condition for the operation of stimulus
substitution, is thereby destroyed.
Under these conditions, variation and reinforcement
combine to weaken the tendency
to peck, which therefore declines relatively
rapidly.
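
The equilibrium argument for the case in which a peck turns off the stimulus can be made concrete with a toy iteration. The sketch below is not a model proposed in the studies cited; it assumes, purely for illustration, that the tendency to peck supported by stimulus substitution is proportional to the proportion of stimulus presentations that end in food, and that the actual probability of pecking moves a fixed fraction of the way toward that supported level on each step. The parameter names (`p_max`, `rate`) and their values are hypothetical.

```python
def simulate_peck_probability(peck_cancels_food, p_max=0.9, rate=0.2,
                              p_start=0.5, n_steps=500):
    """Iterate a toy model of key pecking maintained by stimulus substitution.

    p is the probability of a peck on a given stimulus presentation.
    If peck_cancels_food is True, a peck turns off the stimulus and omits
    food, so only the unpecked presentations (proportion 1 - p) end in food.
    """
    p = p_start
    for _ in range(n_steps):
        # Proportion of stimulus presentations followed by food.
        reinforcement = (1.0 - p) if peck_cancels_food else 1.0
        # Assumption: the supported tendency to peck is proportional
        # to the reinforcement rate.
        supported = p_max * reinforcement
        # The actual tendency moves part of the way toward that level.
        p += rate * (supported - p)
    return p


# Pecking has no effect on food delivery.
no_contingency = simulate_peck_probability(peck_cancels_food=False)
# Pecking turns off the stimulus and omits food.
omission = simulate_peck_probability(peck_cancels_food=True)

print(f"no contingency: p = {no_contingency:.3f}")
print(f"omission:       p = {omission:.3f}")
```

Under these assumptions the omission procedure settles at p = p_max / (1 + p_max), a value above zero but below the level reached when pecking has no effect on reinforcement, which is the qualitative pattern described in the text. The case in which pecking prevents food without turning off the stimulus is not represented here, since it turns on the loss of predictability rather than on reinforcement rate alone.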

The data on "instinctive drift" reported 
by Breland and Breland (1961, 1966) are 
also compatible with this kind of analysis.*
However, their results strongly suggest that 
too much stress should not be laid on the 
apparent identity we (and others, e.g., 
Brown & Jenkins, 1968; Williams & Wil- 
liams, 1969; Wolin, 1968) have found be- 
tween the topography of the terminal re- 
sponse and the (unconditioned) response 
made to the terminal reinforcer. As in the 
case of adjunctive behavior (discussed below)
and behavior elicited by central stimulation
in reinforcing brain areas (e.g.,
Glickman & Schiff, 1967; von Holst & von 
Saint Paul, 1963), the effect of the experi- 
mental procedures we have described ap- 
pears to be the induction of a state (in our 
terminology) or “mood” (von Holst & 
von Saint Paul) which makes some kinds 
of activity much more likely than others but 
preserves some flexibility in the animal's 
mode of response to the stimulating environ- 
ment. 

For example, Breland and Breland 
(1961) describe two situations, both show- 
ing “instinctive drift" in chickens, but only 
one of which conforms to the stimulus sub- 
stitution principle. In the first case, the 
chicken pecked a ball which he had learned 
to project (via a remote firing mechanism) 
at a target—a hit being immediately followed 
by food. The history of contiguity between
the moving ball and reinforcement and the 
similarity of the responses to both ball and 
food (i.e., pecking) fit easily into the stim- 
ulus substitution paradigm. In the second 
case, however, the chicken was reinforced 
for a chain of responses, the last of which
involved standing on a platform for 15 seconds.
After training, the chicken showed
vigorous ground scratching while standing
on the platform. This response has no topographic
resemblance to pecking food, but
is, of course, a universal food-getting behavior
in chickens and typically occurs in
the vicinity of food.


* The delayed appearance of the food-related behaviors
in the Breland and Breland situations (as
compared to the relatively rapid emergence of
pecking in auto-shaping experiments) reflects the
fact that food delivery could not become predictable
(the necessary condition for the operation of
the stimulus substitution principle) until after the
animals had learned to produce it by making the
required "arbitrary" response. However, once this
initial response was learned (via principles of
variation other than stimulus substitution), the
stage was set for the operation of stimulus substitution,
which could then override the original
learning.

These examples, and others discussed by 
Breland and Breland (e.g., raccoons “washing”
poker chips, porpoises swallowing
manipulanda, pigs “rooting” tokens, etc.) 
as well as exceptions to stimulus substitution 
in the auto-shaping literature (e.g., Rachlin, 
1969; Sidman & Fletcher, 1968), are com- 
patible with a more general notion, to the 
effect that the stimulus (temporal or extero- 
ceptive) most predictive of reinforcement 
comes to control a state or mood (the ter- 
minal state) appropriate to that reinforcer. 
The particular activity which occurs during 
the terminal period will then depend on
principles of variation, which take into 
account both its motivational properties 
(e.g., food-related activities become more
likely if the terminal reinforcer is food), and
the nature of the stimulating environment;
that is, the nature of the terminal state deter- 
mines what stimuli will be effective in elicit- 
ing what behavior. Thus, the Breland and 
Breland chickens pecked when the stimulus 
defining the terminal state was appropriate 
(in some sense) for pecking, but scratched 
when it was not; pigeons peck the key in 
auto-shaping experiments, but (based on
our results) are slower to peck in the super- 
stition situation—presumably because an 
appropriate target is not provided. Similarly,
Glickman and Schiff (1967) reviewed
a large number of studies of behavior induced
by direct brain stimulation which
suggested that the effect of stimulation at a
particular site is to induce a predisposing
motivational condition or state which may
lead to a variety of behaviors depending on
the presence or absence of appropriate supporting
stimuli. More recent work (Valenstein,
Cox, & Kakolewski, 1970) further
emphasizes the similarity between the
terminal state and the states induced by central
stimulation:


Hypothalamic stimulation does not [activate] only
one specific behavior pattern. The [stimulation]
seems to excite the substrate for a [group of] responses
that in a given species are [related to a]
common state [Valenstein et al., 1970, p. 30].
Within this more general framework,
stimulus substitution becomes a special case, 
which simply reflects the fact that the re- 
sponse normally made to the terminal rein- 
forcer often becomes highly probable when 
the animal is in a terminal state correspond- 
ing to that reinforcer, and will occur if mini- 
mal environmental support is provided. 


Interim Activities and Adjunctive Behavior


The principles of variation and reinforce- 
ment so far discussed refer to the origin and 
maintenance of the terminal response. The 
interim activities (including adjunctive behavior)
require a separate although complementary
account, to which we now turn.

There is as yet no general agreement on 
the causal factors underlying adjunctive behavior
(i.e., behavior occurring during the
interim period on a variety of intermittent
reinforcement schedules). As we have al- 
ready noted (p. 13), data presently available 
appear to rule out simple physiological inter- 
pretations (Falk, 1969), although they do 
not clearly point to any alternative. Some 
help is offered by the similarities between 
adjunctive behavior and displacement activities:
the same explanation should be adequate
for both. Possibilities are also somewhat
restricted by general functional considerations.
In the present section, we state
a tentative general hypothesis and its empirical
basis in what is presently known of
adjunctive behavior. The relationship of this
hypothesis to accounts of displacement behavior
and to general adaptiveness is discussed
in the following section.

The hypothesis may be stated in the form 
of three propositions: 

1. The interim and terminal periods corre- 
spond to states, in the sense described ear- 
lier, defined by (a) the class of reinforcer 
or reinforcers that are effective at that time, 
and (b) the applicability of principles of 
variation appropriate to that class of rein- 
forcer (e.g., food-related behaviors are likely 
to occur during the food state, defense reac- 
tions during the fear state, etc.). 

2. The terminal state corresponds to the 
terminal reinforcer; the state during the 
interim period corresponds to all other rein- 
forcers, although all need not be equally
effective. The linkage between terminal and
interim states is assumed to be direct and 
reciprocal, so that the strength (defined 
below in terms of rate) of activities during 
the interim period is directly related to the 
strength of the terminal response. 

3. The strength of the terminal response 
is directly related to the “value” of the reinforcement
schedule; that is, to relative rate
and amount of reinforcement and to motivational
factors (e.g., deprivation).

As a consequence of the reciprocal inter- 
action between the terminal and interim 
states (Proposition 2) and the dependence 
of the terminal response on the value of the 
reinforcement schedule (Proposition 3), the 
strength of behaviors associated with the 
interim period will be determined both by 
the value of the terminal reinforcement 
schedule, as well as by the value of the rein- 
forcers proper to them. 

The notion of the interim and terminal 
periods as states (Proposition 1) has al- 
ready been discussed. The dependence of 
the strength of the terminal response on 
variables related to the value of the rein- 
forcement schedule (Proposition 3) should 
also encounter no opposition. It remains to 
show, first, that reinforcers other than the 
terminal reinforcer are effective during the 
interim period; second, that a number of 
different reinforcers may be effective at 
this time; and third, that the direct relation 
between the strength of the terminal re- 
sponse and the strength of adjunctive be- 
havior implied by Propositions 2 and 3 has 
some basis in fact. Evidence for the effec- 
tiveness of reinforcers other than the termi- 
nal reinforcer, during the interim period, 
comes largely from studies of polydipsia 
(excessive drinking), induced by inter- 
mittent schedules of food reinforcement, as 
follows: (a) Consummatory behavior (e.g., 
drinking) occurs in the presence of the ap- 
propriate goal object (water) during the 
interim period on a variety of schedules.
(b) This goal object can reinforce operant
behavior:


If water is not freely available . . . concurrently
with a food schedule, but is available in small
portions contingent upon the completion of a fixed-ratio
schedule, polydipsia is acquired and will sustain
large fixed-ratios [Falk, 1970, p. 297].
Azrin (1964, reported in Falk, 1970) has
found similar schedule control, in pigeons, 
by a bird provided as a target for schedule- 
induced aggression. (c) Polydipsic drink- 
ing can be both increased and decreased by 
appropriate alteration of the palatability of 
the liquid available (Falk, 1964, 1966). 
(d) Polydipsia is usually reduced by using 
liquid terminal reinforcers. In the cases 
where the liquid used is at least as reinforc- 
ing to a hungry rat as dry food pellets 
(e.g., liquid Metrecal, condensed milk, 
liquid monkey diet; Falk, 1964, 1966; 
Hawkins, Everett, Githens, & Schrot, 1970; 
Stein, 1964), this decrease may be attributed 
to a direct effect on the thirst system (state), 
due to the water content, as in water pre- 
loading (see below). Wesson oil as terminal 
reinforcer is also less effective than food 
pellets in producing polydipsia (Stricker & 
Adair, 1966), although it contains no water, 
but is probably also less reinforcing and may 
reduce polydipsia for this reason (see Prop- 
ositions 2 and 3). (e) Acquisition of poly- 
dipsia can be prevented by presession stomach 
loads of water (Chapman, 1969, reported in 
Falk, 1970), although established polydipsia 
is little affected. This kind of effect is also
found with food-motivated terminal re- 
sponses which, once established, will con- 
tinue to occur at a somewhat reduced rate 
even in the presence of ad lib food (Neur- 
inger, 1970a). 

That more than one reinforcer is effective 
during the interim period is suggested by the 
facts that (a) the interim activities (which 
occur in the absence of appropriate goal objects)
have no consistent direction and are
not obviously related to any particular reinforcer;
(b) a variety of goal objects—water,
wood shavings, another animal—are sufficient
to elicit appropriate consummatory
reactions (drinking, chewing or eating, aggression);
(c) physiological data suggesting
short-term reciprocal interactions between
hunger and thirst drives which were
induced centrally by electrical or chemical
stimulation (Grossman, 1962; von Holst &
von Saint Paul, 1963), indicate a possible
mechanism for the simultaneous effectiveness
of reinforcers other than food at nonfood
times (interim periods) and food at food
times (terminal periods). While it is as
well to be cautious in generalizing both 
across species and from physiology to be- 
havior, these hypothalamic mechanisms are 
evidently quite similar in birds and mammals 
(cf. Akerman, Andersson, Fabricius, & 
Svensson, 1960), and recent work supports 
the similarity between schedule-induced
drinking and drinking induced by direct 
hypothalamic stimulation implied by these 
comparisons (Burks & Fisher, 1970). This 
kind of reciprocal interaction suggests that 
at a time when activity motivated by hunger 
is suppressed (e.g., during the period of 
interim activities on a food schedule), activities
motivated by thirst might be facilitated.

That the effective reinforcers during the
interim period are other than the terminal 
reinforcer is suggested both by the nonoccur- 
rence of the terminal response at that time 
and by data reported by Segal (1969b) 
showing a shift in the status of drinking — 
from an adjunctive behavior, occurring early 
in the interval, to a terminal response, occur- 
ring largely at the end. The changeover 
took several experimental sessions, but it 
suggests that in the steady state, the same 
activity is unlikely to occur during both 
terminal and interim periods, as might be 
expected if these periods are associated with 
the action of different reinforcers. 

Propositions 2 and 3 in combination imply
that the strength of adjunctive behaviors,
like that of the terminal response, should be
directly related to the "value" of the reinforcement
schedule, as indexed by motivational
and reinforcement variables. The evidence
in favor of this deduction is as follows:
(a) With the interreinforcement interval
held constant, the amount of polydipsic
drinking in a session of fixed length is inversely
related to body weight (Falk, 1960).
Similar results have been reported for schedule-induced
attack and air licking (Falk,
1970). (b) Postpellet pause (until the onset
of polydipsic drinking) increases as a
function of the interval duration (Segal et
al., 1965). (c) Rate of licking within a
drink bout tends to decrease as a function of
interval length (Segal et al., 1965). (d)
Rate of polydipsic drinking is increased by
increasing the size of food reinforcement
(Hawkins et al., 1970). Hawkins et al.
attribute a contrary result by Falk (1967) to 
session length differences between the one- 
and two-pellet conditions of Falk’s experi- 
ment. (e) Polydipsic drinking falls off 
drastically at fixed-interval values longer 
than about 3 minutes (Falk, 1966; Segal
et al., 1965). Falk (1966) also reports a
direct relationship between polydipsia (mea- 
sured as total amount drunk per session) 
and fixed-interval length over the range 2- 
180 seconds. However, based on the results 
of experiments indicating the relative con- 
stancy of ingestion rate within a drinking 
bout in rats (Davis & Keehn, 1959; Schaef- 
fer & Premack, 1961; Stellar & Hill, 1952), 
and the different times available for poly- 
dipsic drinking under different fixed-interval 
values, Falk’s finding of a direct relationship 
between fixed-interval value and total
amount drunk, over part of the range, is 
compatible with an overall inverse relation- 
ship in terms of rate of drinking. This in- 
ference is confirmed by recent data reported 
by Hawkins et al. (1970) which show a 
monotonically decreasing ingestion rate as 
a function of fixed-interval length, over the 
range 1-5 minutes. Given that overall in- 
gestion rate is probably better than total 
amount drunk as a measure of the tendency 
to drink, this finding is both consistent with 
the deduction from Propositions 2 and 3 
and more easily reconciled with other mea- 
sures that indicate an inverse relationship 
between tendency to drink and frequency of
reinforcement. Falk’s finding that total 
amount drunk is maximal at intermediate 
interval values may reflect, therefore, an 
optimal balance between two factors: tend- 
ency to drink, which decreases as interval 
value increases, and time available for drink- 
ing, which increases with interval value. 
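
The way in which a monotonically decreasing tendency to drink can yield a maximum in total intake at intermediate interval values can be shown with a small numerical sketch. The functional forms and constants below are assumptions chosen only for illustration and are not fitted to the data of Falk (1966) or Hawkins et al. (1970): ingestion rate is taken to decline exponentially with interval length, the whole interval is treated as available for drinking, and each session is assumed to contain a fixed number of intervals.

```python
import math


def drinking_rate(interval_s, max_rate=1.0, decay_s=180.0):
    """Hypothetical ingestion rate (arbitrary units per second).

    Assumed to fall exponentially as the fixed-interval value grows.
    """
    return max_rate * math.exp(-interval_s / decay_s)


def total_intake(interval_s, n_intervals=60):
    """Total amount drunk in a session with a fixed number of intervals.

    Time available for drinking is taken to be the whole interval, so
    intake per interval is rate multiplied by interval length.
    """
    return n_intervals * interval_s * drinking_rate(interval_s)


intervals = [2, 15, 30, 60, 120, 180, 300]  # fixed-interval values, seconds
rates = [drinking_rate(t) for t in intervals]
totals = [total_intake(t) for t in intervals]
peak_interval = intervals[totals.index(max(totals))]

for t, r, total in zip(intervals, rates, totals):
    print(f"FI {t:>3} s: rate = {r:.3f}, total = {total:8.1f}")
print(f"total intake is greatest at FI {peak_interval} s")
```

With these assumed values the rate measure falls throughout the range while the total rises and then declines. The location of the maximum depends entirely on the arbitrary decay constant and should not be read as an estimate.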

The above account, in terms of interaction 
between motivational systems (states), 
seems to be the simplest that can presently 
be given of both adjunctive behavior and 
displacement activities. Its relationships to
these activities, to the ethological interpre- 
tation of them, and to general functional 
considerations are discussed in the next 
section. 


Adjunctive Behavior and Displacement 
Activities 

Most behavioral and morphological char- 
acteristics are adaptive, in the sense that 
they can be directly related to the reproduc- 
tive fitness of the organism in its natural 
environment. Morphological exceptions to 
this rule are either the result of “correlated 
variation,” in Darwin’s phrase, or are vesti- 
gial characters, in the process of being lost. 
In neither case do they show the ubiquity 
and reliability that distinguish adjunctive 
and displacement activities. It is very likely, 
therefore, that these behaviors reflect a 
function, or functions, of considerable adap- 
tive value to animals in the wild. What 
might this function be? 

Learning theories generally consider the 
organism to be motivated by one thing at a 
time, for example, hunger, thirst, explora- 
tory drive, etc. In like spirit, most learning 
experiments are designed to ensure the pre- 
dominance of one kind of motivation at the 
expense of all others. In the wild, however, 
animals must allocate their time among a 
variety of activities so as to both satisfy cur- 
rent wants and anticipate future ones. It is 
reasonable to assume that there has been con- 
siderable selection pressure favoring an opti- 
mal balance among the various possibilities. 

We have already noted the fact, related 
to stimulus discrimination, that animals tend 
to make the terminal response only at times 
when reinforcement is likely. This fact, 
and the principle of reinforcement based on 
it, might seem to reflect some kind of Law 
of Effort, since responding at times or places 
when reinforcement never occurs is ob- 
viously wasteful. However, a considerable 
weight of evidence suggests that the Law of 
Effort is not a major psychological prin- 
ciple, since it is easy to devise situations in 
which animals make many more responses 
than necessary (cf. Ferster & Skinner, 
1957). A Law of Effort principle would 
also not explain active avoidance of situations
associated with nonreinforcement.

A more plausible alternative is that these 
facts are related to animals’ need to budget 
their time effectively. In these terms, a
time, or stimulus, reliably associated with
the absence of a given reinforcer provides
information just as useful as a time per- 
fectly correlated with the delivery of that 
reinforcer, since it permits the animal to 
attend to present and future needs other 
than the one associated with the absent rein- 
forcer. However, other potentialities of the 
environment cannot usually be sampled as 
long as the animal remains in the vicinity 
of the unavailable reinforcer. One might 
expect, therefore, that natural selection will 
have fostered the development of a mecha- 
nism to ensure that animals avoid places at 
times when, on the basis of past experience, 
they have learned that reinforcement is not 
forthcoming. 

Evolution is notoriously opportunistic in 
the sense that adaptation is achieved by 
whatever structural or functional means 
happen to be available. In the present case, 
we suggest that the means for ensuring that 
animals will not linger in the vicinity of 
food (or other reinforcers) at times when it 
is not available may be provided by the fa- 
cilitation of drives other than the blocked 
one (Propositions 2 and 3); that is, that 
the relative aversiveness of the stimuli in 
the vicinity of food, during the interim pe- 
riod, may be a direct effect of the simulta- 
neous suppression of the food state and fa- 
cilitation of states associated with other re- 
inforcers. In the wild, such facilitation will 
usually ensure that the animal leaves the
situation to seek other reinforcers. More- 
over, once the animal has left the situation, 
generalization decrement will ensure that 
the effect of factors acting to facilitate these 
other drives is reduced, restoring the animal 
to a state appropriate to his condition of de- 
privation and allowing him to take advantage 
of new opportunities to satisfy the previously 
blocked drive.

In experimental situations showing ad- 
junctive behavior, however, the animal is 
kept in the vicinity of the withheld rein- 
forcer, both by the physical restraint of the 
enclosure and, perhaps more importantly, by 
the properties of the reinforcement schedule. 
Since enclosure size has not been explicitly 
investigated, one cannot be sure of the rela- 
tive importance of these two factors. In our 
laboratory, we have often observed animals 
on time-based schedules and noticed that
pigeons tend to turn away from the key during
the no-pecking phase of schedules such
as the fixed-interval, which yield a period
of no pecking followed by pecking. Figure 1,
which shows the birds’ orientation
(R1) as a function of postfood time, is a
quantitative record of this effect. However,
the avoidance of the key is much more complete
on schedules which require pecking
followed by no pecking (temporal go-no go
schedules, Staddon, 1970a, 1970b), presumably
because the no-pecking period is
terminated by an external event (delivery
of reinforcement) rather than by the bird
returning to the key to peck. These observations
provide some evidence both for the
tendency of pigeons to avoid the key at times
when key pecking is not reinforced and for
the restriction placed on this tendency by
fixed-interval schedules.

In situations involving external (rather
than temporal) stimuli, there is also considerable
evidence for the aversive properties
of stimuli associated with nonreinforcement
when they occur in a context associated with
reinforcement (e.g., multiple and concurrent
reinforcement schedules; cf. Beale & Winton,
1970; Catania, 1969; Terrace, 1966).
Thus, the temporal locus of adjunctive
(and interim) behavior coincides with a
period when, by other measures, the situation
is aversive to the animal so that he
will withdraw from it if he can. Studies of
schedule-induced escape have shown that
animals will learn to make a response during
the interim period on fixed-ratio schedules
that has the effect of removing them from
the situation, even though the frequency of
the terminal reinforcement may thereby be
reduced (e.g., Azrin, 1961; Thompson,
1964). The present argument suggests that
these data may reflect a general property of
interim periods, although experimental results
with other schedules are presently
lacking.

Falk (1969, 1970) has ably summarized
the similarities between adjunctive behavior
and displacement activities. These include
the apparent “irrelevance” of both kinds of
activity, their association with situations in
which a strong drive is blocked, and their
modifiability by available stimuli and conditions
of deprivation other than the major
drive (cf. Morris, 1954; Rowell, 1961;
Sevenster, 1961). A study by McFarland 
(1965) concerning displacement pecking in- 
duced by preventing drinking in thirsty doves 
further emphasizes the "state" property of 
displacement behavior: 


The evidence for the view that this pecking be- 
longs to the feeding system . . . [is] as follows: 
1. Total time spent pecking is increased by the 
presence of grain. 2. Time spent pecking is partly 
replaced by time spent at a specific food getting 
activity, when the birds have previously been 
trained to obtain food in this way. 3. Food de- 
privation increases the time spent pecking 
when grain is present, and this effect is counter- 
acted by [prefeeding] [p. 298]. 


The interpretation of adjunctive behavior 
that we have presented is very similar to the 
disinhibition hypothesis concerning displace- 
ment activities first suggested by Andrew 
(1956). Hinde (1966) summarizes this 


view as follows: 


When mutual incompatibility prevents the appear- 
ance of those types of behaviour which would 
otherwise have the highest priority, patterns which 
would otherwise have been suppressed are per- 
mitted to appear [p. 279]. 


Displacement behavior is usually exhibited 
in approach-avoidance conflict situations 
(e.g., territory defense, birds returning to 
the nest after an alarm, etc.) when the 
animal is consequently prevented from leav- 
ing the situation. We have already seen 
that during the interim period on intermit- 
tent reinforcement schedules, animals are 
also restrained in the situation both by the 
enclosure and, probably, by the properties 
of the schedule. Both situations therefore 
meet the conditions necessary and sufficient, 
by our hypothesis, for the elevation (rather 
than merely disinhibition) of motivational 
states other than the blocked one, leading in 
the schedule case to adjunctive behavior, and 
in the approach-avoidance case to displace- 
ment, redirection, or vacuum activities—the 
particular activities being a function of the 
strength and nature of the blocked response, 
the proximity to the goal, the stimuli
available, the past history of the animal, and the
duration of the blocking. This
modification causes no difficulty in application to
displacement behavior and meets theoretical 
objections to certain forms of the disinhibi- 
tion hypothesis raised by McFarland (1966). 
Finally, our account of the adaptive signif- 
icance of adjunctive behavior, as a reflection 
of the integrative capacities of the organism 
which enable it to strike an efficient balance 
among a number of activities, finds a coun- 
terpart in McFarland’s (1966) account of 
the significance of displacement behavior: 


Thus it is suggested that the functional signif- 
icance of displacement activities is that they are 
the by-product of a mechanism which enables 
animals to break away from a specific course of
action, when progress in that course of action 
comes to a standstill [p. 231]. 


We conclude, therefore, that interim, ad- 
junctive, and displacement behaviors may be 
grouped together on the basis of similar 
functional properties, similar probable causal 
factors, and similar adaptive role. 

In summary, the argument relating to the 
interim period is as follows: (a) Extant data 
on adjunctive behavior are consistent with 
a tentative general interpretation in terms 
of interactions among motivational systems 
(states). (b) On the basis of general 
adaptive considerations, we have suggested 
the probable existence of a mechanism which 
enables animals to budget their time
efficiently (e.g., by giving up temporarily
ineffective activities). (c) McFarland has
suggested that displacement and other "irre- 
levant" activities may reflect the action of 
such a mechanism. (d) Falk has pointed 
out the extensive similarities between ad- 
junctive and displacement activities. (e) 
McFarland’s suggestion may therefore be 
extended to interim and adjunctive behav- 
iors, both on the basis of their resemblance 
to displacement activities and their restric- 
tion to the aversive interim period. Thus, 
the general interpretation of adjunctive be- 
havior offered earlier gains additional sup- 
port from the resemblances between adjunc- 
tive and displacement activities, from its 
similarity to the disinhibition hypothesis for 
displacement activities, and from its
adequacy as a mechanism for enabling
animals to budget their time efficiently.

This ends our outline of conditioning.
We have dealt with both terminal periods— 
which, we suggest, reflect a Law of Effect 
process that can best be understood by
analogy with evolution by means of natural
selection—and interim periods—which may 
reflect a mechanism enabling animals to 
allocate their activities efficiently. Learned 
behavior, under the relatively simple condi- 
tions of reinforcement schedules at least, is 
viewed as reflecting the sequencing, with 
respect to time and stimuli, of terminal and 
interim periods; and the scheme is therefore 
potentially comprehensive, although nec- 
essarily incomplete as to details. 

Our proposal is founded on the belief that 
the most distinctive thing about living crea- 
tures is the balance they maintain among a 
number of tendencies to action, each one 
adaptive, yet each destructive if pursued to 
the exclusion of others. This emphasis on 
the integration of behavior has required that 
the scheme attempt to be comprehensive and 
that it relate in a natural way to biological 
and physiological considerations. Such 
merits as it possesses lie not in formal ele- 
gance or precision, but in an ability to 
organize otherwise unrelated facts and to 
suggest gaps where others may possibly be
found.

Oklahoma

The scope and power of the drive theory of social facilitation are extended
by taking fuller advantage of learning theory models of drive. Existing
theory employs the irrelevant drive paradigm, in which neither the
initiation nor the termination of audience observation is contingent on the
behavior of the subject or on the onset of the conditioned stimulus. Extensions
include escape and avoidance of audience observation, as well as the classical
conditioning of symbolic audiences. In addition, nine methods for varying
the strength of audience-induced drive are developed, based on the model
of learned drive, including extinction, summation, generalization, acquisition
and five forms of inhibition.


The effect of the presence of others on
the behavior of an individual is generally 
recognized as one of the traditional central 
problems of social psychology. The study 
of such effects has come to be called the prob- 
lem of social facilitation, despite the fact 
that the presence of others sometimes facili- 
tates and sometimes impairs performance. 
In a theoretical essay of great elegance, 
Zajonc (1965) reconciled the apparently 
contradictory results of numerous experi- 
ments on social facilitation. Zajonc pro- 
posed that the presence of others, whether 
audience or coactors, serves to arouse drive 
(D). Since generalized drive energizes all 
habits, both correct and incorrect responses 
will be facilitated, with the greatest benefits 
going to the best learned dominant responses 
(e.g., Spence, 1956). Cottrell (1968)
greatly sharpened this theoretical analysis by 
(a) replacing the overinclusive concept of 
dominant responses with the more correct 
learning theory concept of stronger habit 
strength (H), (b) identifying and
demonstrating the learned drive basis of D, (c)
clarifying the boundary conditions of the 
effect, and (d) emphasizing the importance 
of using experimental tasks which are al- 
ready known to reflect the effects of drive. 
It now seems quite clear that the drive
induced by audience observation is not an
appetitive drive like hunger or thirst, but an
aversive one like frustration or anxiety.
This point follows directly from Cottrell's 
learned drive interpretation, since the evi- 
dence is abundant that learned drives are 
based on noxious primary drives like pain 
(e.g., Miller, 1948) and frustration (e.g.,
Daly, 1969; Wagner, 1963) and cannot 
be based on appetitive drives like hunger 
and thirst (e.g., Cravens & Renner, 1969; 
Myers & Miller, 1954; Novin & Miller, 
1962). Social psychological experiments on
the effects of audience observation on
nervous sweating (Martens, 1969a, 1969b) also
support this conclusion.³

³ Among the particular drives [...] for
social facilitation which have [...] are
fear and anticipatory frustration (Cottrell, 1968;
Lott & Lott, 1968) as well as conflict-induced drive
(Thibaut & Kelley, 1959). Variations on fear
drive might include shame, [...] effectance,
and, occasionally, guilt. Cottrell (1968) and
Lott and Lott (1968) briefly entertain the
possibility of facilitation based on incentive motivation, but
the available evidence does not support the Spencian
hypothesis of drive-like energization of preexisting
responses by conditioned incentive (Armus,
Carlson, Guinan, & Crowell, 1964; [...] &
Sniadowski Dolinsky, 1966; Trapold &
Winokur, 1967).

A rapidly expanding research literature
has been directly inspired by the drive theory
of social facilitation (e.g., Cottrell, Rittle, &
Wack, 1967;* Henchy & Glass, 1968;
Klinger, 1969; Martens, 1969a; Matlin &
Zajonc, 1968; Weiss, Miller, Langan, &
Cecil, 1971). In these studies, the socially
facilitated subject typically works on a
learning task while an audience observes his
performance. The presence of the audience
induces irrelevant drive (D), which ener- 
gizes all responses in the situation. The 
habit strength (H) of the various responses 
may be manipulated with the result that the 
greatest benefits of differences in drive go 
to the strongest habits, whether these are 
"correct" or "incorrect." Thus, all the 
studies based on the drive theory of social 
facilitation have used social analogs of the 
standard learning paradigm of a test for 
the energizing effect of an irrelevant drive on 
behavior, in which neither the onset nor the 
reduction of drive is contingent on the be- 
havior of the subject or the presentation of 
the CS. However, learning theory ascribes 
other effects to drive besides response ener- 
gization, which may be studied in paradigms 
in which drive is relevant to the stimulus- 
response-reinforcement contingencies. Thus, 
the reduction of noxious drives is responsible 
for conditioning instrumental escape and 
avoidance responses. In other paradigms, 
drive stimuli serve a punishing function. 
Learning theory and its related research also 
offer a fairly detailed statement of the man- 
ner in which the contingency between a CS 
and drive onset leads to the conditioning of 
learned drives. The purpose of this paper 
is to extend and deepen the drive theory of 
social facilitation by specifying the implica- 
tions of learning models for predicting the 
effects of the presence of others on behavior, 
beyond the confines of the irrelevant drive 
paradigm. The paradigms to be discussed 
include escape and avoidance conditioning, 
punishment, and the classical conditioning 
of learned drives. For each paradigm we 
shall summarize the learning model, derive 
hypotheses concerning audience effects on 
performance, and suggest some ways of im- 
plementing the empirical tests. 
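
The energizing role of irrelevant drive described above can be illustrated with a minimal numerical sketch. It assumes the multiplicative Hull-Spence rule, in which excitatory potential is the product of habit strength and drive (E = H × D). The function names and all habit and drive values are hypothetical and are chosen only for illustration; they come from no experiment.

```python
# Sketch of the multiplicative drive rule of Hull-Spence theory: E = H * D.
# All numerical values below are illustrative assumptions.

def excitatory_potential(habit_strength: float, drive: float) -> float:
    """Return E = H * D for a single response."""
    return habit_strength * drive


def dominance_gap(habits: dict, drive: float) -> float:
    """Difference in E between the strongest and the weakest habit."""
    values = [excitatory_potential(h, drive) for h in habits.values()]
    return max(values) - min(values)


# Hypothetical habit strengths for a dominant and a subordinate response.
habits = {"dominant": 0.8, "subordinate": 0.3}

alone = dominance_gap(habits, drive=1.0)     # assumed drive level without audience
observed = dominance_gap(habits, drive=2.0)  # assumed drive level with audience

print(f"gap alone: {alone:.2f}, gap observed: {observed:.2f}")
```

Because drive multiplies every habit in the situation, raising D widens the gap in favor of the stronger habit, whether that habit is "correct" or "incorrect."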

Before discussing the four paradigms just 
mentioned, it will be useful to make more 
explicit the theory construction methods em- 
ployed, and then to specify the methods for 
manipulating the strength of audience-in- 
duced drive which are essential to the dis- 
cussion of all four paradigms. 


METHOD 


In this paper, the general approach which Miller 
(1959) has called “extension of liberalized S-R 
theory” is employed to further develop the Zajonc- 
Cottrell drive theory of social facilitation. Hull- 
Miller-Spence learning theory and its related 
research (referred to frequently hereafter simply 
as “learning theory” or “learning principles”) 
(e.g., Amsel, 1967; Brown & Farber, 1968;
D'Amato, 1969; Dollard & Miller, 1950; Hull, 
1943, 1952; Logan, 1959; Spence & Spence, 1966; 
Spence, 1956, 1966) is used as a model for social 
facilitation paradigms corresponding to instru- 
mental escape conditioning, avoidance conditioning, 
punishment, and classical conditioning of learned 
drives. 

Hullian learning theory has been developed 
primarily to predict individual behavior in highly 
controlled experimental situations. Nevertheless, 
learning theory has been extended not only to social 
facilitation but to numerous social processes with 
a considerable measure of success (e.g., Lott &
Lott, 1968; Miller & Dollard, 1941; R. F. Weiss, 
1963, 1968; Wheeler, 1966; Whiting & Child, 
1953; Wolpe, 1958). The explanatory power of 
learning theory stems, in part, from three sources. 
First, learning theory principles cover a variety 
of basic learning paradigms, such as classical de- 
fense conditioning, instrumental escape condition- 
ing, etc. Second, learning theory principles may 
be combined in a determinate manner, becoming 
more powerful explanatory tools when the manner 
of their interaction is specified. Third, learning 
theory is quantitative, with the usual advantages 
that attend scientific quantification.


Technique of Theory Construction 


The use of a model in theory construction typi- 
cally involves the specification of a dictionary of 
analogies (rules of correspondence) which relate 
the variables of the model to the variables of the 
research area to be explained and predicted. Once 
this is done, the relations holding among the vari- 
ables of the model must, theoretically, also hold 
between the corresponding (analogous) variables 
in the data area to be explained (e.g., Brodbeck, 
1959; Campbell, 1920; Lachman, 1960;
Oppenheimer, 1956). The systematic use of learning
theory as a model for social facilitation makes it 
possible to take full advantage of the three char- 
acteristics of learning theory mentioned above: 
principles covering a variety of learning
paradigms, combination of principles in a determinate
manner, and quantitative specification. Thus, for 
example, social facilitation research has never 
taken advantage of the numerous learning
principles applicable to an escape conditioning paradigm.
Three such principles regarding the effects of
delay of reinforcement on measures of escape
conditioning may be stated informally in the following
manner: (a) the delay of reinforcement gradient
is decreasing and negatively accelerated in shape;
(b) delay of reinforcement and number of trials
combine multiplicatively; and (c) shorter delays of
reinforcement maximize the effects of differences
in magnitude of reinforcement. If a social variable
is to be analogous to delay of reinforcement in 
escape conditioning, its effects on appropriate de- 
pendent measures must show that (a) this social 
variable gradient is decreasing and negatively 
accelerated in shape, (b) this social variable and 
number of trials combine multiplicatively, and (c) 
smaller values of this social variable maximize 
the effects of differences in magnitude of reinforce- 
ment. 
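
The three requirements just listed can be made concrete with a small sketch. The exponential forms and rate constants below are assumptions chosen only because they satisfy the three informal principles; they are not the quantitative functions of any particular learning theorist, and the names `delay_gradient`, `growth_over_trials`, and `response_speed` are hypothetical.

```python
import math

# Illustrative functional forms only; the rate constants are assumptions.

def delay_gradient(delay: float, k: float = 0.5) -> float:
    """Decreasing, negatively accelerated gradient (exponential decay assumed)."""
    return math.exp(-k * delay)


def growth_over_trials(trials: int, rate: float = 0.2) -> float:
    """Negatively accelerated growth toward 1.0 over reinforced trials (assumed)."""
    return 1.0 - math.exp(-rate * trials)


def response_speed(delay: float, trials: int, magnitude: float) -> float:
    """Speed as the product of magnitude, delay gradient, and trials."""
    return magnitude * delay_gradient(delay) * growth_over_trials(trials)


# (a) The gradient falls with delay, and each further unit of delay costs less.
drop_early = delay_gradient(0) - delay_gradient(1)
drop_late = delay_gradient(4) - delay_gradient(5)

# (b) Multiplicative combination: the ratio of speeds at two delays is the
#     same whatever the number of trials.
ratio_few = response_speed(1, 5, 1.0) / response_speed(3, 5, 1.0)
ratio_many = response_speed(1, 30, 1.0) / response_speed(3, 30, 1.0)

# (c) A given magnitude difference has a larger effect at the shorter delay.
effect_short = response_speed(0, 20, 2.0) - response_speed(0, 20, 1.0)
effect_long = response_speed(5, 20, 2.0) - response_speed(5, 20, 1.0)

print(drop_early > drop_late > 0,
      math.isclose(ratio_few, ratio_many),
      effect_short > effect_long)
```

A candidate social variable would be substituted for `delay` and then checked against the same three comparisons in the data.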

The above points illustrate the development of 
an analogy between learning and social independent 
variables. The analogy does not have testable im- 
plications until analogies between learning and 
social dependent variables are also developed.
Moreover, it is necessary to clearly specify which
learning paradigm the social conditions are
analogous to; instrumental reward conditioning,
instrumental escape conditioning, selective learning,
etc., differ sharply in certain regards.


Models and Translations 


The theories developed here are not mere trans- 
lations of one behavioral science language into an- 
other, since the theories generate hypotheses which 
are not in the existing literature. A “mere trans- 
lation” would require, for example, that the
hypotheses concerning the previous paragraph's social
variable already exist in the literature of social
facilitation research before they could be merely
translated into statements about delay of
reinforcement and number of trials.


Analogy and Reduction 


Nowhere in this essay is it proposed that social
behavior represents a distinct higher level of 
phenomena which can only be fully explained by 
reducing it to a lower level of phenomena such 
as conditioning. Nor is this dubious philosophical 
program implicit in our intentions. Rather, the 
intention has been to use the relatively well under- 
stood as a model for that which is presently less 
well understood. The programmatic assumptions 
involved are commonplace to scientific method:
Well-constructed models and analogies help to 
stimulate and guide research and to integrate broad 
ranges of knowledge through an underlying set of 
common principles. 


Nature of Reinforcement 


The theories developed here are not specifically 
concerned with the ultimate nature of reinforce- 
ment: an empirical law-of-effect position is taken 
throughout. The empirical law of effect recognizes
the fact that reduction of noxious stimulation
following a response is reinforcing, whereas the more
controversial drive reduction theory of
reinforcement holds, in its fully developed form, that all
reinforcement is based on drive reduction. The
drive reduction theory is not essential to
Hull-Miller-Spence learning theory. Hullian theory
predicts and explains in the same manner without
reference to this particular conception of the na- 
ture of reinforcement (Dollard & Miller, 1950; 
Miller, 1959; Spence, 1956). 


MANIPULATING DRIVE STRENGTH


In the discussion of the escape, avoidance, 
punishment, and classical conditioning para- 
digms for social facilitation which follow the 
present section, variation of the strength of 
audience-induced drive occupies a prominent 
place, and it will be useful, therefore, to first 
explore methods for manipulating drive
strength. If the drive induced by audience
observation is a learned one, as theory and
experiment indicate (e.g., Cottrell, 1968;
Cottrell, Wack, Sekerak, & Rittle, 1968;
Henchy & Glass, 1968; Klinger, 1969; Lott
& Lott, 1968; Paulus & Murdoch, in press),
it should be possible to extend some of the
familiar techniques for the manipulation of
learned drive strength to the social
facilitation situation. These techniques include
extinction, inhibition, summation,
generalization, and acquisition.


Extinction 


The strength of a learned drive may be
decreased by repeated presentation of the
CS which arouses that drive, unaccompanied
by the aversive UCS (e.g., Armus, 1960;
Kalish, 1954). If audience observation
arouses a learned drive, it should be possible
to decrease the drive-arousing power of
audience observation by repeated exposures to
that audience without noxious consequences
following from that observation.


Inhibition 


Since learned drives manifest a high
resistance to extinction, a more powerful
method of decreasing the strength of
audience-induced drive may be to elicit a
response which is incompatible with the
learned drive. One version of this method
would be to give subjects reciprocal
inhibition therapy (Wolpe, 1958) for “audience
phobia” (Paul, 1967). Another [...]
would be to countercondition against the
[...] presentations of [...] by pleasant
[...] (Melvin & Brown, [...]). [...]
counterconditioning [...] goal response to a
primary reward object may be drive
reducing (Wilson & Dinsmoor, 1970; but see
also Nelson, 1966), suggesting the 
applicability of Razran's (1938) luncheon 
technique to social facilitation research. 
Similarly, there is some evidence that a 
signal which predicts positive reward will 
inhibit fear drive (Bull & Overmier, 1969);
a signal which indicates that pleasant re- 
warding consequences will follow audience 
observation should then serve as an inhibitor 
of audience-induced drive. Indeed, a sig- 
nal which predicts mere safety from shock 
will inhibit a learned fear drive (e.g., Bull 
& Overmier, 1968; Rescorla, 1969; Rescorla 
& LoLordo, 1965); a signal which indicates
that negative consequences will not follow 
audience observation should then serve as 
an inhibitor of audience-induced drive. Such 
a signal could, of course, be verbal. 


Summation 


If a learned drive is independently condi- 
tioned to more than one CS, the strength of 
the learned drive may be increased by pre- 
senting these conditioned stimuli simul- 
taneously (McAllister & McAllister, 1965, 
1967). If a learned drive is aroused by
observers, it may be possible to vary the
strength of the learned drive by varying the 
number of observers in the audience. 


Generalization 

Learned drives may be weakened by re- 
ducing the similarity of the drive-arousing 
CS to the CS originally used in establish- 
ing the learned drive. Similarly, learned 
drives will generalize to a new situation to 
the extent that the new drive-arousing CS 
resembles the original one (e.g., Desiderato, 
1964). Thus, it should be possible to vary 
the strength of the learned drive aroused by 
audience observation by varying the similar- 
ity of the audience involved to those audi- 
ences whose observation and evaluation have 
typically been followed by harsh conse- 
quences. The important finding of Cottrell 
et al. (1968) that blindfolded observers do 
not arouse enough drive to produce a social 
facilitation effect may be regarded as an ex- 
treme case of this generalization mechanism. 
Less drastic variations might include the 
adoption of different degrees of evaluative
stance by the observers (Paulus & Mur-
doch, in press), or the observation of sub- 
jects by observers who typically exercise dif- 
ferential hierarchal or peer-level punitive 
power over them—as the observation of 
white-collar workers by (a) managers, or 
(b) other white-collar workers, or (c) blue- 
collar workers—or the observation of college 
students by other students or by children. 
Given sufficient information on the subjects, 
the possibilities of arousal of the shame and 
guilt forms of learned fear (e.g., Aronfreed,
1968; Dollard & Miller, 1950) are intriguing.
Consider the possibilities of facing subjects 
who have stronger feelings of guilt and 
shame toward their mothers than toward 
their fathers with audiences consisting of 
women or men, or the reactions of (mea- 
surably) guilty whites to white or black 
observers. 


Acquisition 

Finally there is the possibility of trying 
to directly condition learned drives to the 
audience in order to increase its ability to 
arouse drive, for example, by following audi- 
ence observation with harsh evaluation or 
other noxious stimuli, prior to employing 
this audience in a typical social facilitation 
experiment. The acquisition method per- 
mits a laboratory simulation of the natural 
origins (Cottrell, 1968) of audience effects. 
It may well be possible to condition drive 
to a particular audience, without requiring 
the audience to judge the subject harshly, by 
means of higher order conditioning (e.g.,
Anderson, Johnson, & Kempton, 1969;
Kamil, 1969; McAllister & McAllister,
1964). Aronfreed (1968) argues that
learned shame is aroused by highly visible
incompetence. If the experimenter arranges 
for the subject to perform incompetently 
before an audience, shame should be elicited, 
and a higher order learned drive of shame 
should then become conditioned to that par- 
ticular audience. 


ESCAPE CONDITIONING


Primary Concepts 

One of the most important tools of thought 
provided by learning theory is the
Thorndikian tradition of the law of effect (e.g.,
Berger & Lambert, 1968; Skinner, 1963).
In this instrumental learning tradition,
attention is focused on the contingency between
a response and its reinforcing consequences, 
rather than on the stimulus antecedents of a 
response. In the typical irrelevant drive 
paradigm for audience effects, we focus on 
the stimulus antecedents of a response: The 
subject is exposed to the audience stimula- 
tion and the effects on behavior are observed. 
In the escape conditioning paradigm, we
avail ourselves of the opportunity of the 
Thorndikian tradition to focus on the con- 
tingency between a response and its reinforc- 
ing consequences. In a standard learning 
experiment on escape conditioning, reduc- 
tion of an aversive drive, such as electric 
shock, is made contingent on the perform- 
ance of an instrumental response, such as 
bar pressing. The reduction of the aver- 
sive drive serves to reinforce the instru- 
mental response. In social facilitation re- 
search, the aversive drive is provided by 
audience observation, and the termination of 
this observation should reinforce a response 
which precedes it. A concrete example of a
possible social facilitation escape conditioning
experiment, phrased in Miller and
Dollard’s (1941) familiar terms of drive, cue,
response, and reinforcement, follows. The 
deception procedures needed in a finished 
experiment are omitted here, and leave
considerable room for experimenter ingenuity.
Two observers watch and evaluate a subject 
through a transparent glass screen and 
microphones. They are clearly visible to 
the subject and serve to induce drive. The 
social drive-reducing instrumental response 
to be learned is button pushing. Pushing 
the button opaques the screen and turns off 
the microphone, terminating the observation 
for some preset period of time, thus rein- 
forcing the subjects. The cue for respond- 
ing is the removal of a cover from the button. 
The time between this cue and the social 
drive-reducing instrumental response is re- 
sponse latency, and latency, or its reciprocal, 
response speed, is the dependent variable. 
If audience observation induces an aversive 
drive, the subjects should learn to press a 
button in order to terminate this observa- 
tion. 
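
The dependent variable in this example can be sketched as a small data structure. The class name `EscapeTrial`, its field names, and the timing values are hypothetical; only the definitions of latency (time from cue to button push) and speed (the reciprocal of latency) are taken from the description above.

```python
from dataclasses import dataclass


@dataclass
class EscapeTrial:
    """One trial of the hypothetical audience-escape experiment."""
    cue_time: float       # seconds at which the button cover is removed
    response_time: float  # seconds at which the button is pushed

    @property
    def latency(self) -> float:
        """Time between the cue and the drive-reducing response."""
        latency = self.response_time - self.cue_time
        if latency <= 0:
            raise ValueError("response must follow the cue")
        return latency

    @property
    def speed(self) -> float:
        """Response speed, the reciprocal of latency."""
        return 1.0 / self.latency


# Invented latencies of 4, 2, and 1 seconds on three successive trials.
trials = [EscapeTrial(0.0, 4.0), EscapeTrial(0.0, 2.0), EscapeTrial(0.0, 1.0)]
speeds = [t.speed for t in trials]
print(speeds)
```

If audience observation is aversive, speeds recorded in this way should rise over reinforced trials, as in the invented values above.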


Independent Variables: Rules of Correspondence


The possibilities of the escape conditioning 
model are not exhausted by the single in- 
formal prediction above. If rules of corre- 
spondence between escape conditioning vari- 
ables and social facilitation variables are 
specified, then the relations holding among 
the escape conditioning variables should, in 
theory, hold among the corresponding social 
facilitation variables. The rules of
correspondence relating the variables of escape
conditioning to the variables of social fa- 
cilitation are given below, and numbered for 
later reference. Several kinds of trials are 
possible in this paradigm. Corresponding 
to a reinforced trial in escape conditioning 
is a social facilitation trial in which the audi- 
ence observation begins before the social 
drive-reducing instrumental response and is 
terminated by it (1). Corresponding to a
nonreinforced trial in escape conditioning 
is a social facilitation trial in which the audi- 
ence observation begins before the response, 
but the response does not terminate the 
aversive observation (2). A series of such 
nonreinforced trials following acquisition is 
analogous to extinction (3), while a mix- 
ture of trials on which the button push 
sometimes terminates the audience
observation and sometimes does not is analogous
to partial reinforcement (4). It is also
possible to have a trial which is not observed
by an audience, corresponding to a procedure
sometimes used [...]
which the shock is altogether omitted from
some trials (5). [...]mental rules. Similarly, in the [...]
intermittent shock schedules introduced by
Jones (1953) and recently studied by
Franchina (e.g., 1966, 1969a) (6).
The intensity of drive induced through
audience observation can be manipulated by 
the several techniques described earlier (7). 
In escape conditioning the shock is typically 
reduced to a zero level, following a response, 
on a reinforced trial. When it is necessary 
to vary magnitude of reinforcement
independently of the intensity of drive, this may
be done by reducing the shock by large or 
small amounts following the response. If 
drive intensity depends at all on the number 
of observers, it might be possible to vary an 
analog of magnitude of reinforcement by re- 
ducing the number of observers by large 
or small amounts following the response (8). 
A procedure precisely analogous to this 
has been successfully employed in fear-moti- 
vated escape conditioning by McAllister and 
McAllister (1967), who motivated their
subjects by two simultaneously presented 
conditioned fear stimuli and reinforced them 
by terminating one fear stimulus for the 
low-reinforcement group and both fear stim- 
uli for the high-reinforcement group. An 
interesting variation of this procedure might 
be an audience consisting of representatives 
of various social sectors, such as blue-collar 
and white-collar observers, negro and white 
observers, or male and female observers, 
with reinforcement terminating the observa- 
tion of one or both social sectors. Finally, 
the time interval between the instrumental 
response and the termination of observation 
corresponds to delay of reinforcement (9). 
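
The dictionary of analogies for the independent variables can be written out as a lookup table. The wording of each entry is a paraphrase, the identifiers `RULES` and `social_analog` are hypothetical, and the entry for Rule 6 is inferred from the later derivation concerning the percentage of unobserved trials rather than from a fully legible statement of the rule.

```python
# Rules of Correspondence 1-9: escape conditioning variable -> social analog.
# Entries are paraphrases; Rule 6 is inferred from the "shock schedules"
# derivation (percentage of unobserved trials).
RULES = {
    1: ("reinforced trial",
        "observation begins before the response and is terminated by it"),
    2: ("nonreinforced trial",
        "observation begins before the response but is not terminated by it"),
    3: ("extinction",
        "a series of nonreinforced trials following acquisition"),
    4: ("partial reinforcement",
        "the button push sometimes terminates observation and sometimes does not"),
    5: ("trial with shock omitted",
        "trial not observed by an audience"),
    6: ("intermittent shock schedule",
        "percentage of unobserved trials"),
    7: ("drive intensity",
        "strength of audience-induced drive, varied by the techniques described earlier"),
    8: ("magnitude of reinforcement",
        "size of the reduction in the number of observers following the response"),
    9: ("delay of reinforcement",
        "interval between the response and the termination of observation"),
}


def social_analog(rule_number: int) -> str:
    """Return the social facilitation analog for a numbered rule."""
    return RULES[rule_number][1]


for number, (learning_variable, analog) in RULES.items():
    print(f"({number}) {learning_variable} -> {analog}")
```

Keeping the rules in one table makes it easy to check that each later derivation cites a rule that has actually been stated.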


Dependent Variables: Rules of Correspond- 
ence 

In the preceding sections we have specified 
the kind of learning paradigm to which our 
social facilitation paradigm is analogous,
and we have developed a dictionary of analo- 
gies between learning and social facilitation 
independent variables. The failure to use 
appropriate dependent variables is one of 
the most common failings of social learning 
research. Cottrell (1968) and Weiss (1962, 
1968) have strongly emphasized the import- 
ance of choosing dependent variables which 
are appropriate to the theory to be tested. 
Indeed, "the theory does not have testable
implications until analogies between learning
and social dependent variables are developed
[R. F. Weiss, 1968, p. 111]." In Hullian
learning theory, speed (1/latency) and re-
sistance to extinction are theoretically ap- 
propriate dependent measures of escape 
conditioning, although the extinction mea- 
sure is not very well researched. Latency 
corresponds quite straightforwardly to the 
time between the cue and the button-push 
response in the above example (10). La- 
tency measures have been used successfully 
in social facilitation research, in the usual 
irrelevant drive paradigm (Matlin & Zajonc, 
1968; Zajonc, Heingartner, & Herman,
1969). Changes in speed during trials on
which responding no longer terminates or
decreases observation may be used as mea- 
sures of resistance to extinction, if cor- 
rected for differences in acquisition asymp- 
tote (Anderson, 1963) (11). Alternatively, 
such trials may be given until the subject 
ceases to respond, according to some explicit 
criterion of cessation, and the number of
trials until this criterion is met may be
counted (12).
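
The trials-to-criterion measure of Rule 12 can be sketched as follows. The text requires only "some explicit criterion of cessation"; the particular criterion used here (a run of consecutive trials with no response, or with a latency above a cutoff) and the names `trials_to_criterion`, `max_latency`, and `run_length` are assumptions made for illustration.

```python
from typing import Optional, Sequence


def trials_to_criterion(
    latencies: Sequence[Optional[float]],
    max_latency: float = 30.0,  # assumed cutoff, in seconds
    run_length: int = 3,        # assumed number of consecutive failures
) -> Optional[int]:
    """Return the extinction trial on which the cessation criterion is met.

    A trial counts as a failure to respond when its latency is None (no
    response) or exceeds max_latency. The criterion is met on the trial
    that completes run_length consecutive failures. Returns None when the
    criterion is never met.
    """
    run = 0
    for trial_number, latency in enumerate(latencies, start=1):
        failed = latency is None or latency > max_latency
        run = run + 1 if failed else 0
        if run == run_length:
            return trial_number
    return None


# Invented extinction record: None marks a trial with no response.
record = [2.0, 3.0, None, 5.0, None, None, None]
print(trials_to_criterion(record))
```

Whatever criterion is adopted, it should be fixed before the data are collected so that groups can be compared on the same measure.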


Predictions 


Rules of correspondence have been 
assumed between the variables of escape 
conditioning and the variables of social fa- 
cilitation. Once such rules of correspond- 
ence are assumed, the functional relation- 
ships holding among the variables of the 
conditioning model should also hold among 
the corresponding social facilitation
variables. The following nine derivations are
illustrative, not exhaustive.

Acquisition. In escape conditioning, re- 
sponse speed is an increasing function of the 
number of reinforced trials. Given Rules of
Correspondence 1 and 10, it therefore follows
that speed of the social drive-reducing
instrumental response should be an increasing
function of the number of trials on which
audience observation begins before the but- 
ton push and is terminated by it. 
Extinction. In escape conditioning, re- 
sponse speed is a decreasing function of the 
number of unreinforced trials. Given Rules 
of Correspondence 3 and 10, it therefore fol- 
lows that speed of the social drive-reducing 
instrumental response should be a decreasing
function of the number of audience-observed 
trials on which the button push does not
terminate the observation.
Partial reinforcement. In escape
conditioning, speed is slower under partial
reinforcement than it is under continuous
reinforcement (Bower, 1960). Given Rules of
Correspondence 4 and 10, it therefore
follows that subjects given a mixture of trials
on which the response sometimes terminates
the audience observation will respond more
slowly than subjects whose response always
terminates the observation. It is important
to note that the effects of partial
reinforcement on asymptotic performance in escape
conditioning are the reverse of those
in instrumental reward conditioning, and
serve as one of the several ways of testing
the difference between an aversive and an
appetitive drive.
Shock schedules. [...] Given
Rules of Correspondence 6 and 10, it
therefore follows that the speed of the social
drive-reducing instrumental response is a
decreasing function of the percentage of unobserved
trials.

Magnitude of reinforcement. In escape
conditioning, response speed is an increasing
function of the magnitude of reinforcement
(Bower, Fowler, & Trapold,
1959; McAllister & McAllister, 1967).
Given Rules of Correspondence 8 and 10,
it therefore follows that the speed of the
social drive-reducing instrumental response
should be an increasing function of the
reduction in the number of observers or
observer classes.

Drive intensity. Studies controlling the
magnitude of reinforcement have yielded
conflicting results (Campbell & Kraeling,
1953; McAllister & McAllister, 1965;
Woods, Davidson, & Peters, 1964). However,
when shock is terminated completely
after each escape, there is no doubt that
speed is an increasing function of drive
(Franchina, 1969b; Trapold & Fowler,
...). Given Rules of Correspondence 7
and 10, it therefore follows that speed of an
instrumental response which terminates
audience observation should be an increasing
function of the several drive-increasing
operations discussed near the beginning of the
paper.

Delay of reinforcement. In escape conditioning,
speed is a negatively accelerated
decreasing function of the delay of
reinforcement (..., 1962). Given Rules of
Correspondence 9 and 10,
it therefore follows that speed of the social
drive-reducing instrumental response should
be a negatively accelerated decreasing function
of the delay in termination of observation.


... to determine response speed
(... & Trapold, 1962). Given Rules
of Correspondence 9, 1, and 10, it therefore
follows that the delay in termination
of observation and the number of trials
on which observation is terminated by
the response should combine multiplicatively
to determine the speed of the social drive-
reducing instrumental response. Delay
effects should thus be greatest late in
training.
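
A toy calculation shows why, if the two factors combine multiplicatively, delay effects grow as training proceeds. The functional forms and parameter values below (a negatively accelerated growth curve for trials, an exponential decline for delay) are assumptions chosen purely for illustration and are not forms given in the text.

```python
import math


def trials_factor(n_trials, rate=0.2):
    """Assumed growth of response strength with reinforced trials (0 to 1)."""
    return 1.0 - math.exp(-rate * n_trials)


def delay_factor(delay_s, decay=0.5):
    """Assumed negatively accelerated decline with delay of termination."""
    return math.exp(-decay * delay_s)


def speed(n_trials, delay_s):
    """Multiplicative combination of the two factors (arbitrary units)."""
    return trials_factor(n_trials) * delay_factor(delay_s)


def delay_effect(n_trials, short=0.0, long=4.0):
    """Speed difference between a short and a long delay after n_trials."""
    return speed(n_trials, short) - speed(n_trials, long)


# The short-versus-long delay difference at three points in training.
for n in (2, 10, 40):
    print(n, round(delay_effect(n), 3))
```

Under an additive rule the difference between the two delay conditions would be the same at every stage of training, so the growth of the difference across trials is what distinguishes the multiplicative prediction.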


Interaction: Delay x Magnitude. In
escape conditioning, magnitude of reinforcement
and delay of reinforcement interact
to determine response speed (Woods &
...). Magnitude effects are ...
number of ... and the delay ...
should combine ... the speed of the ...
instrumental response. If audience observation
induced an appetitive drive, a quite different
outcome would be expected (Logan,
1960).


Boundary Conditions 


Any theory is intended to

Since the theory of social facilitation
developed here is based on analogies with
conditioning research and theory, some of the
most essential boundary conditions are also
based on analogies with conditioning.

Conditioning versus response competition. 
In conditioning, the aim of the experi- 
menter is to "limit the investigation to the 
study of the strengthening of a single re- 
sponse" and the experimenter therefore 
attempts "to minimize the occurrence of 
competing responses [Spence, 1956, p. 38].” 
These strictures apply directly to the con- 
ditioning of the socially facilitated instru- 
mental response. The differences between 
drive effects in the single response and com- 
peting response situations are well known 
to social facilitation researchers (e.g., Cot- 
trell et al., 1967; Zajonc et al., 1969). 

Trials and dependent variables. Follow- 
ing Zajonc (1965) and Cottrell (1968), 
the analysis presented here is based on 
the Hullian theory of drive (e.g., Spence, 
1956). As previously noted, appropriate 
dependent variables for testing this theory 
in the instrumental escape conditioning para- 
digm are the speed (1/latency) and resist- 
ance to extinction of the social drive-reduc- 
ing instrumental response. These measures 
are obtained in a discrete-trials paradigm 
such as the previously described example. 
It is not possible, therefore, to test the theory 
presented here by means of a free-respond- 
ing operant type of instrumental learning 
task in which response rate rather than re- 
sponse latency is the dependent variable. 
While the discrete-trials and free-responding 
operant escape paradigms differ from each 
other in essential ways, most of the rules 
of correspondence for independent variables 
can be adapted to a free-responding para- 
digm with relatively little difficulty. The 
investigator who prefers such a free-respond- 
ing paradigm should note, however, the spe- 
cific properties of the free-responding escape 
model (Dinsmoor, 1968) and recall that this 
paradigm is irrelevant to Hullian theories 
of drive. 

In Hullian theory, the amplitude (i.e.,
vigor or forcefulness) of a response has
regularly been used as a measure of the
strength of classically conditioned autonomic
responses (e.g., Hull, 1943). It is worth
explicitly mentioning, therefore, that such 
measures do not provide an index of the 
strength of a response in instrumental con- 
ditioning of motor responses, whether in the 
discrete-trials or free-responding paradigm. 
For example, the amplitude of an instru- 
mental response does not increase over the 
course of acquisition (e.g., Notterman &
Mintz, 1965; Weiss, 1961). 

Drive arousal. The theoretical predictions 
apply only to the case in which the presence 
of an audience increases drive. Cottrell et al. 
(1968) have made it crystal clear that there 
are specifiable conditions under which the 
presence of an audience will not induce drive. 
This boundary applies generally to the drive 
theory of social facilitation, rather than being 
special to the present extensions of it, and 
it seems appropriate, therefore, to keep the 
treatment of the topic in this paper fairly 
brief. If the drive induced by audience ob- 
servation is a learned one, then it follows 
that there are two general conditions under 
which the presence of an audience would be 
sufficient to increase drive, at least one of 
which would be necessary to increase drive. 
First, the audience (including audience be- 
havior) is a conditioned stimulus for a 
learned drive of fear or anticipatory frustra- 
tion (Cottrell, 1968; Lott & Lott, 1968). 
Variations on fear drive might include 
shame, embarrassment, effectance, and,
occasionally, guilt. While not addressed, in 
so many words, to the problem of social 
facilitation, the accounts of the social condi- 
tions for the learning or arousal of such 
drives in Dollard and Miller (1950, espe- 
cially pp. 116 ff, 243-245) as well as in 
Aronfreed (1968, pp. 241 ff.) and Byrne 
(1969) are indispensable for social facilita-
tion research. An evaluative stance on the
part of the audience is the most important 
factor in fear arousal so far isolated in the 
social facilitation situation itself (Cottrell, 
1968; Henchy & Glass, 1968; Klinger, 1969;
Paulus & Murdoch, in press). Variations on 
the conditioned stimulus function of the 
audience include the possibility of audience 
arousal of a cue-producing mediating re-
sponse (“It's a test”) which then serves as
the immediate CS for the learned drive, or
the possibility of the audience serving as
only one part of a compound cue to which
the learned drive is conditioned (so that
evaluative observation of a manual versus an
intellectual task might prove differentially
drive arousing). A second (and secondary)
way in which the audience can increase drive 
is by weakening the effect of an inhibitor of 
drive. Thus, for example, a child in a stress- 
ful situation may inhibit anxiety by sucking 
his thumb, but he can be deterred from suck- 
ing his thumb by the presence of an audience. 
A second example is of wider significance. 
Fear is inhibited by subject control of a 
frightening situation (e.g., Mowrer & Viek,
1948; J. M. Weiss, 1968) and the presence 
of an evaluative audience may have the 
effect of shifting some of the control of the 
situation from the subject to the evaluative 
audience. 

The effects of the presence of others on
drive are not limited to drive increase when
the above conditions are met and no increase
when these conditions are not met. There
are, of course, theoretical reasons for thinking
that under some circumstances the presence
of others can reduce drive (e.g., Bovard,
1959; Byrne, 1969; Dollard & Miller,
1950), and some evidence to support this
is available from both human (Amoroso &
Walters, 1969; Kiesler, 1966) and animal
(e.g., Davitz & Mason, 1955; Morrison &
Thatcher, 1969) studies. The conditions
determining such effects are being elucidated
under such rubrics as interpersonal attraction
(e.g., Byrne, 1969; Lott & Lott, 1968)
and animal gregariousness (e.g., Morrison
& Thatcher, 1969) as well as social facilitation.
While recognizing the relevance of
attraction to social facilitation, it is not
suggested that the two processes are simply
congruent. Thus, for example, while the
presence of people to whom we are attracted
may normally be drive reducing, an evaluative
audience of such people may be particularly
drive arousing, especially when
there is a realistic probability of a deleterious
effect on a continuing association.

Drive reduction. Termination of audience 
observation should reduce drive in a manner 
analogous to termination of shock in escape 
conditioning. Simple and precise control of
such drive reduction leaves much to be desired,
even in the conditioning laboratory.
Thus, in escape conditioning, termination of
the shock does not actually reduce drive to
zero, in part because of perseverative physiological
effects of prior stress, but also in part
because fear (anticipatory of renewed shock)
becomes conditioned to the static cues of the
situation (McAllister & McAllister, 1963).
Similarly, in classical defense conditioning
of human subjects, it is believed that termination
of the noxious airpuff does not
actually reduce drive to zero because fear
(i.e., anticipatory of renewed noxious stimulation)
becomes conditioned to the cues of the
situation (e.g., Spence, Haggard, & Ross,
1958; Trapold & Spence, 1960). An analogous
difficulty can safely be expected in
social facilitation research, where fear,
anticipatory of renewed audience observation,
would prevent termination of audience
observation from reducing drive to zero.
Providing that the main and interactive effects
of such anticipations are not substantially
different from those in escape conditioning,
social facilitation data should prove to resemble
escape conditioning data.

Deception. The deception procedures which
social psychologists have mastered so skillfully
are necessary for the study of conditioning
of voluntary responses in articulate
organisms (e.g., Spence, 1966).


AVOIDANCE AND PUNISHMENT


If subjects will learn to escape audience
observation, they should also learn to avoid
it, if they are given the opportunity. This
opportunity may be given by presenting the
cue for responding before the onset of the
aversive audience observation, and allowing
the instrumental response to prevent the
onset of audience observation. Most of the
rules of correspondence relating the independent
variables of social facilitation to the
independent variables of escape conditioning
can be adapted to the avoidance paradigm
with little difficulty.

The punishment paradigm requires that
the subject has already learned an instrumental
response. The situation may then
be arranged so that the instrumental response
is followed by the onset of the audience
observation. If audience observation
is aversive, this procedure should lead to
the suppression of the instrumental response
rather than to its facilitation. Given
this basic paradigm, specific rules of correspondence
are suggested rather directly by
the previous sections. The properties of the
punishment model are not identical with
those of the avoidance and escape models.
Thus, for example, learning research leads
us to expect that (a) the amount of social
suppression should be an increasing function
of the strength of the audience-induced drive
(e.g., Appel & Peterson, 1965), (b) at least
when the number of punished trials is low,
the amount of social suppression should be
a decreasing function of the delay between
the instrumental response and the onset of
audience observation (e.g., Kamin, 1959),
and (c) a well-learned instrumental response
should be less resistant to social suppression
than one which has received relatively few
rewarded trials (Karsh, 1962; Miller, 1960).


CLASSICAL CONDITIONING


If a previously neutral CS regularly precedes
the onset of an aversive drive, that CS
will acquire the capacity to elicit a learned
drive. This process is usually described in
terms of classical conditioning. If a previously
neutral CS regularly precedes the
onset of audience observation, that CS
should acquire the capacity to elicit a higher
order learned drive. Thus, a previously neutral
cue comes to serve as a symbol for
audience observation. Since response-produced
cues can serve in this capacity, it can
scarcely be doubted that the acquisition of
such symbolic audiences plays a role in the
development of internalized shame. In a
broader context, we see that this form of
social facilitation research affords us a perspective
on a classic problem of social psychology,
which George Herbert Mead
(1934) referred to as “bringing the social
process into individual experience.” The
symbolic audience is, albeit imperfectly, related
to the symbolic “other” of Mead.5


5 To be sure, Mead rejected classical conditioning
as a basis of symbol formation (e.g., pp. 13,
122), but he had in mind Watsonian crudities and
not the Hullian pure stimulus acts which his disciple
and editor, Morris (1946), found so acceptable.
Nonetheless, a more purely Meadian approach
to the problems discussed here would surely involve
role playing of the audience, rather than classical
conditioning to response-produced cues.


Rules of Correspondence

By analogy with the classical conditioning
of learned drives, there are a number of
independent variables which should affect
the strength of the symbolic audience drive.
The number of classical conditioning trials
corresponds to the number of times the CS
precedes audience observation. A nonreinforced
trial corresponds to a trial on which
the CS is presented but not followed by audience
observation; extinction and partial reinforcement
analogies follow obviously. The
procedure in which the UCS is presented
unpaired with the CS corresponds to a procedure
in which audience observation occurs
without being preceded by the CS. The CS-UCS
interval corresponds to the time interval
between the CS and the onset of audience
observation. UCS intensity corresponds to
the strength of the drive aroused by audience
observation, and several specific methods for
varying drive strength have been described
previously. The duration of the UCS corresponds
to the duration of audience observation.
Analogs of CS intensity and CS
change (stimulus generalization) will be
immediately obvious from the ideas already
developed.

The strength of the symbolic audience
drive may be assessed by its effects in any
of the previously discussed drive-testing
paradigms, including escape conditioning,
avoidance conditioning, punishment, and
perhaps most expeditiously by the irrelevant
drive method so effectively used in
existing social facilitation research.


Predictions

Rules of correspondence have been
assumed between the variables of the classical
conditioning of learned drives and the
variables of social facilitation. Theoretically,
the functional relations between the conditioning
variables should also hold between
the corresponding social facilitation variables.
Thus, the social facilitation of a
response by a symbolic audience should be a
function of the various classical conditioning 
operations used in establishing that sym- 
bolic audience; for example, (a) an increas- 
ing function of the number of classical trials 
on which the CS has acquired symbolic 
audience capacity by preceding audience ob- 
servation (e.g., Goldstein, 1960; Kalish, 
1954); (b) an increasing function of the 
strength of the drive aroused by audience 
observation (Goldstein, 1960); (c) a multi-
plicative function of the interaction of the
number of classical trials on which the CS
has acquired symbolic audience capacity by
preceding audience observation with the
strength of the drive aroused by that ob-
servation (e.g., Goldstein, 1960); and (d)
a decreasing function of the number of trials 
on which symbolic audience capacity has 
been extinguished through presentation of 
the CS unfollowed by audience observation 
(e.g., Armus, 1960; Kalish, 1954). More- 
over, (e) symbolic audience capacity should 
generalize to similar stimuli (e.g., Deside- 
rato, 1964; McAllister & McAllister, 1965). 

Even under ideal conditions in the rat 
laboratory, higher order conditioning effects 
are frequently delicate and transitory. Re- 
searchers may thus require some of the pa- 
tience and thoroughness characteristic of 
workers in the conditioning laboratory. To 
be sure, higher order conditioning appears to 
be more easily demonstrated with learned 
drives than with any other response system 
(e.g., Anderson et al., 1969; Kamil, 1969), 
but it is possible that this effect is more of 
a tribute to the power of electric shock as a 
primary UCS than it is to the condition- 
ability of drives per se. 


Classical Conditioning of Attitudes 


Using the evaluative meaning scale of the 
semantic differential as a measure of attitude, 
Staats and Staats (e.g., 1958; Staats, Staats, 
& Heard, 1960) have shown that it is 
possible to classically condition positive and 
negative attitudes toward verbal or other 
stimuli. This procedure is, of course, not 
one of the standard drive-testing paradigms 
since it cannot be applied to nonarticulate
organisms. It seems certain, however, that
this technique will come to be seen as one
of the most important tools of researchers 
in human learning and human social learn- 
ing, since it is ideally suited for approaching 
a host of problems, including those addressed 
in this paper. Using previously neutral 
words as conditioned stimuli in the symbolic 
audience procedure, and negative evaluative 
meaning responses on the semantic differen- 
tial as a measure of the classical CR, it 
should be possible to directly apply the above 
rules of correspondence and the classical con- 
ditioning model to the conditioning of atti- 
tudes toward audience observation. This 
attitude conditioning technique would per- 
mit us to measure symbolic audience phe- 
nomena in an early stage, since the words 
which have acquired emotional meaning may 
then be used as punishers or motivators
(Staats, 1968).


CONCLUSION 


Present research on the drive theory of
social facilitation has been unnecessarily
limited by restricting itself to only one of
the several drive-testing paradigms used in
learning research. These studies are very
valuable, but the full possibilities inherent in
the drive theory of social facilitation will
not be realized until other drive-testing
paradigms are also studied. Extension of
the drive theory of social facilitation to include
such problems as escape and avoidance
of audience observation, as well as the
acquisition of symbolic audiences, makes it
possible to recognize and study more of the
great richness of phenomena inherent in one
of the traditional central problems of social
psychology.



A methodological framework for the analysis of punishment is outlined.
The methodology, which is called a multiple-response base-line procedure,
serves two purposes. First, it raises a number of new questions about
the properties of punishment. Second, it permits the examination of some
untested assumptions found in traditional punishment theory. Initial
evidence obtained with the multiple-response methodology questions
the validity of traditional theoretical assumptions and suggests two
simple rules for predicting the properties of various punishment operations.

When an aversive stimulus is contingent
upon the occurrence of a particular response,
a decrement in the probability of the response
is usually observed. This procedure
is typically called punishment and the decrement
in response probability is called punishment
suppression. The basic purpose of this
paper is to delineate some fundamental problems
with existing punishment theory and to
suggest an alternative approach to the problem
of punishment.


An Overview of Punishment Theory


Historically, there have been two fundamental
assumptions used to explain the phenomenon
of punishment suppression. The
first of these assumptions was the strong
version of the negative Law of Effect proposed
by Thorndike (1913). Thorndike
assumed that any painful or unpleasant event
would weaken the response (or assumed
S-R bond) which preceded that event.
Thorndike (1932) subsequently rejected this
notion and it has not enjoyed any serious
attention since that time. The second fundamental
assumption suggested to account for
the punishment suppression phenomenon has
been referred to as the alternative-response
assumption (cf. Dunham, Mariner, &
Adams, 1969). In its simplest form, the
assumption states that the decrement in a
punished response is caused by an increment
in some alternative behavior.

All contemporary explanations of punishment
suppression are specific elaborations
of this alternative-response assumption.
Those specific elaborations which have been
most formalized fall into two major categories.
These categories are referred to as
single-process and two-process theories of
punishment (cf. Solomon, 1964). The earmark
of the single-process theory is the
assumption that only one type of learning
mechanism is involved in the development
and maintenance of the alternative response
during punishment training. Two types of
single-process theory have been suggested
and are differentiated in terms of suggesting
either a classical or an instrumental conditioning
mechanism. Estes and Skinner
(1941), for example, suggested that emotional
responses elicited by the punishing
event are classically conditioned to stimuli
which precede the punishing event. The
classically conditioned behavior is assumed
to compete with the punished response and
cause the suppression. Miller and Dollard
(1941) exemplify the instrumental conditioning
version of single-process theory.
They suggested that any response which is
associated with the termination of the
punishing stimulus will be instrumentally
conditioned as a response which escapes
pain and competes directly with the punished
response.

Two-process punishment theories specify 
two different learning mechanisms which are 
sequentially involved in the development and 
maintenance of the assumed alternative re-
sponse. Dinsmoor (1954, 1955) and
Mowrer (1947) are formal examples of the 
two-process explanation of punishment sup- 
pression. Dinsmoor suggested that the 
proprioceptive stimulus feedback from the 
punished response acquires secondary aver- 
sive properties via classical pairings with the 
primary aversive event. Any response 
which is instrumental in disrupting this chain 
of conditioned aversive stimulation will de- 
velop and be maintained as a response which 
competes with the punished response. 
Hence, two learning mechanisms, first classi- 
cal then instrumental, are assumed to operate 
in the development of the alternative be- 
havior. Mowrer’s (1947) version of two- 
process theory substitutes the notion of con- 
ditioned fear for the notion of aversive 
stimulation conditioned in the Dinsmoor 
theory. 

There are two basic implications of any
version of the alternative response theory.
First, there is the implication that some
alternative behavior will develop and be
maintained during punishment training.
Second, there is the implication that this
alternative response causes the reduction 
in the punished response. Presumably the 
former implication could be confirmed inde- 
pendent of the latter. But the latter could 
not be confirmed if the former were false. 
In spite of the substantial amount of re- 
search on punishment in the last decade, 
there is no direct evidence to support either 
of the above implications of the alterna- 
tive-response assumption. As Azrin and 
Holz (1966) have stated, the typical pro- 
cedure in punishment research has been 
to infer the presence of the alternative re- 
sponse from the absence of the punished 
response. A minimal requirement for test- 
ing the assumption is the measurement of 
the alternative response independent of the 
phenomenon which it seeks to explain. 


Two reasons can be suggested for the 
lack of direct evidence bearing on the alter- 
native-response assumption. In the case of 
single-process theories, contingencies are 
specified which make the response elicited 
by the punishing event a prime candidate for 
conditioning during the suppression of the 
punished response. It is not a profound ob- 
servation to note that punishment procedures 
which have traditionally been employed in- 
volve organisms, aversive stimuli, and ap- 
paratus which make it difficult to measure 
those responses which are suspected to par- 
ticipate in the relevant contingency. These 
“emotional” behaviors have not been re-
corded on impulse counters and this has pre-
vented the accumulation of any evidence con-
cerning changes in their probability during
punishment training. The major problem 
with single-process versions of the alterna- 
tive-response assumption at this point would 
appear to be the lack of an adequate method- 
ology to test what would appear to be very 
testable implications. 

With respect to the two-process theories, 
the problem is more serious. As Schuster 
and Rachlin (1968) have suggested:


Because both the reinforcer and the response are 
unobserved and unobservable, the two factor theory 
of punishment poses a serious problem for the 
experimenter who wishes to test it: how can it be 
disproved? All the critical events are assumed to 
occur within the organism being punished [p. 784]. 


In addition to the specification of the critical 
contingencies “inside” the organism, it
should be noted that any increase in some 
alternative behavior observed during punish- 
ment can be taken as support for the opera- 
tion of the assumed internal contingency. 
Hence, the measurement of the unobserved 
behavior referred to by Schuster and Rachlin 
does not make the positions any more sus- 
ceptible to disproof. 

The picture which emerges from this brief 
overview of punishment theory and research 
is that there has been a lack of interaction 
between punishment theory and punishment 
data. The lack of any data relevant to the 
most n re theoretical assumptions 
has permitted alternative response inter-
pretations to persist as originally formulated.

In the next section of the discussion, a
methodological approach to punishment will 
be described which can serve two functions 
in the context of the existing punishment 
literature. First, the methodology permits 
us to examine some questions about the 
properties of punishment which have not 
previously been considered. Second, it 
permits us to examine some previously un- 
tested implications of existing punishment 
theory. 


A Methodological Approach to Punishment 


The methodological approach to be sug- 
gested is called a multiple-response base-line 
procedure. It can be applied to a variety of 
problems in addition to punishment, and 
when viewed in the context of traditional 
methodology, it falls between a typical oper- 
ant procedure, in which a single response is 
shaped under constraints, and typical etho- 
logical procedures, in which behavior is ob- 
served without external laboratory con- 
straints. The most convenient way to de- 
scribe the essential features of the methodol- 
ogy is to elaborate on a specific hypothetical 
example which is representative of several 
obvious variations on the basic approach. 
The hypothetical example should be noted 
with some care since it will be approximated 
in reality when experimental evidence is sub- 
sequently discussed. 

Consider a small animal chamber with
grid floor and three sources of enjoyment
for the small rodent commonly called a Mongolian
gerbil (Meriones unguiculatus). The
three items of interest in the chamber are a
food bin with an unlimited supply of standard
Noyes pellets, a drinking tube with an
unlimited supply of water, and adding machine
paper which is threaded through a
slot in the wall of the chamber.

The reader familiar with the behavior of 
this curious rodent will not be surprised to 
find that the gerbil will spend much of a 
half-hour daily session in the chamber shred- 
ding the adding machine paper, eating the 
food pellets, and drinking from the tube, 
in that order of preference. It is relatively
easy to record the duration of each of these
three behaviors during the session and con- 
vert these duration measures to a scale of 


response probability by dividing the amount 
of time spent in a particular response state 
by the total time possible (cf. Premack, 
1965). Any type of behavior can be mea-
sured in terms of its duration, including the
class of behavior which is labeled “doing
nothing.” With the appropriate manipu-
landa for a particular organism and control
of the relevant parameters, a multiple-re- 
sponse steady-state base line of behavior, 
in which several measurable responses are 
observed to fill experimental time, can be 
established. No shaping or active manipula-
tion of contingencies is assumed to be nec-
essary. Assume, for the purposes of discus-
sion, that our gerbil cooperated and filled
the half hour with drinking (p = .1), eating 
(p = .3), and paper shredding (p = .6). 
Once the multiple-response base line has 
stabilized, we are in the interesting position 
of actively manipulating the organism's en- 
vironment and assessing the effects of such 
manipulations on all of the responses in the 
repertoire. Obviously, the manipulations of 
most interest in the present context are those 
which we call punishment operations; how- 
ever, it is instructive to digress briefly and 
consider a simple manipulation like making 
a running wheel available. The most visible 
effect of making the running wheel available 
will be introduction of a running response 
into a repertoire of behavior which already 
fills experimental time. If running occurs,
by definition, there will be a decrement in 
the observed probability of one or more of 
the existing responses in the repertoire. 
Curiously, there is little more than intuition
to tell us how the organism will reorganize
his response hierarchy to accommodate
the running response. Will he sacrifice
a little bit of each of the existing behaviors? 
Will he select one response and sacrifice it 
for running privileges? If the latter, what 
is the rule of response selection? 
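
The duration-to-probability conversion and the two reorganization possibilities raised above can be sketched in a few lines of Python. The sketch is illustrative only: the function names, the running probability of .2, and the two candidate reallocation rules are assumptions introduced here, not procedures or results reported in this article.

```python
SESSION_SECONDS = 30 * 60  # half-hour daily session


def response_probabilities(durations, total=SESSION_SECONDS):
    """Divide the time spent in each response state by the total time possible."""
    return {name: seconds / total for name, seconds in durations.items()}


def sacrifice_proportionally(base, new_name, new_p):
    """Hypothetical rule 1: every existing response gives up the same fraction.

    Assumes the base-line probabilities already sum to 1 (time is filled).
    """
    scaled = {name: p * (1.0 - new_p) for name, p in base.items()}
    scaled[new_name] = new_p
    return scaled


def sacrifice_one(base, new_name, new_p, victim):
    """Hypothetical rule 2: a single existing response pays the whole cost."""
    if base[victim] < new_p:
        raise ValueError("victim response cannot absorb the new response")
    changed = dict(base)
    changed[victim] = base[victim] - new_p
    changed[new_name] = new_p
    return changed


# Durations (seconds) matching the example: p = .1, .3, and .6.
durations = {"drinking": 180, "eating": 540, "paper shredding": 1080}
base = response_probabilities(durations)

# Assumed value: suppose running comes to occupy 20% of the session.
for label, result in [
    ("proportional", sacrifice_proportionally(base, "running", 0.2)),
    ("single response", sacrifice_one(base, "running", 0.2, "paper shredding")),
]:
    print(label, {name: round(p, 2) for name, p in result.items()})
```

Either rule keeps the probabilities summing to 1; which rule, if either, describes the organism is the empirical question posed in the text.
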
If an aversive stimulus like electric shock
is introduced, one is faced with roughly the 
same problem as that posed by the introduc- 
tion of the running wheel. Shock, as an un- 
conditioned stimulus, will define a certain 
probability of unconditioned behavior which 
must be assimilated into a response hierarchy.
By definition, some decrement in the
probability of one or more of the existing
responses must take place. Again, we are
not sure how the organism alters the existing
preference structure to accommodate this additional
behavior, but the rules which describe
such alterations would be of importance in 
predicting the immediate effects of any va- 
riety of punishment operation on responses 
in the repertoire. 

The point to be made with the two pre- 
ceding examples is that some manipulations 
which can be used in the context of a multi- 
ple-response base line have the property of 
adding a response to the existing repertoire 
of behavior. Punishment procedures are one 
such manipulation, and it is suggested that 
procedures be arranged which permit the 
a priori specification of the response which 
will be added and subsequent measurement 
of that response during punishment training. 
In this respect, the selection of the gerbil 
was fortuitous. The response which we have 
observed to be associated with the introduc- 
tion of shock is a vigorous biting and chew- 
ing of the grid floor. As reliable, if not 
as desirable, is an aggressive attack on 
another gerbil if the target animal is pro- 
vided. 

When one considers the variety of ways 
in which an aversive event like electric shock 
can be introduced into a multiple-response 
procedure, a number of empirical questions 
are generated which have not received pre- 
vious experimental attention. A brief con- 
sideration of a few of these questions will 
illustrate the heuristic value of the multiple- 
response base-line procedure. 

The traditional literature dealing with 
shock provides us with two major classes 
of operation which are called punishment 
and avoidance, with variations on each 
theme. In the context of the multiple-re- 
sponse base-line procedure, an alternative 
operational dichotomy is suggested. Spe- 
cifically, the shock event can be delivered 
according to a program which makes reference
to one or more responses in the repertoire,
or according to a program without
reference to the organism’s response reper- 
toire. The former includes traditional re- 
sponse-contingent punishment and Sidman 
avoidance operations; the latter includes
various temporal schedules which are
arranged independent of the organism's be- 
havior. In future discussion, the terms 
response contingent and noncontingent will 
be used to describe these generic operational 
categories. Some measure of the compre- 
hensiveness of any theoretical framework is 
provided by its ability to subsume data in 
both operational categories. Several of these 
operations, most often employed in single- 
response operant research, reveal interest- 
ing new dimensions when considered in the 
multiple-response context. 

Consider the response-contingent punish- 
ment operation where one response is se- 
lected from the repertoire and the onset of 
that response is followed immediately by 
an instance of shock. The suppressive 
effects of this operation on the referent be- 
havior are well known. However, the opera- 
tion defines a version of Sidman avoidance 
contingency for all other responses in the 
organism's repertoire, including the response 
elicited by the shock event. In the gerbil ex- 
ample, response-contingent shock for eating 
also defines a very effective Sidman avoid- 
ance contingency for paper shredding. The 
more time spent paper shredding, the longer 
the interval between shocks and the fewer 
shocks received. Does the gerbil adjust his 
behavior in a manner suggested by the 
contingency?

Consider a fixed-interval noncontingent 
shock procedure. For a given density of 
shock, the probability of each response in
the organism's repertoire will determine the
number of shocks which are associated with
that particular response. In the gerbil example,
paper shredding occupies over half
the experimental session, hence more shocks 
will arrive during paper shredding than dur- 
ing any other response in the repertoire. 
Of more interest, with shocks delivered at
fixed intervals of time, that response which 
most consistently follows the shock event 
will most consistently predict the longest 
interval of “safety.” Again, one must ask if 
the organism changes his behavior accord- 
ing to these implicit contingencies defined by 
the fixed-interval noncontingent schedule.
A phenomenon reported initially by Morse,
Mead, and Kelleher (1967) is perhaps relevant.
These investigators reported that
monkeys which are exposed to fixed-inter- 
val noncontingent shock will aggressively 
bite a rubber tube made available after each 
shock is delivered. The postshock elicited 
biting behavior would be the response in the 
organism's repertoire which most consist- 
ently predicts the longest interval of safety. 
As suggested by the preceding analysis, the 
animal changes its biting behavior accord- 
ingly. The amount of biting behavior in- 
creases during extended punishment train- 
ing and eventually fills the shock-shock 
interval with a "scallop" in the rate of biting 
which appears at the end of the interval. 
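
The arithmetic behind the first point in this paragraph can be made explicit. Under a noncontingent schedule, and assuming that shock times are unrelated to what the animal happens to be doing, the expected number of shocks arriving during a response is its probability times the number of shocks delivered. The figure of 60 shocks per session and the function name are assumptions chosen for illustration.

```python
def expected_shocks(probabilities, shocks_per_session):
    """Expected shocks coinciding with each response when delivery ignores behavior."""
    return {name: p * shocks_per_session for name, p in probabilities.items()}


# Base-line probabilities from the hypothetical gerbil example.
base = {"drinking": 0.1, "eating": 0.3, "paper shredding": 0.6}

# 60 shocks per session is an assumed density, not a value from the article.
shocks = expected_shocks(base, shocks_per_session=60)
for name, count in shocks.items():
    print(f"{name}: {count:.0f} shocks expected")
```

The estimate holds only for the base-line hierarchy; once shock-elicited behavior enters the repertoire, the probabilities themselves shift.
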
Consider also a Sidman avoidance pro- 
cedure with a constant shock-shock interval 
and constant response-shock interval. Once 
we select a single response from the multi- 
ple-response base line as an avoidance re- 
sponse, all other responses in the repertoire 
are initially punished according to a non- 
contingent shock schedule, with the shortest 
possible intershock interval being the Sid- 
man shock-shock interval. Which of the 
responses in the organism's repertoire should 
one select for the avoidance response for 
efficient learning? It would seem obvious 
that the response elicited by the shock would 
enjoy some advantages not enjoyed by the 
other responses in the repertoire. First, that 
response will predict the longest safe interval 
between shocks even if the avoidance con- 
tingency were not in effect. Second, it has 
a relatively high probability during initial 
training sessions, hence it will sample an 
implicit avoidance contingency quite often.
Third, if it is explicitly given the avoidance
property of delaying shock, there should be 
few responses which will compete with it for 
rapid learning. This suggestion is in line 
with Bolles’ (1970) view that the organism's 
innate species-specific defense reactions 
(SSDRs) to an aversive stimulus are most 
readily learned as avoidance responses. I 
would suggest, however, that the rapid learn- 
ing has nothing to do with their “innate 
defensive” properties. It is the property of 
immediately following the shock event which 
optimizes the conditions for avoidance learn- 
ing—that is, not only does the response 
eventually reduce shock frequency, but it
initially predicts safety for a longer period
of time than any other response in the 
organism's repertoire. 

The converse situation can also be sug- 
gested. If we give the elicited response 
the initial advantage of consistently pre- 
dicting the longest safe interval of time, yet 
attempt to train a different response as an 
avoidance behavior, it would not be sur- 
prising to observe the animal attempting ini- 
tially to develop the elicited behavior to the 
detriment of the avoidance response. This 
would be particularly true in the case of 
constant shock-shock intervals (e.g., the
Morse, Mead, & Kelleher, 1967, phenomenon),
as opposed to variable shock-shock
intervals where the predictive properties
of the elicited response are less evident.
This also leads to the suggestion that a
variable shock-shock interval will produce
faster learning of avoidance behavior when
the avoidance response is other than the
shock-elicited response (cf. Bolles & Popp,
1964).

Finally, it is of some interest to consider
the variations in temporal schedule which
are possible with noncontingent shock. We
have already discussed the fixed-interval 
case; now consider some implicit contingen- 
cies established by variable-interval cases. 
When programming shocks to occur at vary- 
ing intervals in time, the distribution of 
intervals may be an important consideration.
The typical variable-interval tape program 
is a rectangular distribution of intervals with 
some guaranteed minimum interval. It is
typically described in terms of its arithmetic 
mean and delivers the shocks with a random 
sequence of intershock intervals. Of more
interest is an exponential distribution of 
intervals which will deliver the same num- 
ber of shocks at varying intervals, but is a 
continuous distribution. The probability of 
shock at any “moment” in time is a fixed 
value and the animal has no programmed
interval which is guaranteed to be safe. In 
typical punishment studies where a single 
operant response is shaped, trained, and 
measured, the effects of shocks delivered 
according to fixed intervals, rectangular distributions
of variable intervals, and exponential
distributions of variable intervals may
be the same: the operant is suppressed.
However, in the multiple-response proced- 
ure, very different effects may be found with 
the three distributions. As mentioned ear- 
lier, the elicited response enjoys the advant- 
age of predicting a safe period most con- 
sistently in the fixed-interval shock schedule. 
This would be true to a lesser degree of 
variable-interval schedules with rectangular 
distributions. The elicited response would 
most consistently predict the minimum inter- 
val of safety programmed in that distri- 
bution. In the exponential distribution, 
however, no response consistently predicts 
a minimum safe interval. According to the 
program, shock is equally probable at any 
point in time. This removes those implicit 
contingencies for certain responses which are 
most evident in fixed-interval schedules. 
Again the question which must be asked is 
whether or not the organism adjusts his be- 
havior according to the presence and absence 
of these implicit contingencies. 
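
The contrast among the three schedules can be illustrated with a short simulation. This is a sketch under stated assumptions: the 30-second mean, the 10-second minimum of the rectangular distribution, and the function names are hypothetical, and Python's standard `random` module stands in for the tape programmer.

```python
import random


def fixed_intervals(mean, n):
    """Fixed-interval schedule: every intershock interval equals the mean."""
    return [mean] * n


def rectangular_intervals(mean, minimum, n, rng):
    """Rectangular (uniform) distribution with a guaranteed minimum interval."""
    maximum = 2 * mean - minimum  # keeps the arithmetic mean at `mean`
    return [rng.uniform(minimum, maximum) for _ in range(n)]


def exponential_intervals(mean, n, rng):
    """Exponential distribution: constant shock probability at every moment."""
    return [rng.expovariate(1.0 / mean) for _ in range(n)]


def guaranteed_safe_interval(schedule, mean, minimum=0.0):
    """Shortest interval the program itself guarantees to follow a shock."""
    return {"fixed": mean, "rectangular": minimum, "exponential": 0.0}[schedule]


rng = random.Random(1971)  # seeded so the sketch is repeatable
mean, minimum, n = 30.0, 10.0, 1000  # assumed values, in seconds
samples = {
    "fixed": fixed_intervals(mean, n),
    "rectangular": rectangular_intervals(mean, minimum, n, rng),
    "exponential": exponential_intervals(mean, n, rng),
}
for name, intervals in samples.items():
    print(
        f"{name:12s} mean={sum(intervals) / n:5.1f}  "
        f"shortest observed={min(intervals):5.1f}  "
        f"guaranteed={guaranteed_safe_interval(name, mean, minimum):5.1f}"
    )
```

All three programs deliver shocks at about the same average density, but only the first two guarantee any safe period after a shock, which is the property the elicited response could come to predict.
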

The preceding examples are intended to 
illustrate two points: first, the heuristic 
value of the multiple-response analysis in 
terms of generating testable questions; sec- 
ond, the sparsity of evidence relevant to 
these questions. This sparsity is understood 
when one recognizes the emphasis which has 
traditionally been placed on single-response 
measurement in both free operant and dis- 
crete-trial punishment research. 

The discussion of the multiple-response 
methodology can be concluded with a brief 
consideration of the predictions which single- 
process and two-process punishment theories 
would make in the context of the hypo- 
thetical gerbil procedure. With respect to 
single-process theory, the use of any type 
of punishment operation will introduce the 
grid-biting response into the organism's re- 
sponse repertoire. This grid-biting response 
should be a prime candidate for both classical 
conditioning (Estes & Skinner, 1941) and 
instrumental escape conditioning (Miller & 
Dollard, 1941) since it follows the shock 
event more often than other responses. The 
predictions of the two-process versions of the 
alternative-response assumption must be con- 
sidered separately in terms of predictions 
about the effects of contingent and noncontingent
shock procedures. Basically, the
two-process theories suggest that any response
which disrupts the chain of discriminative
stimulus conditions which precede
the shock event will increase in probability
and be maintained during punishment
training. In the case of response-contingent 
punishment, these theories make the general 
prediction that any response other than the 
punished response might increase in prob- 
ability. They do not provide us with a re- 
sponse-selection rule to tell us if one or all 
of the unpunished responses increases. In 
the case of noncontingent shock operations, 
the two-process theories fail to make any 
meaningful prediction. When a nonconting- 
ent shock operation is employed, every re- 
sponse in the organism's response repertoire 
will be part of a discriminative chain of stim- 
uli which terminates with shock on some 
occasions. Hence, it is impossible to suggest 
that any response in the repertoire partici- 
pates in an instrumental contingency which 
disrupts the chain of conditioned aversive 
stimulation.

Some Rules for Prediction

The predictions which the single-process
and two-process explanations of punishment
suppression make in the context of the multiple-response
methodology have been discussed
in the preceding section. Prior to
considering some evidence, I would like to
suggest some alternative rules for predicting
the effects of a variety of punishment operations
on the various behaviors measured in
the multiple-response procedure. The rules
are, at this point, tentative, and a systematic
analysis of punishment in the context of a
multiple-response base line may modify or
contraindicate these initial suggestions.

Once the immediate change in performance
caused by the introduction of an unconditioned
behavior into the repertoire has
taken place (shock is introduced), the organism
has a hierarchy of responses described
in terms of response probability. The
two rules which attempt to predict the fate
of each of the responses in the organism's
repertoire during subsequent aversive training
are the following: (a) That particular
response in the organism's repertoire which
is most frequently associated with shock
onset and/or predicts the onset of shock 
within a shorter time than other responses 
will decrease in probability and remain be- 
low its operant base line. (b) That par- 
ticular response in the organism's repertoire 
which is most frequently associated with the 
absence of shock onset and/or predicts the 
absence of shock onset for a longer period 
of time than other responses will increase 
in probability and remain above its operant 
base line. 

Consider the spirit in which these two 
rules are formulated. I am suggesting that 
there are two basic contingencies of import- 
ance in any operation for delivering the 
shock. First, there is an instrumental 
punishment contingency. It has two im- 
portant dimensions. A response can be more 
frequently associated with shock onset than 
other responses, or it can predict a given 
frequency of shock onset within a shorter 
period of time, or both. Thus, the two di- 
mensions of the contingency are frequency 
and time. Second, there is an instrumental 
avoidance contingency. It also has two 
important dimensions. A response can be 
more frequently associated with the absence 
of shock onset, or can predict the absence of 
shock onset for a longer period of time, or 
both. 
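
As a rough illustration of how the two rules might be applied to session records, the sketch below ranks responses on the frequency and time dimensions. Everything in it is hypothetical: the record format, the numbers, and in particular the decision to consult frequency first and use time only to break ties, since the rules as stated do not say how the two dimensions combine.

```python
from dataclasses import dataclass


@dataclass
class ResponseRecord:
    name: str
    shock_onsets: int          # shocks whose onset coincided with the response
    occurrences: int           # times the response was initiated
    mean_time_to_shock: float  # seconds from response onset to the next shock


def shock_rate(record):
    """Frequency dimension: proportion of occurrences associated with shock onset."""
    return record.shock_onsets / record.occurrences


def predicted_to_decrease(records):
    """Rule (a): most often shocked, and shocked soonest when rates tie."""
    return max(records, key=lambda r: (shock_rate(r), -r.mean_time_to_shock)).name


def predicted_to_increase(records):
    """Rule (b): least often shocked, and safe longest when rates tie."""
    return min(records, key=lambda r: (shock_rate(r), -r.mean_time_to_shock)).name


# Invented numbers for a session in which eating is the punished response.
session = [
    ResponseRecord("eating", shock_onsets=20, occurrences=20, mean_time_to_shock=0.0),
    ResponseRecord("drinking", shock_onsets=0, occurrences=5, mean_time_to_shock=40.0),
    ResponseRecord("paper shredding", shock_onsets=0, occurrences=30, mean_time_to_shock=90.0),
    ResponseRecord("grid biting", shock_onsets=0, occurrences=20, mean_time_to_shock=120.0),
]
print("decrease:", predicted_to_decrease(session))
print("increase:", predicted_to_increase(session))
```

The sketch ignores how often each response samples the contingency, so its selection among the unpunished responses depends entirely on the invented time values.
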

In traditional single-response operant me- 
thodology, the time not spent on the operant 
is usually assigned the label of “nonresponse” 
and is assumed to be a homogeneous mass of 
behaviors (cf. Rachlin & Herrnstein, 1969).
In spite of their tentative nature, the two 
rules described above should make it obvious 
that it may be of more value to recognize 
the “heterogeneity” of the nonresponse class
—at least in terms of response probability 
differences. Further empirical analysis may 
reveal such factors as the sequential depend- 
encies between different responses to be an 
important determinant of the organism's ad- 
justment to aversive contingencies. This 
criticism of the single-response operant ap- 
proach is equally applicable to the procedures 
which employ appetitive contingencies. 


Some Evidence

[The evidence] consists of a series of successive approximations
to the multiple-response methodology
which has been discussed throughout this 
paper. Many of the thoughts developed in 
the preceding discussion of punishment orig- 
inated with a serendipitous observation made 
while studying the effects of punishment on 
key pecking in pigeons (Dunham et al., 
1969). Dunham et al. observed that pigeons 
trained to peck a response key for grain on 
a variable-interval schedule missed the key 
and hit the wall area adjacent to the key with 
a certain number of pecks. The introduction 
of key-peck response-contingent punishment 
suppressed key pecking and increased the
frequency of off-key pecks during punishment.
These results suggested that an unpunished
response, other than the shock-elicited
response, will increase in probability
and be maintained during punishment
training.

Subsequently, a more deliberate approximation
to the multiple-response methodology
was attempted, taking advantage of the phenomenon
of schedule-induced polydipsia (cf.
Falk, 1966) in order to establish two steady-state
responses which occupied a large portion
of experimental time. Falk (1966)
reported that rats trained to lever press for
food pellets on a variable-interval schedule
will indulge in an excessive amount of drinking
during an experimental session if a
drinking tube is made freely available. Using
Falk’s standard procedure, two rats were
trained one hour each day to the point of
polydipsic drinking, and a mild .2-milliampere,
.5-second shock was introduced which
was contingent on the lever press response
(Subject 1) or on the drinking response
(Subject 2). With the relatively mild shock
intensity to minimize the probability of the
unconditioned (and unmeasured) responses
elicited by shock, it was assumed that drinking
would be the most probable unpunished
alternative if lever pressing were punished,
and that lever pressing would be the most
probable unpunished alternative if drinking
were punished. According to the rules outlined
in earlier discussion, the punished response
was predicted to be suppressed, and
the most probable unpunished alternative in
the response repertoire was predicted to in- 
crease in probability following some degree 
of initial disruption by the addition of shock- 
elicited behavior to the response repertoire. 
The results of the initial polydipsia train- 
ing phase, the first punishment training 
phase, a recovery phase, and a second 
punishment training phase are illustrated in 
Figure 1. During initial training, the 
animals developed polydipsic levels of water 
intake similar to those usually reported by 
Falk (1966). In the initial punishment 
phase, Subject 1 was shocked for lever press- 
ing and Subject 2 was shocked for drinking. 
In both cases, the punished response was 
observed to be suppressed immediately and 
the unpunished alternative behavior was ob- 
served, following initial disruption, to in- 
crease to levels which exceeded the estab- 
lished prepunishment base-line level. The 
results obtained with Subject 1 were very 
surprising. In spite of the aberrant (poly- 
dipsic) base line of water intake prior to 
punishment, the response was observed to 
exceed that base line during punishment ses- 
sions. In the third phase of the procedure, 
the punishment contingencies were removed 
and recovery was observed for 14 sessions. 
With the removal of shock, all responses in- 
creased to levels even higher than preceding 
base lines. This overshoot following removal
of punishment is typical of the recovery
of punished responses (cf. Azrin & Holz,
1966). It is interesting to note that overshoot
occurs with both the punished and unpunished
alternatives, which were measured
in the present procedure. Following the 14 
days of recovery, punishment contingencies 
were again introduced—this time for the re- 
sponse which had not been punished during 
the first punishment phase. Basically the
same phenomena observed during the first
punishment phase were observed during the
second. The punished responses were suppressed
and the unpunished alternatives were dis- 
rupted and then increased in probability. 
Unfortunately the increase in drinking ob- 
served in Subject 2 did not consistently 
exceed the prepunishment base line. This 
may be the result of the ceiling problem on 


The final line of evidence from the au- 
thor’s laboratory work to date was an 
attempt to approximate the hypothetical ger- 
bil procedure used as an example throughout 
this article. There were several reasons to 
extend experimentation beyond the case of 
two highly probable responses to three or 
more measured responses. First, the tenta- 
tive rules specify that only one of the several 
unpunished alternative responses will increase
in probability during punishment
training. In order to test this prediction, 
more than one alternative must be measured. 
Second, a test of single-process theories is 
possible only if the unconditioned response 
introduced by the shock event is measured. 
Third, an alternative approach to the data 
in the polydipsia experiment would be the 
suggestion that the shock has some general 
arousal property which increases all unpunished
responding. The demonstration
of a single-response increase would make 
such an arousal mechanism less appealing 
than the type of contingency analysis sug- 
gested by the tentative rules. 

To answer these questions, Kennedy 
Muyesu-Kaisha Munavi conducted a bur- 
densome experiment in the author's labora- 
tory which was a very close approximation 
to the hypothetical gerbil procedure described
earlier. Nine gerbils were randomly
assigned to one of three groups. Each gerbil
was placed in a small response chamber
with food bin, drinking tube, and adding 
machine paper for a daily half-hour session. 
The total duration of each of the three behaviors
was recorded on elapsed-time meters
by an observer looking at the animal
through a one-way glass window. When a
gerbil entered the chamber, it had been permitted
access to only 80% of its normal
ad lib intake of laboratory chow during the
preceding 23½ hours, and the only water
permitted was that available during the experimental
session. Under these conditions
the gerbils managed to fill approximately
25 of the 30 minutes available each session
with paper-shredding, eating, and drinking,
in that order of preference. It should be
noted that records were kept of the amount
of food and water consumed during base-line
observations. Subsequently, [if eating]
was the punished behavior, the animal was
permitted to make up any deficit in food 
intake during the experimental session in 
its daily ration one hour after the experi- 
mental session. Hence, there was no con- 
founding of changes in either the total daily 
food intake or water intake with the intro- 
duction of the shock (cf. Dunham & Kilps, 
1969). 

After a base line of three behaviors was 
established, Group E was punished for eat- 
ing, Group P for paper shredding, and 
Group D for drinking. The shock was a 
.2-milliampere, .5-second shock delivered at 
the onset of the referent behavior and con- 
tinued at 2-second intervals until the referent 
response was stopped. At this point in the 
experimentation with gerbils, it was not 
known what elicited behavior or behaviors 
would be introduced with this shock in- 
tensity. After running four animals through 
the first session of punishment it was obvious 
that all animals were attacking the grid bars 
through which the shock was delivered. At 
this point a record was started of the dura- 
tion of grid biting as a response which con- 
sistently occurred with the shock in all nine 
animals. Hence, we have the probability of 
grid biting during the first punishment 
session for only five of the nine subjects in
this initial experiment. After the first
session, grid biting was measured along with
the three other responses during each session.

FIG. 1. ... responses plotted in two-session blocks ... (S-2) in the polydipsia procedure.

Prior to examining the results of the ex- 
periment, it is of some value to consider the 
predictions made by the rules when applied 
to this situation: (a) the rules suggest that 
the response most frequently and immediately
associated with shock will decrease in
probability and remain below its operant 
base line; it is the referent punished response 
in the response-contingent procedure; (b) 
there will be an immediate disruption of all 
responses in the repertoire immediately 
upon the introduction of the grid-biting response
into the three-response repertoire
which already nearly fills time; (c) after 
the initial disruption, the response in the 
repertoire which is most frequently asso- 
ciated with the absence of shock for the 
longest period of time will increase in prob- 
ability to levels which exceed the prepunish- 
ment base line. This response should be that 
response which is most probable on the
assumption that the most probable behavior 
will sample the implicit avoidance conting- 
ency most often. 

Subsidiary predictions are also implicit in 
the rules. For example, if grid biting is not 
the most probable unpunished alternative be- 
havior, it will drop out of the repertoire as 
the shock frequency declines (punished response
suppressed). The latter suggestion
is contrary to the predictions of the traditional
single-process punishment theories.
If, however, the grid biting response is the
most probable of the unpunished alternatives,
it will be maintained even in the absence of
the shock UCS during punishment training.

FIG. 2. Probability of punished and unpunished responses for the three subjects in each of the three
groups in the gerbil procedure. (Note that all data points are two-session blocks with the exception
of the first punishment session which is plotted as a single-session point.)
(Legend: paper-shredding, grid-biting, eating, drinking; ordinate: response probability; abscissa: two-session blocks.)

The results of the gerbil experiment for 
each subject in each of the three groups are 
presented in Figure 2. 

Consider first the two results which are 
common to all three groups and which are 
of primary interest in terms of the contingency
rules. First, in all cases, the instrumental
punishment contingency was sufficient
to suppress the referent behavior.
Second, and of more interest, in all cases,
only one of the alternative responses in the
repertoire was observed to increase in probability
to levels which exceeded the base-line
probability. These two results are taken
as support for the operation of the con- 
tingencies outlined in preceding discussion. 
Consider next the results specific to each 
group. Of particular interest in this con- 
text is the response which is selected to in- 
crease in probability during the punishment 
training. The subjects in Group E were 
punished for eating, and all three of the 
subjects were observed to increase the prob- 
ability of paper shredding following an initial 
period of disruption. The subjects in Group 
P were punished for paper shredding, and 
in all three subjects the probability of grid 
biting increased during punishment training. 
Group D was punished for drinking and revealed
inconsistent results. Subject 1 and
Subject 2 increased the probability of paper 
shredding, while Subject 3 increased the 
probability of grid biting during punishment. 

To what extent are the responses observed 
to increase in each group predicted by the 
contingency rules? The rules suggest that 
the response which most frequently samples 
the avoidance contingency in early sessions 
will be the most probable of the safe alterna- 
tives. In spite of the fact that the probability 
of the grid-biting response was not measured 
during the first session in four of the animals 
studied, it is safe to assume that two re- 
sponses were the most probable unpunished 
behaviors during early punishment sessions. 
These are grid-biting and paper-shredding 
responses. Both would sample the avoid- 
ance contingency more frequently, and the 
grid-biting response which followed the shock 
event would predict the absence of shock for 
a longer period of time more consistently 
than the other responses which were not 
sequentially dependent on the shock event. 
For these reasons, we would expect all sub- 
jects to reveal an increase in either grid 
biting or paper shredding. However, there 
were some instances in which grid biting was 
initially more probable than paper shredding, 
yet paper shredding was observed to increase 
to levels above base line as grid biting drop- 
ped out (e.g., Group E, Subject 1). Fur- 
ther research, in which more precise mea- 
surements are obtained of the degree to 
which various responses sample both the 
frequency and time dimensions of the two 
contingencies, will provide a more critical 
test of the suggested rules and perhaps suggest
refinements.

In addition to suggesting that the me- 
thodology and the contingency analysis are 
tractable approaches to the punishment prob- 
lem, the gerbil data question the validity of 
the single-process versions of the alterna- 
tive-response assumption. Stated quite 
simply, those cases in the present situation 
which revealed a gradual decline in the prob- 
ability of grid biting over the course of 
punishment training are exactly the op- 
posite to changes expected from traditional 
single-process assumptions. This is consistent
with observations of shock-associated
behaviors in other situations (cf. Dunham
et al., 1969). The emotional behaviors generally
drop out within a very few sessions
of response-contingent punishment training. 
Of course, this decrement in grid-biting be- 
havior would not be expected in the case 
of noncontingent punishment in the sense 
that the UCS is maintained at a particular 
frequency independent of the changes in be- 
havior. 

The data also have implications for the 
two-process interpretations of punishment 
suppression. Basically, the very general 
prediction that some alternative behavior will 
increase is supported. It is questionable, 
however, to think that the increase observed 
in the alternative behaviors observed in the 
present experiments could account for the 
suppression in the punished response. The
time courses of the two transition processes
are very different. The punished behavior
is suppressed very rapidly relative to the 
initial disruption and slow rise of the alterna- 
tive behavior. 

The preceding evidence involves the response-contingent
punishment operation.
The author has not yet started an empirical
analysis of the noncontingent punishment
procedures. There are, however, several
studies in the literature which are relevant
to the predictions made by the rules discussed
in this article as applied to noncontingent
procedures. As indicated in earlier
discussion, the organism cannot, by defini- 
tion, reduce the shock frequency when a 
noncontingent operation is employed. This 
leaves the temporal dimension of the con- 
tingency in an important role. The response 
which consistently follows the shock event 
will predict the absence of the next shock 
onset for a longer length of time than any 
other response in the repertoire. The phe- 
nomenon reported by Morse et al. (1967) in 
which an elicited behavior develops and 
modulates under fixed-interval noncontin- 
gent shock suggests that the organism is 
sensitive to the temporal dimension of the 
avoidance contingency. More recent evi- 
dence reported by Powell and Creer (1969) 
can be subjected to a similar interpretation. 
These investigators delivered noncontingent 
shock at the rate of 100 shocks per session 
to 10 pairs of rats and recorded the frequency 
of aggressive attacks per session. The au- 
thors did not specify the intershock interval 
or the session length in their procedure. 
Ten successive days of noncontingent shock 
sessions were conducted and measures of 
aggressive behavior indicated that an in- 
crease in the amount of aggression occurred 
over the course of the experiment. Although 
the experiment was designed to determine 
the effects of maturation (among other 
things) on aggressive behavior, it is sug- 
gested that the 10 days of maturation were 
confounded with 10 days of avoidance train- 
ing during the experiment. Specifically, 
the aggressive behavior of these rats pre- 
dicted the absence of shock onset for a 
longer period of time than any of the other 
(unmeasured) behaviors in the organism's 
repertoire. Similar to the monkeys in the 
Morse et al. procedure, the rats in the 
Powell and Creer experiment are assumed 
to have recognized this avoidance contin- 
gency and adjusted the behavior appro- 
priately. 

Assuming for the moment that the avoid- 
ance interpretation of the Powell and Creer 
experiment is correct, the present rules have 
implications for the development of aggres- 
sion; namely, that certain stimuli will elicit 
aggressive behaviors, and under a variety 
of procedures used to deliver the stimuli, 
the aggressive behavior will develop as an 
avoidance response and be maintained in 
the absence of the primary UCS. Under 
other conditions, the avoidance contingency 
is not present and one should observe ag- 
gression at the unconditioned level. I know 
of no theoretical account of aggression which 
implicates an avoidance contingency in the 
development and maintenance of aggression 
in animals. Additional work with temporal 
schedules of aversive stimulation which elim- 
inate this avoidance contingency should re- 
veal the extent to which it is involved in the 
conditioning of the aggressive behavior of 
various species. 


Some Concluding Remarks 


In the absence of more data, it would be 
premature to attempt to elaborate on the 
type of theoretical mechanism which is im- 
plied by the empirical rules which have been 
outlined in the preceding discussion. I 
would like, however, to conclude by suggest- 
ing briefly the general directions which one 
might take in the development of such a 
mechanism. First, the data and arguments 
which have been discussed suggest very 
strongly that there is little to recommend 
those traditional theoretical accounts of 
punishment which start from an alternative 
response assumption. In place of this 
assumption, the rules which have been out- 
lined imply a symmetrical conditioning mech- 
anism in which (a) those responses which 
predict the aversive event are actively inhib- 
ited and (b) those responses which predict 
the absence of the aversive event are actively 
excited. 

History provides at least two examples of 
symmetrical excitatory and inhibitory me- 
chanisms which deal with appetitive and 
aversive events. If one is biased toward 
Pavlovian conditioning mechanisms, it is 
possible to conceptualize the multiple-re- 
sponse methodology as a multiple-CS meth- 
odology in which different responses are 
viewed as different CSs which predict, to 
varying degrees, the presence or the absence 
of the unconditioned stimulus (aversive or 
appetitive). Once conceptualized in terms 
of Pavlovian operations, some version of 
Pavlov's (1927) concepts of inhibition and 
excitation can be developed to explain the 
effects of punishment on performance. 
Konorski's (1967) inhibitory and excitatory 
processes and some recent modifications of 
these ideas (e.g., Maier, Seligman, & Solo- 
mon, 1969) should, for example, be con- 
sidered in the context of the multiple-re- 
sponse punishment procedure.³ 

3 A basic assumption in Konorski's theorizing is 
the notion that noxious stimuli such as shock elicit 
a drive state, fear, which has general inhibitory 
effects on “preservative” drive states such as hun- 
ger, thirst, etc. Punishment would be predicted to 
have a suppressive effect on all responses motivated 
by “preservative” drive states. In the experiments 
conducted thus far, the inhibitory effects which 
we have observed are specific to the punished 
response. For example, when the gerbils were 
punished for eating they continued to drink the 
usual amount each session. Hence, if Konorski's 
inhibition mechanism were to be considered in the 
context of punishment, the inhibitory properties 

Alternatively, if one is biased toward in- 
strumental conditioning mechanisms, it is 
possible to conceptualize the multiple-re- 
sponse methodology in terms of instrumental 
contingencies in which each response has 
either rewarding or punishing consequences. 
Once conceptualized in terms of instrumental 
operations, some version of Thorndike's 
(1913) positive and negative Law of Effect 
can be elaborated upon as a symmetrical re- 
inforcement mechanism. Rachlin and 
Herrnstein (1969) have recently made such 
a suggestion. 

In either case, it is hoped that the multi- 
ple-response base-line procedure will help to 
restore the interaction between punishment 
theory and punishment data which has been 
lacking in the punishment literature. 


of shock would have to be more "drive" or "re- 
sponse" specific than he has assumed. Estes (1969, 
p. 69), in his more recent theorizing, makes an 
assumption very similar to Konorski's. Specifically, 
he suggests that the activation of negative drive 
systems (e.g., shock-produced attack or flight) re- 
sults in the inhibition of positive drive systems, 
which, in turn, accounts for the decrement in 
punished responding. Again, it should be noted 
that the gerbil data suggest that the only positive 
drive system which appears to be permanently 
inhibited during punishment is the specifically 
punished drive system—or its associated response. 


REFERENCES 

Azrin, N. H., & Holz, W. C. Punishment. In 
W. K. Honig (Ed.), Operant behavior: Areas 
of research and applications. New York: Ap- 
pleton-Century-Crofts, 1966. 

Bolles, R. C. Species-specific defense reactions 
in avoidance learning. Psychological Review, 
1970, 77, 32-48. 

Bolles, R. C., & Popp, R. J., Jr. Parameters 
affecting acquisition of Sidman avoidance. Jour- 
nal of the Experimental Analysis of Behavior, 
1964, 7, 315-321. 

Dinsmoor, J. A. Punishment: I. The avoidance 
hypothesis. Psychological Review, 1954, 61, 34- 

Dinsmoor, J. A. Punishment: II. An interpre- 
tation of empirical findings. Psychological Re- 
view, 1955, 62, 96-105. 

Dunham, P. J., & Kilps, B. Shifts in magnitude 
of reinforcement: Confounded factors or contrast 
effects. Journal of Experimental Psychology, 
1969, 79, 373-374. 

Dunham, P. J., Mariner, A., & Adams, H. En- 
hancement of off-key pecking by on-key punish- 
ment. Journal of the Experimental Analysis of 
Behavior, 1969, 1, 156-166. 

Estes, W. K. Outline of a theory of punish- 
ment. In B. A. Campbell & R. M. Church 
(Eds.), Punishment and aversive behavior. 
New York: Appleton-Century-Crofts, 1969. 

Estes, W. K., & Skinner, B. F. Some quantita- 
tive properties of anxiety. Journal of Experi- 
mental Psychology, 1941, 29, 390-400. 

Falk, J. L. The motivational properties of sched- 
ule-induced polydipsia. Journal of the Experi- 
mental Analysis of Behavior, 1966, 9, 19-25. 

Konorski, J. Integrative activity of the brain: 
An interdisciplinary approach. Chicago: Uni- 
versity of Chicago Press, 1967. 

Maier, S. F., Seligman, M. E. P., & Solomon, 
R. L. Pavlovian fear conditioning and learned 
helplessness. In B. A. Campbell & R. M. Church 
(Eds.), Punishment and aversive behavior. New 
York: Appleton-Century-Crofts, 1969. 

Miller, N. E., & Dollard, J. Social learning and 
imitation. New Haven: Yale University Press, 
1941. 

Morse, W. H., Mead, R. N., & Kelleher, R. T. 
Modulation of elicited behavior by a fixed-in- 
terval schedule of electric shock presentation. 
Science, 1967, 157, 215-217. 

Mowrer, O. H. On the dual nature of learning: 
A reinterpretation of "conditioning" and "prob- 
lem-solving." Harvard Educational Review, 
1947, 17, 102-148. 

Pavlov, I. P. Conditioned reflexes. (Trans. by 
G. V. Anrep) New York: Dover, 1927. 

Powell, D. A., & Creer, T. L. Interaction of de- 
velopmental and environmental variables in shock- 
elicited aggression. Journal of Comparative and 
Physiological Psychology, 1969, 69, 219-226. 

Premack, D. Reinforcement theory. Nebraska 
Symposium on Motivation, 1965, 13, 123-188. 

Rachlin, H., & Herrnstein, R. J. Hedonism re- 
visited: On the negative law of effect. In B. A. 
Campbell & R. M. Church (Eds.), Punishment 
and aversive behavior. New York: Appleton- 
Century-Crofts, 1969. 

Schuster, R., & Rachlin, H. Indifference be- 
tween punishment and free shock: Evidence for 
the negative law of effect. Journal of the Ex- 
perimental Analysis of Behavior, 1968, 11, 777- 

Solomon, R. L. Punishment. American Psycho- 
logist, 1964, 19, 239-253. 

Thorndike, E. L. Educational psychology. Vol. 
2. The psychology of learning. New York: 
Columbia University, Teachers College, Bureau 
of Publications, 1913. 

Thorndike, E. L. The fundamentals of learning. 
New York: Columbia University, Teachers 
College, Bureau of Publications, 1932. 

(Received April 21, 1970) 

Psychological Review 
1971, Vol. 78, No. 1, 71-80 


ON THE EXPONENTS IN STEVENS’ LAW AND 
THE CONSTANT IN EKMAN’S LAW ¹ 


ROBERT TEGHTSOONIAN ² 
Smith College 


It follows from Stevens’ psychophysical power law, $\psi = a\phi^{n}$, that the 
exponent $n = \log R_\psi / \log R_\phi$, where $R_\phi$ is the ratio of the greatest 
to the least stimulus intensity and $R_\psi$ is the ratio of corresponding 
sensory magnitudes. Data from 21 experiments by S. S. Stevens show a 
correlation (Pearson r) of .935 between $\log R_\phi$ and $1/n$, implying that 
$\log R_\psi$ is nearly constant. On this basis it is proposed that a single 
scale of sensory magnitude serves a wide variety of perceptual continua, 
and that variation in power law exponents is primarily due to variation 
in dynamic ranges. The hypothesis that there is just one scale of sensory 
magnitude suggests that there may be just one value for subjective 
resolving power. When Weber fractions are transformed to their subjective 
counterparts by the psychophysical power law, the result for nine 
different continua is nearly constant at about .03. 


Since the development of the direct judg- 
ment techniques by S. S. Stevens and his 
associates, an impressive array of data has 
accumulated showing that judgments of sen- 
sory magnitude grow as power functions of 
stimulus intensity. Each sensory mode 
yields a characteristic exponent which has 
been thought to indicate the rate at which 
log sensory magnitude grows with log stim- 
ulus intensity. For example, the exponent 
of about .33 for brightness has been inter- 
preted as indicative of the decreasing rate 
at which brightness grows with luminance; 
at the other extreme, the exponent of 3.5 for 
judgments of the subjective intensity of elec- 
tric shock has been regarded as evidence that 
this experience grows increasingly more 
rapidly than the current which produces it. 
Thus, power law exponents can be viewed 
as measures of an important property of the 
relevant sensory apparatus. 


¹ The manuscript was prepared while the author 
was a visitor in the Laboratory of Psychophysics, 
Harvard University. The author is indebted to 
S. Sternberg, J. C. Stevens, and Martha Teght- 
soonian for helpful criticisms of an earlier version 
of this manuscript. 

² Requests for reprints should be sent to Robert 
Teghtsoonian, Department of Psychology, Burton 
Hall, Smith College, Northampton, Massachusetts 
01060. 


But other interpretations are possible. 
Poulton (1967), among others, has raised 
the possibility that “the sizes of exponents 
are merely a function of the experimental 
conditions under which they were deter- 
mined [p. 316].” Instead of providing us 
with information about the nervous system, 
exponents may simply reflect certain param- 
eters of the test situation; the most import- 
ant of these, according to Poulton, is the 
geometric range (ratio of extreme inten- 
sities) of stimuli employed. As represented 
in Figure 1, an exponent is the slope of the 
linear function relating $\log \psi$ (sensory mag- 
nitude) to $\log \phi$ (stimulus intensity). To 
the extent that the subjects’ (Ss') judg- 
ments cover a constant range of log magni- 
tudes, exponents will reflect only the range 
of log intensities selected by the experi- 
menter (E). There is then a possibility that 
Ss’ judgments are to some degree insensitive 
to the subjective magnitudes which they are 
supposed to represent, and tend to cover 
a constant ratio no matter what range of 
stimuli is presented. In that case, exponents 
may only be indirect measures of E’s arbi- 
trary selection of stimulus intensities. In 
support of this view, which may be called 
the “procedural artifact theory,” Poulton 
(1967) reports that for 21 studies conducted 
by S. S. Stevens and his co-workers, there 
is indeed a moderate correlation between 
exponent and geometric range of stimulus 
intensities (tau = —.60). 

Fig. 1. The effect on power law exponents of varia- 
tion in range $\log \phi$ when range $\log \psi$ is constant. 


My purpose in this article is twofold. 
First, a reexamination of the data considered 
by Poulton leads me to suggest that he has 
underestimated the closeness with which 
exponents are related to the stimulus 
ranges³ employed. As is shown below, 
about 87% of the variance in exponents is 
accounted for by variation in stimulus range. 
Second, I wish to propose a hypothesis about 
this striking relation which is consistent with 
Stevens' view that exponents reflect an im- 
portant property of the associated sensory 
systems. Briefly, this hypothesis proposes 
that the ratio of the greatest to the smallest 
possible sensory magnitude is approximately 
constant for all perceptual continua, and that 
variation in power law exponents among 
continua reflects variation in the ratio of 
the greatest to the smallest stimulus intensity 
to which S is responsive (a ratio which is 
sometimes called the dynamic range). In 
short, widely varying dynamic ranges may 
all be mapped into the same sensory range; 
and if some nearly constant proportion of 
those dynamic ranges is presented, the result 
will still be a nearly constant sensory range, 
although proportionately smaller than its 
maximum value. 


³ Throughout this paper, the phrase "stimulus 
range" refers to the ratio of stimulus intensities, 
and "sensory range" refers to the ratio of corre- 
sponding sensory magnitudes. 


This hypothesis differs from the pro- 
cedural artifact theory in two important 
ways. First, it holds that variation in stim- 
ulus range, while certainly under E's control, 
may be heavily constrained by the sensitivity 
of S's nervous system. For example, the 
fact that E presents, say, three log units of 
sound pressure in a loudness scaling experi- 
ment but only one log unit of width in scal- 
ing the apparent extent of finger span, may 
reflect the dynamic ranges of the two sen- 
sory systems involved rather than E’s arbi- 
trariness in determining experimental 
parameters. Second, this hypothesis holds 
that the approximate constancy in the ratio 
spanned by S's judgments is due to the 
underlying constancy in sensory range rather 
than to the judgmental rigidity posited by 
the procedural artifact theory. But before 
further elaboration of this hypothesis, let 
us review the evidence considered by Poul- 
ton (1967) in assessing the relation between 
exponents and stimulus range. 


S. S. STEVENS’ SCALING DATA 


These data are drawn from 21 experi- 
ments conducted in Stevens’ laboratory and 
are summarized in Poulton’s (1967) Table 
1. My analysis of these data is based on the 
following considerations. 

It follows from the simple form of the 
power law 

$$\psi = a\phi^{n} \qquad [1]$$

that 

$$n = \frac{\log R_\psi}{\log R_\phi} \qquad [2]$$

where $R_\phi$ is the ratio of the greatest to the 
smallest stimulus intensity, and $R_\psi$ is the 
ratio of corresponding sensory magnitudes. 
If the range of log judgments provided by 
Ss is nearly constant, we may substitute $K$ 
for $\log R_\psi$. Then 

$$n = \frac{K}{\log R_\phi} \qquad [3]$$

An estimate of this constant $K$ may be ob- 
tained by determining the value of $n \log R_\phi$ 
for each of the 21 experiments, and calculat- 
ing the mean. The resulting value, 1.53, 
was used in Equation 3 to generate a theo- 
retical function relating the value of the ex- 
ponent to $\log R_\phi$. That function is shown 
in Figure 2 together with the 21 points taken 
from Poulton’s table, each represented by a 
filled circle. The exponents are plotted on 
the ordinate against the log stimulus range 
on the abscissa.⁴ (The results of three addi- 
tional experiments are plotted with unfilled 
circles. They do not figure in the data 
analysis, but are included to diminish the 
importance that a viewer might otherwise 
attach to the single points at each end of 
the abscissa.) The quality of the fit can be 
assessed by replotting the data with the re- 
ciprocal of the exponent, $1/n$, on the ordinate 
against $\log R_\phi$ on the abscissa. If Equation 
3 is true, the results should be well fitted 
by a straight line. Such a fit was made by 
the method of least squares; a Pearson r of 
.935 indicates that over 87% of the variance 
in the reported exponents is accounted for 
by the stimulus range employed.⁵ 
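
As a rough numerical illustration of Equation 3, the following Python sketch evaluates the theoretical function with the reported constant of 1.53 and converts the reported Pearson r into a proportion of variance. The log ranges used are arbitrary illustrative values, not the 21 data points (which are not tabled here), and the function name is hypothetical.

```python
K = 1.53  # mean of n * log10(R_phi) over the 21 experiments, as reported in the text


def predicted_exponent(log_range: float, k: float = K) -> float:
    """Equation 3: exponent predicted for a stimulus range of `log_range` log10 units."""
    if log_range <= 0:
        raise ValueError("log stimulus range must be positive")
    return k / log_range


# Illustrative log10 stimulus ranges (assumed values, chosen only for display).
for log_range in (0.5, 1.0, 3.0, 6.0):
    print(f"log R_phi = {log_range:3.1f}  ->  n = {predicted_exponent(log_range):.2f}")

# Proportion of variance accounted for by a Pearson r of .935.
r = 0.935
variance_explained = r ** 2
print(f"r squared = {variance_explained:.3f}")
```

The squared correlation is what underlies the statement that over 87% of the variance in exponents is accounted for by stimulus range.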


INTERPRETATIONS 


Although the stimulus range may vary 
from .5 to 6.0 log units, the judgmental 
range is approximately constant at about 
1.5 log units. According to the procedural 
artifact theory, this constancy represents 
S's inflexibility in making judgments and, 
extending Poulton's argument, we could con- 
clude that the exponent is almost entirely 
determined by E's choice of stimulus range. 
The alternative interpretation that I have 
proposed is based on two concepts: (a) dy- 
namic range and (b) the constancy of the 
maximum range of subjective magnitude. 


⁴ The points for binaural loudness, monaural 
loudness, vocal effort (the autophonic response), 
and vibration intensity are plotted using values 
of $\log R_\phi$ which are just half those shown by 
Poulton. In expressing a given number of decibels 
as a ratio, Poulton has not taken account of the 
fact that in these four cases the reported exponents 
were calculated for ranges of stimulus pressure or 
amplitude; in such cases the number of decibels 
is $20 \log R_\phi$, not $10 \log R_\phi$ as Poulton assumed. 

⁵ Jones and Woskow (1962) report a tau of 
.93 for nine continua tabled by Stevens (1960). 
It should be noted that the exponents came from 
one set of experiments using numerical magnitude 
estimation, while the stimulus ranges applied to 
another set of studies using cross-modal matching. 
Jones and Woskow also were led to surmise that 
judgmental ranges are approximately constant. 


Fig. 2. The filled circles are for 21 experiments 
by Stevens and his associates tabled by Poulton 
(1967). The unfilled circles, from left to right, 
are for saturation of yellow (Indow & Stevens, 
1966), saturation of red (Panek & Stevens, 1966), 
and apparent viscosity (Stevens & Guirao, 1964). 


Dynamic Range 
The notion that the perceptual continua 
differ in the ratio of extreme intensities to 
which S can respond is not a novel one, but 
there is regrettably little direct evidence on 
its validity. Much of the uncertainty con- 
cerns the problem of determining upper 
bounds to the range of sensitivity. In some 
cases, there is a fairly obvious physical 
limitation; for example, finger span simply 
cannot go much beyond 140 millimeters 
(mm.), and similarly the capacity to sense 
a lifted weight will be limited by muscular 
strength. In other cases, the problem is 
more difficult, and indeed some may question 
whether there is an upper limit to sound 
pressures that can be heard or to luminances 
that can be seen. Yet even in these cases 
the occurrence of physical damage to the 
receptor system may establish an upper 
bound. 

Despite the obvious difficulties in achiev- 
ing precise measurement of dynamic range, 
the concept should not be overlooked as a 
factor influencing E’s choice of stimulus in- 
tensities. For example, when scaling the 
subjective intensity of electric current ap- 
plied to the fingers, it is just not possible to 
vary current by much more than a factor of 
four. By contrast, the ear readily responds 
to sound pressures varied by a factor of at 
least 100,000. It is this striking variation 
in dynamic range that imposes an upper 
limit on E's selection of $R_\phi$, and according 
to the present hypothesis, it is this same 
variation which accounts for the variation in 
power law exponents. 
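
To make the contrast in dynamic range concrete, the sketch below converts the two stimulus ratios just quoted into log units and shows what they would imply for the relative size of exponents. It assumes, purely for illustration, that both continua are mapped into the same sensory range and that the full quoted ratios are used; the variable names are hypothetical.

```python
import math

# Stimulus ratios quoted in the text (the sound-pressure figure is a lower bound).
shock_ratio = 4.0
sound_pressure_ratio = 100_000.0

log_range_shock = math.log10(shock_ratio)
log_range_sound = math.log10(sound_pressure_ratio)

# If log R_psi is the same constant for both continua, n is proportional to
# 1 / log R_phi, so the ratio of exponents is the inverse ratio of log ranges.
predicted_exponent_ratio = log_range_sound / log_range_shock

print(f"log range, electric current: {log_range_shock:.2f}")
print(f"log range, sound pressure:   {log_range_sound:.2f}")
print(f"implied n(shock) / n(loudness): {predicted_exponent_ratio:.1f}")
```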

Of course it is unlikely that any of Ste- 
vens' 21 experiments made use of the full 
dynamic range; that is, $\log R_\phi / \log R_{\phi\,\max}$ 
was always less than unity. But to the 
extent that this fraction is nearly the same in 
all experiments, it would still follow that for 
$\log R_\psi$ a constant, the relative size of ex- 
ponents reflects the relative size of dynamic 
ranges. Is $\log R_\phi / \log R_{\phi\,\max}$ in fact a 
constant for the 21 studies in question? 
Lacking definitive measures of dynamic 
range, a direct empirical answer is not avail- 
able. But since it is the practice in Stevens’ 
laboratory (as in others) to conduct scaling 
experiments with the largest value of $R_\phi$ 
commensurate with S's comfort, it is not 
unlikely that a nearly constant proportion of 
the dynamic range is thereby selected. 


Range of Subjective Magnitude 
If the maximum value of $R_\psi$ is the same 
for all perceptual continua, it is of interest 
to know how large that value may be. The 
estimate of about 1.5 log units reported above 
is probably lower than the true value. Sup- 
pose, as I have argued above, that 
$\log R_\phi / \log R_{\phi\,\max}$ for Stevens’ 21 studies is some 
constant C, less than unity. In that case, 
an estimate of $\log R_{\psi\,\max}$ is provided by 
1.5/C; and if C is not less than .5, then 
$\log R_{\psi\,\max}$ is located in the range from 1.5 
to 3.0 log units. Only through a determina- 
tion of values for dynamic range can a more 
precise estimate be developed. 

⁶ An estimate of $\log R_\psi$ can also depend on the 
exact form of the power law which is adopted. 
The version which translates stimulus intensity by 
an additive constant, $\psi = a(\phi - \phi_0)^{n}$, can be used 
in conjunction with the present data to generate 
an estimate of $\log R_\psi = 2.1$. 


What kind of perceptual structure might 
generate the same value of $R_{\psi\,\max}$ for all 
continua? The present data are consistent 
with a model that posits a single central 
mechanism responsible for all judgments of 
sensory magnitude. Such a sensory magni- 
tude monitor would impose the same range 
of possible outputs regardless of the receptor 
system in the periphery to which it was re- 
sponding. In this context, the various re- 
ceptor systems can be regarded as perform- 
ing the necessary expansions or compres- 
sions required to map the widely varying 
dynamic ranges into this constant range of 
magnitudes. Empirical investi- 
construct and the development 
of additional defining properties. For the 
present, it is proposed only for its heuristic 
value in conceptualizing a structural basis 
for the constant in Equation 3. 


INTRAMODAL VERSUS INTERMODAL VARIA- 
TION IN STIMULUS RANGE 

Given the two interpretations of Equation 
3, are there grounds for preferring one to 
another? Do they generate differential 
predictions? The present hypothesis of a 
constant sensory range relates exponents to 
an inherent characteristic of the relevant 
receptor system and does not in itself antici- 
pate any effect of intramodal variation in 
$R_\phi$. But as the procedural artifact theory 
regards the actual value of $R_\phi$ as critical, 
the effect on exponents should be no dif- 
ferent whether $R_\phi$ is varied intermodally (as 
in Figure 2) or intramodally. Indeed, a review 
of the pertinent literature reveals that for 
some continua there are conditions under 
which the exponent declines as $R_\phi$ grows. 
For example, Stevens and Poulton (1956) had 
separate groups of Ss make magnitude 
estimations of a 1000-Hertz (Hz.) tone 6, 10, 
20, or 40 decibels (db.) less than a standard 
set at about 100 db. above threshold. The ex- 
ponents estimated for these conditions are 
approximately .67, .60, .52, and .46, respec- 
tively. However, if these values are referred 
to the coordinates of Figure 2, it is evident 
by inspection that this effect of variation in 
$\log R_\phi$ falls far short of an inverse relation 
to $n$. It seems that loudness judgments can- 
not be made to grow as rapidly as judgments 
of the subjective intensity of, say, electric 
shock by using the same value of $R_\phi$ for 
sound pressure as for electric current. While 
more evidence is needed to resolve this point, 
the results of Stevens and Poulton suggest 
that the intramodal effects of variation in $R_\phi$ 
may be quite different in magnitude, and 
hence probably in cause, from the intermodal 
effects depicted in Figure 2. 
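
A small Python sketch can indicate how far the intramodal results fall short of a strict inverse relation. It computes the exponents that Equation 3 would yield if it held within a modality, for the four decibel ranges used by Stevens and Poulton. It assumes that each decibel figure can be treated as a range of sound pressure (so that the log range is the decibel value divided by 20); that reading, and the function name, are assumptions made only for this illustration.

```python
K = 1.53  # constant of Equation 3, as estimated in the text

# Separations between variable and standard used by Stevens and Poulton (1956), in db.
ranges_db = (6, 10, 20, 40)


def strict_inverse_exponent(range_db: float, k: float = K) -> float:
    """Exponent expected if n = K / log R_phi held within a modality.

    Assumes the decibel range refers to sound pressure, so that
    log10(R_phi) = db / 20.
    """
    return k / (range_db / 20.0)


for db in ranges_db:
    print(f"{db:2d} db  ->  n = {strict_inverse_exponent(db):.2f}")
```

Under these assumptions the strict inverse relation would require exponents running from above 5 down to below 1, whereas the reported exponents decline only from about .67 to .46.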


RESOLVING POWER 


What has been called Ekman’s law 
(Stevens, 1966) states that the subjective 
size of the just noticeable difference (jnd) 
is linearly proportional to the subjective 
magnitude of the standard ; there appears to 
be a relativity for sensory magnitudes analo- 
gous to that defined by Weber's law for 
stimulus magnitudes. But if there is just 
one range of sensory magnitudes, it is 
tempting to consider that there may be 
just one constant of proportionality for Ek- 
man’s law. Thus S might be able to resolve 
a 100 C% change in sensory magnitude 
(where $C = \Delta\psi/\psi$) regardless of the form 
of stimulus input. Such a conclusion is 
consistent with the hypothesis that discrim- 
inations are based on the output of the sen- 
sory magnitude monitor discussed above. 

If one selects the simplest version of the 
power law, 

$$\psi = a\phi^{n},$$

then it follows that, for any continuum, 

$$\psi + \Delta\psi = a(\phi + \Delta\phi)^{n}$$

and therefore 

$$1 + \frac{\Delta\psi}{\psi} = \left(1 + \frac{\Delta\phi}{\phi}\right)^{n}, \qquad [4]$$

where $\Delta\phi/\phi$ is the Weber fraction and $\Delta\psi/\psi$ 
is the associated fractional growth in $\psi$. To 
test the constancy of $\Delta\psi/\psi$ requires that 
empirical values of $\Delta\phi/\phi$ obtained for various 
continua be inserted in Equation 4. An 
unequivocal specification of $\Delta\phi/\phi$ is notori- 
ously difficult; the value is dependent on 
many parameters of the test situation. 
Nonetheless, it seemed reasonable to search 
for values of $\Delta\phi/\phi$ that could be regarded as 
characteristic of a particular continuum. 
The values of $\Delta\phi/\phi$ shown in Table 1 are 
for stimuli that resemble as closely as pos- 
sible the stimuli that were employed in the 
scaling experiments which generated the 
tabled exponents. In the following sections, 
the sources for these data are cited, and 
any analyses which go beyond what the 
authors provide are explained. 


TABLE 1 
INVARIANCE OF $(1 + \Delta\phi/\phi)^{n} = 1 + \Delta\psi/\psi$ 

Continuum              $\Delta\phi/\phi$     $n$     $\Delta\psi/\psi$ 
Brightness                  .079            .33         .026 
Loudness                    .048            .6          .029 
Finger span                 .022           1.30         .029 
Heaviness                   .020           1.45         .029 
Length                      .029           1.04         .030 
Taste, NaCl                 .083            .41         .033 
Saturation, red             .019           1.7          .033 
Electric shock              .013           2.5          .033 
Vibration 
  60 Hz.                    .036            .95         .034 
  125 Hz.                   .046            .67         .031 
  250 Hz.                   .046            .64         .029 
  Mean                                                  .031 
Grand mean for nine continua                            .030 

Note. $\Delta\phi/\phi$ is the Weber fraction, $\Delta\psi/\psi$ is the associated 
fractional growth in sensory magnitude, and $n$ is the exponent 
of the power function relating subjective magnitude to physical 
intensity. 

The Data 

Brightness. One must be especially un- 
easy about selecting a single value to repre- 
sent the Weber constant for brightness dis- 
crimination, since measures of resolving 
power depend at least on target size, dura- 
tion, and retinal location, as well as the state 
of the retina at the time of the test. But 
there is also evidence that the measured size 
of the difference limen (DL) becomes in- 
creasingly insensitive to changes in duration 
and area beyond certain critical levels. For 
duration, Anglin and Mansfield (1968) have 
summarized evidence that at very high 
luminances (around 95 db.) 25 milliseconds 
is the critical value; longer exposures will 




TABLE 2 

SUMMARY OF FIVE STUDIES OF DIFFERENTIAL SENSITIVITY FOR BRIGHTNESS 


E Target Durati Log 
Study I Method No. Ss | m (seconds) Aó/ó 
| 
: 5 | Hines ascending 2 60 .032 to —1.2 
Herrick (1956) | Limits; ascending | | | 2013 | 
inita. asenin 1 60 | .02 -14 
twick (1954) | Limits, asc ending M Ls | 
IRE bom | Constant stimuli 604% point : 2 | 40 02 | = D 
Keller (1941) f Limits, ascending and descending 2 | 88 20 ee 3 
pesn iira ascending 5 $3 
Graham & Bartlett (1940). Limits, ascending and descending | 2 | 56 | 03 f 


not further reduce the DL. In the studies 
to be considered, exposure times were near 
or greater than this critical duration. For 
target size, relatively little improvement in 
differential sensitivity occurs beyond diam- 
eters of about 1° (e.g., Blackwell, 1946) ; 
the studies to be considered all employed tar- 
gets near that size. To obtain from those 
studies the best estimate of the slope con- 
stant in the linear generalization of Weber's 
law, the value of the Weber fraction was 
taken at the highest intensity studied (or 
the point at which the fraction was mini- 
mized). Five experiments which met these 
conditions are summarized in Table 2; they 
are in fairly good agreement about the value 
of $\log \Delta\phi/\phi$ despite variation in psychophysi- 
cal method and experimental conditions. 
The median value of $\log \Delta\phi/\phi$, −1.1, was 
selected and its antilog is the value shown 
in Table 1. Of course the list of studies in 
Table 2 is not exhaustive, but the reported 
values seem to be representative. Two well- 
known studies which report markedly lower 
values of $\Delta\phi/\phi$, Steinhardt (1936) and 
Blackwell (1946), were excluded because 
of the unusual degree of training provided 
the Ss; both authors comment on the many 
thousands of judgments made over pro- 
longed periods of time. Further, Black- 
well’s Ss were encouraged (successfully) 
to respond at levels of awareness well below 
those usually required by the classical psy- 
chophysical methods. 
The exponent of the power law describing 
the brightness function has been reported 
as .33 by Stevens and Stevens (1963). 

Loudness. For loudness, the value .048
is due to Miller (1947) and refers to
discrimination of the intensity of white noise.
He found that at levels greater than 30 db. 
above threshold, the increment (1.5-second 
pulse) heard 50% of the time was nearly 
constant at .41 db. ; his sample was composed 
of two experienced Ss. A similar value has 
been reported by Harris (1963) for a wide 
range of pure tones (125 Hz. to 6000 Hz.) 
using the method of constant stimuli. The 
exponent of .6 for the power function relat- 
ing loudness to sound intensity was reported 
by Stevens (1956) and has been recognized 
as the standard value. 
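
The relation between Miller's .41-db. increment and the Weber fraction of .048 can be checked with a short calculation. The sketch below assumes that the fraction is expressed in terms of sound pressure (20 db. per tenfold change); that reading is an assumption, not something the text states, and the function name is hypothetical.

```python
def db_to_weber_fraction(increment_db: float, db_per_decade: float = 20.0) -> float:
    """Return the fractional change corresponding to an increment in decibels.

    db_per_decade=20 treats the stimulus as sound pressure (assumed here);
    db_per_decade=10 would treat it as sound power instead.
    """
    return 10 ** (increment_db / db_per_decade) - 1


miller_increment_db = 0.41  # increment heard 50% of the time (from the text)

pressure_fraction = db_to_weber_fraction(miller_increment_db)
power_fraction = db_to_weber_fraction(miller_increment_db, db_per_decade=10.0)

print(f"pressure-based Weber fraction: {pressure_fraction:.3f}")
print(f"power-based Weber fraction:    {power_fraction:.3f}")
```

Under the pressure reading the result is close to the .048 quoted above; under the power reading it would be roughly twice as large.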

Finger span. Differential sensitivity for 
finger span has been studied by Gaydos 
(1958) using the method of adjustment. In 
his Experiments 1 and 2, a total of 100 Ss 
made matches to standards ranging from 
17.7 to 100 mm. Probable errors can be 
plotted against the lengths of the standards, 
yielding a linear function with a slope of 
.024. In his Experiment 3, Gaydos reports
data obtained from a group of 20 Ss for a
shorter range of standards; ignoring an
anomalous value for the shortest standard, a
linear fit of the probable errors to the
standards yields a slope of .020. A similar
value, .019, has been reported by Stevens and
Stone (1959) for data obtained by G. J.
Huberman. (Stevens and Stone, 1959, plotted
standard deviations rather than probable
errors and reported a slope of .0286. I have
multiplied this by .6745 to obtain the slope
for probable errors.) Giving double weight to
the value derived from Gaydos' Experiments 1
and 2, these three results have a mean of
.022.

Stevens and Stone (1959) reported that
magnitude estimation of apparent finger span
yields a power function with an exponent of
1.33. More recently, Mashhour and Hosman
(1968) have obtained an exponent of 1.26
using a standard in the middle of a series of
11 stimuli ranging from 10 mm. to 110 mm. The
mean of these two results, rounded to 1.30,
is the value listed in Table 1.
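
The arithmetic behind the finger-span entries can be retraced in a few lines. This is only an illustrative sketch using the figures quoted in the text; the variable and function names are hypothetical.

```python
def weighted_mean(values, weights):
    """Weighted arithmetic mean of paired values and weights."""
    return sum(v * w for v, w in zip(values, weights)) / sum(weights)


# Conversion from a standard deviation to a probable error (factor from the text).
SD_TO_PROBABLE_ERROR = 0.6745
stevens_stone_slope = 0.0286 * SD_TO_PROBABLE_ERROR  # about .019

# Slopes: Gaydos Exps. 1 and 2 (double weight), Gaydos Exp. 3, Stevens and Stone.
weber_slope = weighted_mean([0.024, 0.020, 0.019], [2, 1, 1])

# Power-function exponents: Stevens and Stone (1959); Mashhour and Hosman (1968).
mean_exponent = (1.33 + 1.26) / 2

print(f"converted slope: {stevens_stone_slope:.4f}")
print(f"weighted slope:  {weber_slope:.5f}")
print(f"mean exponent:   {mean_exponent:.3f}")
```

The weighted slope comes to .02175 and the mean exponent to 1.295, which the text reports as .022 and 1.30.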


Heaviness. For differential sensitivity to 
lifted weights, the work of Oberlin (1936) 
is most frequently cited. In his Experiment 
II, five Ss judged the second of pairs of 
weights as heavier, lighter, or equal in com- 
parison to the first. Standards varied from 
25 grams (gm.) to 600 gm. Urban’s process 
was used to determine the upper and lower 
DLs. When half of the interval is calcu- 
lated, the average over Ss is nearly linear 
with the standard weight (although the value 
for 600 gm. is aberrant) ; the slope is .02. 
This slope fits the data of Experiment I 
(in which only "heavier" and "lighter" judg- 
ments were allowed) over the lowest four 
standards, 50 gm. to 250 gm., for each of 
four Ss. 

The exponent of the power function for 
apparent heaviness is 1.45 as reported by 
Stevens and Galanter (1957). Although 
Mashhour and Hosman (1968) reported 
1.13 for a range from 70 gm. to 700 gm. 
with a 222-gm. standard, the figure reported 
by Stevens and Galanter is an average over 
several studies and is therefore preferred. 

Length. In measuring DLs for length, 
Ono (1967) employed a variant of the 
method of limits for standard lengths of 5, 
10, and 15 centimeters, at distances of 1.5, 
3.0, and 4.5 meters. Although standard and 
comparison stimuli were displayed together, 
their lateral displacement was sufficient to 
ensure nonsimultaneity of viewing. When 
the size of the DL is plotted against the 
distal size of the standard, the slope of the 
linear relation is .029. Stevens and Galanter
(1957) reported that magnitude estimations 
of apparent length are "very nearly a linear 
function of physical length" ; the exponent of 
the power function is close to 1.00. Mash- 
hour and Hosman (1968) reported an ex- 
ponent of 1.08 for lengths ranging from 
15 mm. to 150 mm. with a standard of
55 mm. The mean of these two values is
1.04.

Saltiness. McBurney, Kasschau, and
Bogart (1967) measured the size of the jnd 
for NaCl after adaptation to water, and also 
after adaptation to a .1 molar solution of 
NaCl. The results of the latter condition, 
although representing just one point on 
the function relating the Weber frac- 
tion to intensity, are reported here. The 
average of Δφ/φ for three Ss following a
method of constant stimuli was .09. Using 
an ascending method of limits with a cri- 
terion of two successive correct responses, 
the average Δφ/φ for two Ss was .075. The
mean of these two values, .083, is used. 
McBurney (1966) used magnitude estima- 
tion with 10 Ss judging the apparent inten- 
sity of solutions of NaCl ranging from water 
and .001 molar to 1.0 molar concentrations. 
Using empirically determined threshold cor- 
rections for each of several adapting inten- 
sities, the data can be well described by a 
power function with an exponent of .41.
Saturation. Panek and Stevens (1966) 
obtained magnitude estimations of the satu- 
ration of red, and measured jnd over a 
range of saturations. The latter was done 
using the method of constant stimuli for 
four Ss and the method of single stimuli 
for four Ss, for saturations varying from 
20% to 80% red. The jnd was defined as 
Q, half the distance between the 25% and
75% values, and the results averaged over 
both methods are reported. The relation 
between Q and the standard saturation is
well fitted by a straight line (discounting 
the value for the weakest standard) with a 
slope of .019. Data are also reported for 
the method of constant stimuli alone at 
standards of 20% and 80%: a straight line
connecting these two values of Q has a
slope of .019, and this is the value shown in
Table 1. The results of the several magnitude
estimation experiments are well described by
a power function with an exponent of 1.7.

Shock. Hawkes (1961) measured Weber 
fractions for the intensity of electric shock 
to the finger tip, for a range of frequencies 
using both the beat method and the method
of successive stimuli. From his graphical
display of Weber fractions, values of Δφ
were calculated for each of three Ss. Only
two intensities were studied, 120% and
200% of each S's absolute threshold, so it
is the slope of the straight line connecting
these two points in the plot of Δφ against
φ that can be calculated. Since there appears
to be little effect due to psychophysical 
method or frequency, slopes were obtained 
for all three Ss at 100 Hz. and at 1500 Hz. 
using both methods. The median value of 
this (somewhat skewed) distribution of 12 
scores is .013. 

Stevens, Carton, and Shickman (1958) 
scaled the subjective intensity of electric 
shock covering a range from .38 milli- 
ampere (ma.) to 1.15 ma. and reported a 
power function with an exponent of 3.5. 
Ekman, Frankenhaeuser, Levander, and 
Mellis (1966) had 15 Ss judge the unpleas- 
antness of electric shock varying in intensity 
from 1.5 to 3.5 times each S's mean sensa- 
tion threshold. The results together with 
findings from a previous study by the same 
authors are well fitted by Ekman's version of 
the power law (involving an additive transla- 
tion of stimulus intensity) with an exponent 
of 1.54. The mean of these two values is
2.5. 

Vibration. The case of tactile vibration 
is interesting because the exponent of the 
power function varies with frequency. Ste- 
vens (1959) has reported that magnitude 
estimation of a 60-Hz. vibration to the fin- 
gertip follows a power function with an 
exponent of .95. Later, Stevens (1968) 
found that if a 60-Hz. vibration is adjusted 
to match a 125-Hz. vibration or a 250-Hz. 
vibration, the resulting power functions have 
exponents of .70 and .67, respectively. These 
two values may be multiplied by .95 to 
generate estimates of the exponents—.67 
and .64—of power functions obtained by 
magnitude estimation.
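
The chaining of exponents described above can be written out explicitly. The sketch below simply multiplies the magnitude-estimation exponent for the 60-Hz. vibration by each matching exponent, as the text describes; the names are hypothetical.

```python
# Magnitude-estimation exponent for a 60-Hz. vibration (Stevens, 1959).
ME_EXPONENT_60_HZ = 0.95

# Exponents of the matching functions obtained when a 60-Hz. vibration is
# adjusted to match a higher-frequency vibration (Stevens, 1968).
matching_exponents = {125: 0.70, 250: 0.67}

# Estimated magnitude-estimation exponents at the higher frequencies.
estimated_exponents = {
    frequency_hz: ME_EXPONENT_60_HZ * matching
    for frequency_hz, matching in matching_exponents.items()
}

for frequency_hz, exponent in estimated_exponents.items():
    print(f"{frequency_hz} Hz.: {exponent:.4f}")
```

The products are about .665 and .637, which appear in the text as .67 and .64.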

Knudsen (1928) measured DLs for vibration
at 64 Hz., 128 Hz., and 256 Hz., values which
are similar to those employed in Stevens'
scaling studies. He employed a method of
limits with both ascending and descending
series. His data are average values for two
Ss and are presented in the form of Weber
fractions. From his graphs, estimates of Δφ
can be derived, and when plotted against φ
(expressed relative to approximate
threshold), are well fitted by straight
lines. The slopes are .036, .046, and .046
for vibration frequencies of 64 Hz., 128 Hz.,
and 256 Hz., respectively.


Invariance of Δψ/ψ


The apparent stability in the computed
values of Δψ/ψ is impressive, especially in
view of the difficulty of demonstrating, first,
that the several values of Δφ/φ are comparable,
and second, that they are each preferable
to alternatives that may be found in the
literature. Despite these difficulties, the 
values of Table 1 raise the possibility that 
a constant percentage change in sensory 
magnitude is just discriminable, regardless 
of the nature of the stimulus. The view 
that the steepness of the psychophysical 
function is directly related to differential 
sensitivity was of course Fechner's idea, 
and has in recent years provoked severe 
criticism (e.g., Stevens, 1961), as well as
frequent restatements (e.g., Heinemann,
1961; Luce, 1959). But it is important 
to recognize, as Heinemann (1961) has 
pointed out, that this aspect of Fechner’s 
speculation is quite separable from his 
ill-fated hypothesis that jnd's define sub- 
jectively 
o of subjective 
magnitudes generated by stimuli one jnd 
apart, and the evidence of Table 1 shows that 
the value of this ratio is constant for a wide 


A word of caution may be useful. A critic
could of course dispute any one of the
empirical values in Table 1; the Weber
constants are especially vulnerable, and
there are undoubtedly anomalous cases that
do not fit the pattern. But it seems evident
that, without being subject to gross
distortion, many of the data in the
discrimination literature fall into a simple
and conceptually attractive pattern. What
remains to be seen is whether this pattern
extends over an even broader domain than is
indicated here, or whether it is liable to so
many exceptions that its value is lost.

In summary, one can point to two large
areas of data suggesting that the experience 
of sensory magnitude is governed by a single 
mechanism. Two of its properties have been 
described here; further research may reveal 
others.
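
One way to see the claimed constancy is to combine each Weber fraction with the corresponding power-law exponent. The sketch below assumes that sensory magnitude follows a power function ψ = kφ^n, so that a stimulus change of Δφ/φ changes ψ by the fraction (1 + Δφ/φ)^n - 1. Table 1 is not reproduced in this section, so the inputs are the values quoted in the text; pairing Knudsen's vibration slopes with Stevens' exponents at nearby frequencies is an assumption, and all names are hypothetical.

```python
def sensory_weber_fraction(stimulus_weber: float, exponent: float) -> float:
    """Fractional change in sensory magnitude for a just noticeable stimulus
    change, assuming the power law psi = k * phi ** exponent."""
    return (1.0 + stimulus_weber) ** exponent - 1.0


# (Weber fraction, power-law exponent) as quoted in the text.
CONTINUA = {
    "brightness": (10 ** -1.1, 0.33),  # antilog of the median log value
    "loudness": (0.048, 0.60),
    "finger span": (0.022, 1.30),
    "heaviness": (0.020, 1.45),
    "length": (0.029, 1.04),
    "saltiness": (0.083, 0.41),
    "saturation": (0.019, 1.70),
    "shock": (0.013, 2.50),
    "vibration, 60 Hz.": (0.036, 0.95),   # 64-Hz. slope, assumed pairing
    "vibration, 125 Hz.": (0.046, 0.67),  # 128-Hz. slope, assumed pairing
    "vibration, 250 Hz.": (0.046, 0.64),  # 256-Hz. slope, assumed pairing
}

fractions = {
    name: sensory_weber_fraction(weber, exponent)
    for name, (weber, exponent) in CONTINUA.items()
}

for name, value in fractions.items():
    print(f"{name:<20} {value:.3f}")
```

With these inputs every computed fraction falls near .03, although the stimulus Weber fractions themselves differ by a factor of about six.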


CONCLUSIONS 

Three hypotheses have been proposed to- 
gether with supporting evidence: 

1. There is a common scale of sensory 
magnitude for a wide variety of perceptual 
continua. 

2. The maximum range of log sensory 
magnitudes defined by this scale accounts 
for the inverse relation between power law 
exponents and the range of log stimulus 
intensities, when the latter are proportional 
to their maximum values (dynamic ranges). 

3. Data from a number of discrimination 
studies are consistent with the view that 
just noticeable changes occur when sensory 
magnitudes are altered by a constant fraction,
and that this fraction is the same regardless
of the form of input energy.


LETTERS TO THE EDITOR


ON ACQUIESCENCE AND SOCIAL 
DESIRABILITY 


In a recent article in this Journal, Dillehay
(Dillehay, R. C. Sincerity and dogmatism: A
reassessment and new data. Psychological
Review, 1969, 76, 422-424) criticizes an
article by Stanley and Martin (Stanley, G., &
Martin, J. How sincere is the dogmatist?
Psychological Review, 1964, 71, 331-334) as
not providing
a proper test of Rokeach's Hypothesis P: 
(Rokeach, M. The double agreement phenomenon:
Three hypotheses. Psychological Review, 1963,
70, 304-309). This hypothesis
asserts (a) that both original and reversed 
items on the F and D scales are high on social 
desirability; (b) that high scorers on the 
original scales disagree with reversed items;
but that (c) their response to reversed items 
is based on the social desirability dimension, 
resulting in scores apparently reflecting low 
scores on F and D. 

Stanley and Martin correctly deduce from 
this hypothesis that a positive correlation 
should exist between acquiescent subjects'
responses to reversed items on the D scale 
and their scores on a social desirability scale. 
Unfortunately, they do not make clear either 
in the summary or background that the corre- 
lation is deduced to exist for acquiescent re- 
spondents only and Dillehay's main criticism 
springs from his understandable failure to ap- 
preciate this. 

From Stanley and Martin's failure to find 
such a correlation, it does follow that they 
have "no evidence to support Rokeach's Hypo- 
thesis Bi [p. 333].” True, as Dillehay states. 
they do not “refute” the hypothesis, but this
is for the logical reason that failure to find
confirmation is not refutation.

Dillehay correctly deduces from Rokeach's
hypothesis that acquiescent respondents are
higher on social desirability than those who
are consistently high on dogmatism. His
findings support this deduction, but it is
weak confirmation, since there is no need to
assume active disagreement by acquiescent
respondents: uncertainty would be enough.

Each of Rokeach's three hypotheses may
account for some acquiescence, but Hypothesis
^ cannot account for all acquiescence, since 
factors such as the respondents' uncertainty
are important (Bethlehem, D. W. Dogmatism
and allied personality variables related to 
prejudice in young adults. Unpublished doc- 
toral dissertation, University of London, 1969). 


(Received September 21, 1970) 


Douglas W. Bethlehem 
Subject of Psychology 
University of Zambia 
P.O. Box 2379 
Lusaka, Zambia 


SUBJECTIVE TIME AND 
DECISION MAKING 


Pollay (Pollay, R. W. A model of decision 
times in difficult decision situations.
Psychological Review, 1970, 77, 274-281) assumes
that “most decision makers . . . will continue 
to expend effort in the analysis of a decision 
problem only as long as the expected returns 
from that effort exceed the costs of engaging 
in the analysis [p. 274]." It seems more
rational to assume that they “will continue . . .
as long as the expected returns from that
effort exceed the expected costs.” However,
the expected costs change with each
elimination of an alternative. Furthermore,
the expected returns may be inversely related 
to the expected costs, if the decision maker 
finds the decision-making process in itself grati- 
fying because he enjoys its challenges. This 
return may change with time as the decision 
maker becomes either more fascinated or more 
bored with the decision-making process.
Thus, time must be considered in terms of
subjective time for each decision maker and
not in terms of chronometer time which cannot
account for the subjective cost to the decision
maker. Subjective time, however, probably
does not produce the linear function assumed
by Pollay in his first "subsidiary assumption."
It is also not merely a function of the decision
maker's personality but of the latter in
relation to the total situation (including what
else the decision maker may want to do in the
near future besides deciding among the
alternatives presented to him in the
decision-making situation).

True indeterminacy is evident in the problem
Pollay attempts to solve when his model is
seen as an open system (Berrien, F. K.
General and social systems. New Brunswick,
N. J.: Rutgers University, 1968) in which the
decision to “quit or give up” is considered as
a decision within a suprasystem in which the
continuation of the experimental
decision-making situation is only one of
several alternatives.


(Received October 9, 1970)


Lewis W. Brandt
Department of P


LIMBIC-DIENCEPHALIC MECHANISMS
OF VOLUNTARY MOVEMENT 1


C. H. VANDERWOLF 2


University of Western Ontario


Data from a series of brain-behavior experiments utilizing techniques of
ablation, intracranial stimulation, and electrical recording emphasize the
importance of the concept of levels of function in understanding behavior.
Initiation, performance, and cessation of voluntary behaviors (e.g., walking,
manipulating objects) are associated with activation of ascending pathways
from the diencephalon to the hippocampus and neocortex. More automatic
behaviors (e.g., chewing, emotional expression) are not directly dependent
on these pathways, even when they occur as conditioned responses.
Electrical recordings indicate that hippocampal activity, in particular, is
closely related to behavior. Wave morphology, amplitude, and frequency are
all related to different aspects of concurrent motor activity. Problems of
the definition and mechanism of “voluntary” and “automatic” control of
movement are discussed.


1 Research reported here was supported by grants
from the National Research Council of Canada.
I am indebted to B. H. Bland, D. O. Hebb, D.
Hopkins, Doreen Kimura, R. S. Sainsbury, and
I. Q. Whishaw for helpful criticisms and
suggestions.

2 Requests for reprints should be sent to C. H.
Vanderwolf, Departments of Psychology and
Physiology, University of Western Ontario,
London 72, Canada.


This review is an attempt to summarize
several years of work on the behavioral
functions of the medial thalamus and
hippocampal formation and to speculate on the
neural mechanisms of higher level control of
movement.

In general, the results emphasize the
importance of the concept of levels of function
in understanding the neural control of
behavior. Hughlings Jackson observed that
voluntary movements are more affected by
cerebral lesions than automatic movements
(including reflexes and stereotyped or
rhythmical activities), suggesting that the latter
may normally be performed without the di- 
rect participation of higher centers. Con- 
temporary research on the effects of fore- 
brain damage in animals confirms Jackson’s 
observations. Further, the correlation of 
behavior with the spontaneous electrical ac- 
tivity of the brain yields results consistent 
with Jackson's hypotheses. During volun- 
tary types of movement such as locomotion 
or the manipulation of an object, there is an 
increased activation of the dorsal hippocam- 
pus and neocortex. This does not occur dur- 
ing behavioral immobility nor during the 
performance of automatic behaviors such as 
emotional expression or shivering. There- 
fore, there appear to be special cerebral 
mechanisms which are required for the 
performance of voluntary movements but are 
not necessary for the maintenance of a fixed 
posture or the performance of automatic 
movements. 

There is no generally accepted objective
definition of “volition” or of “voluntary
movement” and the difference between it
and “automatic movement.”
bilities are that voluntary acts are those that 


Automatic motor activities comprise reflexes 
as well as a number of activities which are 
not obviously reflexive but occur only in 
association with particular drive states,

It is each distinguishable 
with the activation 
of a cluster of automatic responses unique 
drive and whose components are 
interconnected at a hypothalamic or lower 


level, “feeling cold” is 
associated with 


shivering, 


controlled by hunger, thirst, fear, sexual or
maternal drive, etc., on different occasions.
Motor activities such as shivering or
salivation are more rigidly connected with
particular drive states.

The coupling between a drive state and a
voluntary movement is assumed to occur via
intermediary processes in the forebrain
which establish a “program” determining the
selection


Overt behavior results when a program is
activated by means of a trigger mechanism,
largely located in the diencephalon. There
may be an interaction between this
movement-triggering mechanism and a second
system whose activation results in behavioral
immobility. The occurrence of automatic
motor activity is not directly dependent on
these intermediary processes.

When different behaviors are deliberately
conditioned in tests of learning, they appear
to depend on the same response mechanisms
as when 




BEHAVIORAL EFFECTS OF MEDIAL
THALAMIC DESTRUCTION


Behavioral studies have been carried out
on a large number of rats in which the
medial thalamic nuclei had been destroyed
electrolytically. The lesions were relatively
extensive, involving the medial and midline
nuclei as well as the intralaminar nuclei and
habenula. Details of most of the work are
provided in earlier publications (Vanderwolf,
1962, 1963a, 1964, 1966, 1967b, 1968, 1969b).

The behavioral changes produced by large
medial thalamic lesions can be grouped into
three successive phases: a hyperactive phase,
an akinetic phase, and a chronic “recovered”
phase. Immediately following surgery, while
the rats are still partially anaesthetized,
they become extremely active, usually running
forward or circling but occasionally
jumping repetitively. A few animals eat
or gnaw at protruding objects. These
behaviors may occur almost continuously for
several hours. One rat, placed in an activity
wheel, ran approximately three miles in an
hour and a half. Usually the rats were kept
in plastic buckets or small cardboard boxes
during the period of hyperactivity to prevent
self-injury or fatal exhaustion. Seizures
occurred in some cases. The day following
surgery, all traces of hyperactivity had
ceased, the rats having become almost totally
motionless for long periods.
not eat or drink Without 




Were made to regain the spout 
and the licking and biting Movements would 
stop, but could be restarted if the Spout was 

a second time, Tn contrast, more 
acts such as spontaneous walking 
€covery from 
tic phase was gradual, but within 
2-3 weeks the rats wer easily distin- 
Perate control animals 
They gained weight 
od and water were continuously available,
groomed themselves adequately, and did not
exhibit any abnormality in motor performance.
That is, such acts as walking, running, etc.,
were performed normally. At times, muscular
tonus seemed somewhat subnormal throughout the
entire body, but this was never sufficient to
interfere with normal posture.

For several days following surgery, a few
of the rats exhibited exaggerated defensive
behavior somewhat like the syndrome produced
by septal damage. Thus, a severely
akinetic rat might scratch and bite savagely
when handled. In order to minimize this
type of behavior, rats were always housed in
groups of 2-10 rats in a large cage. (Even
normal rats become savage when kept in
isolation for long periods: Hatch, Wiberg,
Balazs, & Grice, 1963.) The net result was
that the medial-thalamus-damaged rats
tended to be tamer than normal or sham
operate rats which were similarly housed.
This observation has been confirmed and
extended by Wishart (1968) who used a
rating scale technique to assess emotionality
following medial thalamic lesions.

Behavioral testing was usually begun 2-3
weeks following surgery. An early finding
was that medial-thalamus-damaged rats were
almost totally unable to avoid an electric
shock in a one-way avoidance situation.
The task required that the rats learn to run
from a black compartment, in which they had
been manually placed, into a white one in
less than 5 seconds. When first shocked in
this test, the rats reacted in a relatively
normal fashion, exhibiting startle, running,
squealing, defecation, piloerection,
exophthalmia, etc. However, after several
training trials, the thalamus-damaged rats all
became unusually docile and easy to handle
as compared with normal rats undergoing
the same treatment. When placed in the
black (shock) compartment, they remained
completely motionless until shock was
presented, whereupon they ran rapidly off the
grid. This behavior usually persisted
unchanged even after several hundred training
trials. That is, avoidance behavior was
virtually absent, but escape behavior was
relatively intact. This dissociation has now
been demonstrated in a variety of avoidance


test situations (Bohus & de Wied, 1967;
Cardo, 1961; Delacour, Albe-Fessard, &
Libouban, 1966; Olton & Isaacson, 1967;
Thompson, 1963; Vanderwolf, 1962, 1963a).
The effects are similar if medial thalamic 
lesions are made either before or after train- 
ing or if the rats are tested in either a one- 
way or two-way apparatus (shuttle box). 
Further, the deficit does not disappear with 
time. Rats trained as long as a year after 
surgery show a typical failure to avoid. 

Several factors should be considered in 
the interpretation of these results. Fibers
of the spinothalamic system (pain afferents) 
terminate in the intralaminar nuclei (Bow- 
sher, 1961; Magoun, 1963; Mitchell & 
Kaelber, 1967), and various types of fear or 
escape behavior can be elicited by stimula- 
tion in the medial thalamic area (Black & 
Vanderwolf, 1969; Olds & Olds, 1963;
Roberts, 1962). Thus, the effect of medial 
thalamic lesions on avoidance behavior could 
be the result of reduced fear or an impairment
of pain perception. It is also possible
that impaired learning ability is responsible 
for the effect. Gastaut, Jus, Jus, Morrell, 
Storm van Leeuwen, Dongier, Naquet,
Regis, Roger, Bekkering, Kamp, and Werre
(1957) have suggested that the thalamic un- 
specific system (which was extensively dam- 
aged in the foregoing experiments) is the 
site of the “temporary connection” produced 
by conditioning procedures. 

It is unlikely, however, for reasons given
below, that any of these factors are respon- 
sible for the loss of active avoidance be- 
havior observed in rats with medial thalamic 
damage. After several days of avoidance 
training, such rats often began to squeal 
when placed in the black (shock) compart- 
ment although they made no visible attempt 
to run away. They did not squeal (or 
squealed less) when placed in the white 
(safe) compartment. This indicates that 
some fear responses can be conditioned (in 
Pavlov's sense) in a rat with medial thalamic 
damage. A similar result was reported by
Doty, Beck, and Kooi (1959) who found
that in cats in an aversive conditioning
situation, large medial diencephalic and
midbrain lesions abolished a leg flexion
response but not a respiratory conditioned
response. Autonomic responses to electric
shock, such as defecation (Vanderwolf, 1962)
and heart
rate acceleration (Delacour & Santacana, 
1967), can also be conditioned in rats follow- 
ing medial thalamic destruction. Further, 
medial thalamic lesions do not impair
performance in tests of passive avoidance in
which animals avoid an electric shock by
remaining motionless (McNew & Thompson,
1966; Vanderwolf, 1962; Wishart, 1968). 
Finally, thalamic lesions placed in the pos- 
terior ventral nucleus raise the pain 
threshold, but do not necessarily impair 
avoidance performance. On the other hand,
medial thalamic lesions can depress avoid- 
ance performance without raising the pain 


Large medial thalamic lesions impair 
learning ability as measured by complex 
maze tasks (Delacour et al, 1966; Gross, 


avoidance tests requiring active 
movement because of inability to learn in 
this particular type of task, even though they 


responses 
However, rats
give evidence of an ability to learn what to 
do in an aversive situation even though they 
avoidance responses, 
large medial thalamic lesions
do not prevent good performance in a three- 
choice brightness-discrimination task moti. 
escape from electric shock (Van- 
derwolf, 1969b). Further, in the standard 
one-way test, medial-thalamus-damaged rats 
that fail to avoid, though usually totally 
motionless, sometimes make intention
movements which appear to be the first stages of
an avoidance response. For example, they 


escape threshold for stimulation of the 
E P is raised p, destruction of the para- 

fa m : 

Gi quar Ucleus, 4 finding Which may be related 


the medial location of tri eminal afferents 
(Mitchel & Kaelber, 1967). , l 


3 The 
tooth py 


running movements themselves do not ap- 
Similarly, if the task used requires 
a box (11 inches deep) to 
many of the rats adopt a ver- 
for jumping, al- 


Sufficient time is allowed in such cases, an 
avoidance response may eventually be made. 
Thus, avoidance performance can be im- 
proved considerably if the time allowed for 
a response is increased from 5 to 15 seconds 
(Vanderwolf, 1962).

It is perhaps worth emphasizing that rats
with large medial thalamic lesions are not 
deficient in primary motor abilities. On 
those occasions when the rats actually make
an avoidance response, the movements are 
normal. For example, in jumping out of a 


box, they leap accurately to the edge of the 
apparatus and climb out. 


In conclusion, a rat with a large medial
thalamic lesion is grossly deficient in the
ability to avoid a painful shock even though
it is not paralyzed in the usual sense and
possesses adequate sensory and learning
abilities and motivational capacity. The
defect appears to lie in the initiation of
movement, in dysfunction of a mechanism
linking perceptual, mnemonic, and
motivational systems to the voluntary motor
system. There
net phases in higher level 
motor control. First, a programming
mechanism must select from a large number of
possible movements those which are
appropriate in a given situation. The central
representations of these movements must be
maintained in a subthreshold state of
excitation for some time in such a way that they

be activated in a Particular sequential 
order (see Lashley, 1951; Milner, 1970, pp. 
92-98 hen, at an appropriate moment, a 
"hatever motor 
Programmed, Since 
aged rats usually per- 


learning 


Ove at all, the lesion appears 
1 € Severely with the triggering 
than With selec movement, Trigger. 
be accomplished by an external
stimulus, since the rat performs appropriately
when shock is applied to the feet
(escape behavior). 

The existence of a movement-triggering 
mechanism involving the diencephalon is 
supported by experiments on intracerebral 
stimulation in conscious animals. There are 
points in the medial thalamus (Endróczi, 
Yang, Lissák, & Medgyesi, 1959) and pos- 
terior hypothalamus where electrical stimula- 
tion invariably produces motor activity of 
some sort, but the actual pattern may vary— 
running, jumping, climbing, digging, etc.— 
(from the same placement with the same 
stimulation parameters) depending on the 
environmental circumstances. In contrast, 
points in the midbrain tegmentum, for ex- 
ample, determine not only the occurrence but 
also the pattern of movement. Environ- 
mental factors have little effect. Thus, dur- 
ing forced circling elicited from the midbrain, 
a rat may be driven into obstacles or off the 
edge of a table (Vanderwolf, Hopkins, & 
Bland, unpublished observations). The be- 
havior elicited in the first case corresponds 
to what one might expect from the activation 
of a movement trigger. In the latter case, 
a specific motor sequence appears to be acti- 
vated more directly. 

An important point in the interpretation 
of the effect of medial thalamic lesions is that 
not all motor patterns are equally affected. 
In an aversive situation, respiratory and 
autonomic responses can be conditioned 
readily in the experimental animals, but 
running and jumping cannot. Such phe- 
nomena call to mind the suggestion of Hugh- 
lings Jackson (see Taylor, 1958) that dif- 
ferent motor activities can be arranged con- 
ceptually in a rough scale extending from 
highly automatic movements such as respira- 
tion, cardiac and vasomotor activity, and 
simple somatomotor reflexes, to highly vol- 
untary behaviors among which were included 
speech, manipulation, and directed locomo- 
tion. Following Hughlings Jackson, one 
may say that medial thalamic lesions
interfere more with voluntary behavior in a
fear-producing situation than with the more
automatic behaviors.

There is a difficulty. If medial thalamic
lesions interfere with the operation of a
mechanism which triggers voluntary
movements, one would expect an effect on a
variety of different behaviors. In fact, many
apparently voluntary movements appear un- 
affected. Rats with large medial thalamic 
lesions perform well in simple appetitive 
tests and can easily press levers, run down 
alleys, etc., to obtain food or water (Re- 
plogle, 1960; Vanderwolf, 1962, 1967b). 
This behavior contrasts sharply with the 
poverty of spontaneous movement exhibited 
by the same animals in an aversive situation. 
However, several observations suggest that 
initiation of movement may be impaired by 
medial thalamic lesions even in nonaversive 
situations. When medial-thalamus-damaged 
rats, which had been resting quietly in their 
cages, were suddenly placed on a familiar 
tabletop, they did not begin to walk about 
as quickly as normal animals. They showed 
no sign of fear and became quite active 
within a minute or so. This effect was
demonstrated quantitatively in two
situations. In one test, lights and a buzzer were
presented to animals in their home cages and 
the latency of an investigative response 
(rearing) was recorded. In a second test, 
the time taken to leave a small platform was 
recorded. Medial-thalamus-damaged rats 
were slow to initiate movement in both tests 
(Vanderwolf, 1969b). 

A clue to the nature of the deficit pro- 
duced by large medial thalamic lesions is 
provided by the immediate postoperative be- 
havior of the experimental animals. The 
initial hyperactivity can be attributed to a 
discharging lesion (resulting from injury 
discharges, irritation by metallic deposits, 
etc.) in a movement-controlling system. 
The fact that behavioral seizures sometimes 
accompany the hyperactivity supports such 
a hypothesis. The ensuing akinesia may be 
attributable to a depression of the same
system (in part only, since the rats are
undoubtedly very fatigued) from which there
is a gradual recovery during the next few
weeks. Ultimately, only traces of the
earlier akinesia can be observed, and the rats
are actually hyperactive in some situations. 
Thus, medial thalamic lesions appear to pro- 
duce a mild generalized impairment in the 
initiation of movement, but in a chronic
preparation, the defect is obvious only in
aversive situations.

Medial-thalamus-damaged rats which are
actually hyperactive in some nonaversive
situations may exhibit a total absence of
spontaneous movement during avoidance
training. Observation of normal rats helped
to clarify this puzzling situation. When a
rat is frightened, it may freeze, that is,
remain alert but completely immobile for long
periods. When head or locomotor
movements occur, they often start or stop in an
abrupt fashion, in contrast to the gradual
buildup of such movements in an
undisturbed rat. There appear to be two
states, one resulting in alert immobility, the
other resulting in movement, perhaps by
activation or deactivation of the trigger
system discussed above. One might suppose,
if the two states act reciprocally, that when
activity in the trigger system is depressed
by medial thalamic damage, the
immobility-producing system would dominate behavior
in aversive situations. The trigger system
might still work fairly well in appetitive
situations in which the immobility-producing
system is not activated.

A simpler explanation than this might
seem to be that the effect of medial thalamic
damage is merely to increase the normal
tendency to freeze (as a positive response)
in emotional situations. Such an explanation
is logically unsatisfactory, however, because
the primary effect of a brain lesion cannot
be to add something to the behavioral
repertoire. It must always depress some
function, even though other functions may be
"released" and thus appear to be
strengthened, a point made by Hughlings Jackson
many years ago. In fact, in absolute terms,
the tendency to freeze is probably reduced
by a medial thalamic lesion. Rats with
medial thalamic damage freeze less than
normal during exploration of a novel
environment (Vanderwolf, 1969b), and they also
[...] less than normal under these
circumstances (Vanderwolf, 1962). This, as
well as the observation that a medial
thalamic lesion produces a rat which is
hyperactive in some situations and totally
immobile in others, suggests that more than one
functional system is disrupted. One may
speculate that medial thalamic damage raises
the threshold of the central fear mechanism
slightly in addition to its effect on the
initiation of movement.

There is considerable evidence that the 
thalamic unspecific system contains at least 
two different functional subsystems, since it
can exert either activating or deactivating
influences on behavior and cortical electrical
activity (Monnier, Hösli, & Krupp, 1963).
For example, different workers have
reported that avoidance responses can be either
suppressed (Bloch & Hebb, 1956) or
facilitated (Cardo, 1967) by stimulation of the
medial thalamus. According to Buser
(1966), the posterior intralaminar nuclei
form part of an activating pathway ascending
to the sensorimotor cortex, while the more
rostral midline nuclei exert a deactivating
influence. Both these sets of nuclei, as well
as the thalamic dorsomedial nucleus, whose
destruction is said to result in behavioral
hyperactivity (Warren & Akert, 1960),
were extensively damaged in the
experiments discussed here. Therefore, complex
effects on behavior are to be expected.

It has been shown that a variety of
treatments including destruction of the subcallosal
cortex or the septal nuclei (McCleary, 1961,
1966) and electroconvulsive shock
(Vanderwolf, 1963b) interfere with the ability of
rats to remain motionless in such situations
as tests of passive avoidance and conditioned
suppression. This result has been taken to
mean that the function of somatomotor
inhibitory mechanisms was impaired.
Presumably, the normal balance between the
tendency to move and the tendency to
remain motionless in an aversive situation was
tipped in favor of movement. Perhaps the
somatomotor inhibitory mechanisms of the
forebrain exert an effect on behavior by
modulating activity in the movement trigger
system. If so, it might be possible to
improve avoidance performance in rats with
medial thalamic lesions by reducing activity
in the movement-inhibiting system, thus
tending to restore the normal balance
between movement and immobility. This was
attempted in two experiments.
Medial-thalamus-damaged rats were tested on a
one-way avoidance task, given electroconvulsive
shock, septal lesions, or a control treatment, 
then retested on the one-way task. Both 
electroconvulsive shock and septal damage 
produced an improvement in performance in 
about half the rats tested, while control rats 
remained unchanged (Vanderwolf, 1964, 
1968). These findings support the hypo- 
thesis that septal damage and electroconvul- 
sive shock disrupt mechanisms which nor- 
mally control activity in a movement trigger 
system. 

The septal nuclei, like the medial thalamic 
nuclei, must form part of many functional 
systems. It is widely held that a major 
role of the septal nuclei is to inhibit somato- 
motor activity. However, Peacock and 
Hodes (1951) demonstrated a heavy con- 
centration of points in the septal-preoptic 
area which facilitated movements produced 
by cortical stimulation. Other points in the 
same general region had an inhibitory effect 
(Hodes, Peacock, & Heath, 1951). It has 
also been shown that stimulation of the septal 
nuclei usually results in an increased
discharge rate of cells in the centromedian
nucleus of the thalamus (Sommer-Smith,
Powarzynski, Stirner, & Grünberg, 1965).
Thus, both somatomotor inhibitory and fa- 
cilitatory systems appear to be present in 
the septal nuclei, and a large lesion might be 
expected to produce a complex behavioral 
change. This may account for the finding 
that although septal lesions usually have a 
"disinhibitory" effect on behavior in aversive
situations (impairment of passive avoidance,
facilitation of avoidance in a shuttle box,
etc.), they produce an impairment in "one-way"
tasks in which the rat must run or
jump out of a box (Kenyon, 1962; Rich &
Thompson, 1965), and they reduce
spontaneous running in activity wheels (Dirlam,
1969; Douglas & Raphelson, 1966).

The hypotheses outlined here may be of 
some use in understanding a number of the 
phenomena associated with shock avoidance 
behavior in intact animals. For example, 
one might expect that engaging the animals 
in some sort of motor activity immediately 
before a test session might activate the
trigger system enough to improve avoidance
performance. In fact, pretest exploration of
a novel environment does improve avoidance
performance in both normal (Posluns &
Vanderwolf, 1970) and
medial-thalamus-damaged rats (Vanderwolf, 1966).
Handling normal rats during the intertrial
interval (producing struggling movements) has
a similar effect (Wahlsten, Cole, Sharp, &
Fantino, 1968). The effect is probably not
restricted to learned tasks. Activities such as 
exploration or swimming reduce response la- 
tency in a simple locomotor task (stepping 
from a small platform to a table) in which 
learning plays a minimal role (Vanderwolf, 
1969b). However, the effect may be re- 
stricted to more voluntary motor patterns, 
since a conditioned vocal response was not 
facilitated by pretest exploratory behavior 
(Vanderwolf, 1966). 

Normal rats trained in a two-way (shuttle 
box) avoidance test with the usual delay- 
conditioning procedure behave in many ways 
like medial-thalamus-damaged rats in a one- 
way test. Acquisition is improved by septal 
damage (King, 1958) or electroconvulsive 
shock (Vanderwolf, 1963b) or by increasing
the time during which an avoidance response
may occur (CS-UCS interval) (Black,
1963; Low & Low, 1962; Schwartz, 1958).
The latter effect is attributable to better
performance rather than more efficient learning
(Anderson, 1969). One could speculate that
the normal rats' poor performance in the 
two-way apparatus is due to an imbalance 
between immobility and movement trigger- 
ing under these particular conditions. It is 
interesting that dogs, which show little
tendency to freeze in fearful situations, perform
much better than rats in a shuttle box, and
do not improve their scores greatly when
the CS-UCS interval is increased (Brush,
Brush, & Solomon, 1955; Church, Brush,
& Solomon, 1956). Both results suggest
ready operation of the movement trigger.
One might expect comparable results in
other mammals which have little tendency
to freeze, such as mustelids, for example.
In contrast, one might expect that cats,
which seem to have an efficient mechanism
for remaining motionless when excited (e.g.,
when watching a mouse hole), would be
poor performers in a shuttle box and would
perform better as the CS-UCS interval was
increased. In fact, cats do not perform well
in a shuttle box, but it has not been shown 
that their performance can be improved by 
an increased CS-UCS interval, although the 
data reported by Schrier (1961) show a 
trend in that direction. 

One of the main conclusions of the work 
on medial thalamic damage is that higher 
level motor control can be disrupted by a 
lesion which leaves perceptual-mnemonic 
processes relatively undisturbed. This point 
is demonstrated very clearly in certain 
neurological conditions in man. Lhermitte, 
Gauthier, Marteau, and Chain (1963)
described a case in which an ischemic lesion
had destroyed the core of the brainstem
from the pontomesencephalic junction to the
mammillary bodies and massa intermedia.
The patient was almost totally mute and
akinetic. She could, however, flex one arm
and by this means could indicate the
correct answers to rather complicated questions,
indicating that perceptual and cognitive
processes were relatively intact. A
temporary but otherwise analogous condition
occurs in sleep paralysis and cataplexy
(Cave, 1931; Daniels, 1934; Wilson, 1928)
and some types of petit mal attacks (Tizard
& Margerison, 1963). During attacks,
patients may be totally unable to move but are
aware of ongoing environmental events as
shown by a correct verbal report following
recovery. Attacks of cataplexy are often
precipitated by strong emotion (especially
laughter or fright), a fact which suggests a
parallel to the effect of medial thalamic
damage in the rat.

A different type of akinesia with relatively
good preservation of perceptual-mnemonic
abilities is observed in severe Parkinson's
disease. In this case, inability to initiate
locomotion, for example, appears due to a
disturbance of postural mechanisms.
Patients may be able to stand, or even raise
their feet one at a time, but fail to walk
because they cannot lean forward or rock
from side to side, as is required for normal
locomotion. With assistance (being held
[...] slightly and rocked from side to
side) they walk quite well (Martin, 1967).
The deficit in such cases appears to lie more
with the execution than with the triggering
of movement, and appears to have little in
common with the defect produced by medial
thalamic damage.


The status of perceptual-mnemonic abil- 
ities in animals made totally akinetic by cen- 
tral lesions has never been adequately 
studied. Ingram, Barris, Fisher, and Ran- 
son (1936) were of the opinion that cats 
with lesions of the posterior hypothalamic
area were "conscious" although they did not 
move about. Such a hypothesis is consistent 
with the observation that cats made akinetic 
by posterior hypothalamic lesions may ex- 
hibit good activation of the cortical electro- 
encephalogram (EEG) but no concomitant 
behavioral arousal (Feldman & Waller, 
1962; Lindsley, Schreiner, Knowles, &
Magoun, 1950).


RELATION OF DIENCEPHALIC AND
HIPPOCAMPAL EEG TO SOMATOMOTOR
ACTIVITY
Two general conclusions were suggested
by the experiments on the behavioral effects
of medial thalamic damage. First, that there
are considerable differences in the neural
organization of voluntary and automatic
motor activities; second, that structures in the
medial diencephalon form part of a trigger
mechanism involved in the initiation of
voluntary movement.

EEG recording has provided a supplement
to ablation as a method of studying higher
level control of movement. Clear relations
[...] activity and the [...] in the dorsal
[...] thalamus (Vanderwolf [...] & Heron,
[...]) [...] was done on the
dorsal hippocampus, and this will be
discussed here.

It is useful to classify the EEG derived
from the dorsal hippocampal pyramidal cells
in the waking rat into three main types, as
was done previously by Stumpf (1965b) in
the rabbit. Rhythmical slow activity (RSA)
consists of trains of nearly sinusoidal waves
of a frequency varying from about 6 to 12
hertz (Hz.). Amplitude varies widely
depending on the exact location and
orientation of the electrode tips with respect to
the dipole of the pyramidal cell (Green,
Maxwell, Schindler, & Stumpf, 1960), but
under optimal conditions (when the tips 
of the electrode pair lie on either side of the 
cell layer), potentials of 2 millivolts or more 
can be obtained. At a given electrode site, 
both amplitude and frequency of RSA vary 
in relation to the behavior of the animal. 

Large-amplitude irregular activity (LIA) 
is a second waveform which differs from 
RSA in that it has a component of very 
slow waves and lacks the rhythmical char- 
acter of RSA. Large-amplitude spikes with 
a duration of 50-100 milliseconds may occur 
at irregular intervals. 

A third waveform is small-amplitude 
irregular activity (SIA) which appears as 
a sudden suppression of hippocampal activity 
and ordinarily lasts only 1-2 seconds, in con- 
trast to the two preceding types which may 
continue steadily for many minutes. 
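
The three waking patterns described above can be told apart, in a rough way, by overall amplitude and by how much of the signal power falls in the 6-12 Hz. band. The following Python sketch is only an illustration under stated assumptions and is not the procedure used in the work described here, where records were classified by inspection. The function names, the 0.5 band-fraction cutoff, the amplitude threshold (in arbitrary units), and the synthetic test signals are all hypothetical.

```python
import cmath
import math


def band_power_fraction(samples, fs, lo=6.0, hi=12.0):
    """Fraction of non-DC spectral power between lo and hi Hz (naive DFT)."""
    n = len(samples)
    mean = sum(samples) / n
    centered = [s - mean for s in samples]
    total = 0.0
    in_band = 0.0
    for k in range(1, n // 2 + 1):
        coeff = sum(x * cmath.exp(-2j * math.pi * k * i / n)
                    for i, x in enumerate(centered))
        power = abs(coeff) ** 2
        total += power
        if lo <= k * fs / n <= hi:
            in_band += power
    return in_band / total if total > 0.0 else 0.0


def rms(samples):
    """Root-mean-square amplitude after removing the mean."""
    mean = sum(samples) / len(samples)
    return math.sqrt(sum((s - mean) ** 2 for s in samples) / len(samples))


def classify_segment(samples, fs, sia_rms=0.1, rsa_fraction=0.5):
    """Label a segment 'SIA', 'RSA', or 'LIA' using hypothetical thresholds."""
    if rms(samples) < sia_rms:          # suppressed record: small amplitude
        return "SIA"
    if band_power_fraction(samples, fs) >= rsa_fraction:
        return "RSA"                    # power concentrated at 6-12 Hz
    return "LIA"                        # large amplitude, not rhythmical at 6-12 Hz


def sine(freq, amp, fs, seconds):
    """Synthetic test signal (assumption, not recorded data)."""
    return [amp * math.sin(2 * math.pi * freq * i / fs)
            for i in range(int(fs * seconds))]


fs = 100.0
rsa_like = sine(8.0, 1.0, fs, 2.0)
lia_like = [a + b for a, b in zip(sine(2.0, 1.0, fs, 2.0),
                                  sine(3.5, 0.8, fs, 2.0))]
sia_like = sine(8.0, 0.05, fs, 2.0)
print(classify_segment(rsa_like, fs),
      classify_segment(lia_like, fs),
      classify_segment(sia_like, fs))  # prints: RSA LIA SIA
```

The amplitude test is applied first because a suppressed record may still contain a weak 6-12 Hz. component. Real records would also need handling of the fast activity (15-50 Hz.) mentioned below, which this sketch ignores.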

All three of the foregoing patterns will 
occur at a single placement in the hippocam- 
pus. The particular pattern exhibited de- 
pends on the behavior of the animal. Elec- 
trical activity in the dentate gyrus CA-4 
area does not exhibit a comparable variation 
and, to visual inspection, consists of a con- 
tinuous pattern of fast activity (mostly 15- 
50 Hz.) with an amplitude up to 1 millivolt. 
However, the dentate gyrus CA-4 area does 
exhibit an arousal reaction similar to the 
neocortex. Fast activity also appears in 
hippocampal (pyramidal cell layer) record- 
ings, but often has a lower amplitude. At 
the same amplification, a hippocampal record 
contains more slow activity (less than 15 
Hz.) than a dentate gyrus record, whether 
RSA or LIA is present. 

If electrodes are placed at random in the
hippocampal formation, mixtures of RSA
(usually of rather small amplitude) plus a
good deal of fast activity are very frequently
encountered. Such patterns presumably
represent an intermingling of activity from
the hippocampus, dentate gyrus, and
subiculum.

During drowsiness and slow-wave sleep,
various other patterns of irregular
large-amplitude slow waves, spikes, and spindles
occur throughout the hippocampal formation.
These waveforms will not be discussed in
detail here.


The patterns of EEG activity in the dorsal 
hippocampus are related to concurrent motor 
activity in a consistent way. This correla- 
tion has been studied most extensively in the 
rat (Vanderwolf, 1969c; Whishaw & Van- 
derwolf, 1970) and guinea pig (Sainsbury, 
1970), but has also been examined, in this 
laboratory, in the rabbit, Mongolian gerbil, 
and cat. Observations have been reported in 
the dog by Black, Young, and Batenchuk
(1970), Dalton (1968), Lopes da Silva and 
Kamp (1969), Storm van Leeuwen, Kamp, 
Kok, and Tielen (1967), and Yoshii, Shimo- 
kochi, Miyamoto, and Ito (1966). Dif- 
ferences between the various species appear 
to lie mainly in the frequencies of EEG activ- 
ity and in the ease with which good RSA 
can be obtained. 

RSA accompanies voluntary types of 
movement, including walking, or running 
straight ahead, or backing up, turning,
rearing, jumping, climbing, struggling when
held, swimming (studied in rat only), head
movements, postural changes (such as
shifting weight from one foot to another, getting
up, lying down, etc.), manipulation (such 
as handling a food pellet or pressing a lever 
in a Skinner box) and digging in sawdust. 
Within this group of motor activities, there 
is a relation between the extent of the move- 
ment (total motor units activated per unit 
time?) and the amplitude of the accompany- 
ing RSA. Rearing and jumping are asso- 
ciated with large-amplitude RSA, walking 
with somewhat smaller amplitude, handling 
an object with still smaller amplitude, and 
so on. Tiny movements of the head or one
limb are associated with RSA of minimal
amplitude, sometimes only 20%-25% of the
maximum occurring at a particular
placement. The amplitude of the RSA
accompanying a given motor act is not rigidly
fixed, but varies (as does the motor act
itself) from one occasion to another. For
example, a sudden fast run, as when a rat
has been startled, is usually accompanied by
a larger amplitude RSA than ordinary
walking.

Variations in amplitude of RSA are
probably responsible for the following
phenomenon. At optimal electrode placements, acts
such as walking or isolated movements of
the head or limbs are always accompanied
by RSA, but at mixed activity sites (RSA
plus a great deal of fast activity) it may be
found, for example, that while RSA is
present on 100% of the occasions that a rat
rears, it may appear on only 75% of the
occasions when it manipulates a piece of food or moves
the head sideways slightly while standing.
In such cases, presumably, only
large-amplitude RSA can be "seen" by the recorder.


The frequency of hippocampal RSA
usually varies over a range of about 6-12 Hz.
in the rat. Somewhat surprisingly, there
is very little relation between the amplitude
and period of individual waves when
samples are taken during a variety of
behaviors, suggesting that wave period
(frequency) and amplitude are controlled by
different mechanisms. [...] Frequency increases, on the
other hand, are associated with the
initiation of movement.
The latter point can be clearly
demonstrated in a shock-avoidance task
which requires rats to jump out of a box.
This task has several advantages: (a) the
moment of initiation of muscular activity can
be defined accurately, (b) a large movement
is required, ensuring large-amplitude RSA,
and (c) the use of shock reinforcement
suppresses irrelevant spontaneous motor
activity, permitting the observation of a sudden
transition from immobility to vigorous
movement. Using well-trained rats which
exhibit a minimum of irrelevant motor activity
immediately preceding the jump response, it
can be shown that RSA with a frequency of
[...] Hz. precedes the jump by several
seconds. Between 1 and 2 seconds before the
jump, the frequency rises, reaching a peak
value, usually 8-10 Hz., just before or at
the moment that overt motor activity begins.
Similar frequency shifts accompany the
initiation of running, whether it occurs as an
avoidance response, or spontaneously, or
in a motor-driven activity wheel.

If motor activity continues steadily
once it has been initiated, RSA frequency
declines, although the mean frequency remains
somewhat higher than during the period of
preparation for movement. This has been
demonstrated in a motor-driven activity wheel
where a rat can be made to run as long as
[...] hours without stopping. Apart from the
frequency shifts associated with the onset of
running, RSA persists unchanged throughout this
period (Whishaw & Vanderwolf, 1970).
Thus, initiation of movement is accompanied
by a high-frequency burst of RSA, but RSA
continues at a lower frequency as long as
movement continues.
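
A rise in RSA frequency around the onset of movement could, in principle, be quantified by measuring the period of individual waves. The sketch below is a minimal illustration, assuming a roughly sinusoidal record that has already been filtered to the RSA band; it estimates mean frequency from the intervals between upward zero crossings. The function names and the synthetic 7 Hz. and 9 Hz. signals are hypothetical and are not values taken from the experiments.

```python
import math


def upward_crossing_times(samples, fs):
    """Times (s) at which the signal crosses zero going upward."""
    times = []
    for i in range(1, len(samples)):
        prev, cur = samples[i - 1], samples[i]
        if prev < 0.0 <= cur:
            frac = -prev / (cur - prev)  # linear interpolation between samples
            times.append((i - 1 + frac) / fs)
    return times


def mean_frequency(samples, fs):
    """Mean frequency (Hz) over successive wave periods; None if fewer than 2 crossings."""
    times = upward_crossing_times(samples, fs)
    if len(times) < 2:
        return None
    return (len(times) - 1) / (times[-1] - times[0])


def sine(freq, fs, seconds, phase=0.3):
    """Synthetic test signal (assumption, not recorded data)."""
    return [math.sin(2 * math.pi * freq * i / fs + phase)
            for i in range(int(fs * seconds))]


fs = 1000.0
before = mean_frequency(sine(7.0, fs, 2.0), fs)    # hypothetical pre-movement record
at_onset = mean_frequency(sine(9.0, fs, 2.0), fs)  # hypothetical record at movement onset
print(round(before, 2), round(at_onset, 2))
```

Measuring individual wave periods, rather than a spectrum over a long window, keeps the estimate local in time, which matters when the shift of interest develops over only 1 or 2 seconds.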

Finally, it has been found that small move- 
ments, such as handling food or moving the 
head slightly without other movement, are 
accompanied by RSA of a lower mean
frequency than is seen during major
movements, such as walking or rearing. The
relation of this to the other findings has not yet
been determined.


[...] For example, the RSA accompanying walking
seems much the same during exploration,
food getting, shock avoidance, or forced
running in a motor-driven activity wheel.

Behavioral immobility is usually
accompanied by LIA. A rat standing motionless
on four legs, on two legs, or hanging
motionless by the forepaws from a support will
exhibit this EEG pattern. Thus
maintenance [...] of muscular tonus
[...] commonly regarded as relatively
[...] licking, chewing,
[...] as a component of de[...]
rat and guinea pig),
[...] anything in the mouth,
[...] (observed in one rat
[...] food stuck in its
[...]), piloerection (observed in rat and
[...]), [...] defecation, pelvic
thrusting and ejaculation during coitus
(studied in guinea pig only), face washing,
licking and biting the fur on any part of
the body, scratching with one hind foot, 
vocalization (squealing in rat, barking in dog, 
etc.), shivering (observed in rat and guinea 
pig), and stamping on the ground with the 
hind feet (observed in the gerbil by Kramis 
& Routtenberg, 1969). As in the case of 
RSA, the reason a given act is performed 
has no obvious influence on the pattern of 
LIA exhibited in the hippocampus. A 
similar waveform appears during licking,
regardless of whether the thing licked is
water, the animal's own fur, or the vulva of a
female in estrus. 

It often happens that a motor pattern
which (when performed alone) is accom- 
panied by RSA is performed simultaneously 
with a motor pattern which (when per- 
formed alone) is accompanied by LIA. In 
such cases, RSA always occurs. Examples 
are numerous. Movements of the vibrissae, 
as isolated acts, are accompanied by LIA, 
but if they occur together with head move- 
ments or walking (the usual situation) the 
combination is accompanied by RSA.4 A
male guinea pig while copulating exhibits 
LIA during pelvic thrusting, but if obliged 
to step forward with the hind limbs or use 
the forelimbs to restrain the movements of 
the female, RSA appears. When a rat 
scratches its flank, it stands motionless ex- 
cept for the movements of the active leg. 
Continuous LIA appears in the hippocam- 
pus. However, when scratching the head, 
the rat assumes a rather unstable posture in 
which the neck and trunk are curved la- 
terally. Small postural adjustments occur 
continually, and the hippocampal record con- 
tains considerable small-amplitude RSA. 

Yawning appears to be a rather automatic 
act in man, and therefore one might expect 
it to be accompanied by hippocampal LIA 
in the rat (see also p. 98). However, yawn- 
ing is associated with stepping movements or 
walking in rats and the total behavioral
pattern is associated with RSA. Whether
yawning, as an isolated act, is associated with
RSA is difficult to determine. The righting
reflex is another example of this type.
Although the act of righting undoubtedly has
a reflex basis, if an animal is placed on its
back, RSA is always present as it struggles
and as it rolls over. One may speculate that
in an intact animal, voluntary movements
collaborate with a brainstem reflex in the
restoration of normal posture.

4 According to Komisaruk (1970), vibrissae
movements are correlated with RSA and tend
to occur in phase with it. However, there are
instances when RSA is present while the vibrissae
are motionless, as in swimming, and other
instances when the vibrissae move and RSA is
absent.

Another factor which can pose difficulties 
in interpretation is that RSA amplitude is 
small when very slight voluntary movements 
are performed. Therefore, it might be argued 
that licking, for example, is really
accompanied by RSA, but of such small amplitude
that it cannot be identified easily. In the
case of blinking, for example, this argument
may be valid, but for the most part it has
been clear that tiny head or paw movements
(e.g., manipulation) only 1-2 millimeters
in extent are associated with clear RSA, 
while much larger movements such as biting, 
face washing, shivering, and pelvic thrust- 
ing are accompanied by LIA. Thus, there 
appear to be two distinct classes of behavior 
of which one is accompanied by hippocampal 
RSA while the other is not. 

SIA occurred infrequently under the
conditions studied so far and its behavioral
correlates are less well known. It appears to
occur on occasions when an ongoing
movement is abruptly stopped. It has been
observed under the following circumstances:
when drowsy rats were startled so that they 
leapt to their feet but did not run; in rats 
in an avoidance test, jumping to the edge of 
a box and almost going over the other side 
(requiring a sudden halt of forward move- 
ment); in rats in a Skinner box, thrusting 
their heads into the food cup and stopping 
abruptly to seize a food pellet which had
just been delivered; in rats in a motor-driven
activity wheel where behavior consisted
alternately of jumping (with RSA) and
clinging to the wheel (with SIA). A simple
demonstration can be made in the gerbil. If
a gerbil is running, a whistle will sometimes
make it stop suddenly. The stop is
accompanied by a transition from RSA to SIA
then back to RSA as the gerbil begins to
move again. If the gerbil remains
motionless for a time, SIA is replaced by LIA.
It appears that SIA is associated only with 
an abrupt halt of ongoing movement and 
that when an animal stops moving ordinarily, 
SIA does not appear. 


BRAINSTEM CONTROL OF HIPPOCAMPAL
ACTIVITY AND OF MOVEMENT 


The findings described in the preceding
section indicate that three types of
hippocampal rhythm can be identified by simple
inspection in an EEG record. One type is
LIA which is present during alert
immobility and during the performance of automatic
movements. A second type is RSA which
accompanies the performance
of voluntary types of movement. Large-[...]
[...] frequency. The third type of hippocampal
activity, SIA, is associated with an abrupt
halt of ongoing movement.

[...] demonstrated that electrical
stimulation of the
brainstem reticular formation, medial
thalamus, or hypothalamus produced RSA in
the hippocampus. Septal lesions abolished
the effect, indicating an ascending pathway
through the septal nuclei. Later
physiological studies (Corrazza & Parmeggiani,
1963; Petsche, Stumpf, & Gogolák, 1962;
Torii, 1961; Yokota & Fujimori, 1964) have
indicated that the ascending pathway runs
through the medial hypothalamus and medial
preoptic area to the medial septal nucleus
and, eventually, to the hippocampus. An
additional pathway probably runs forward
through the medial thalamus (Eidelberg,
White, & Brazier, 1959; Kawamura,
Nakamura, & Tokizane, 1961). Anatomical
evidence confirms the existence of such
ascending pathways, although details remain
rather obscure (see reviews by Adey, 1959;
Green, 1964; Lewis & Shute, 1967; Stumpf,
1965a).


It has been found that stimulation at some 
subcortical sites produces a suppression or
flattening of hippocampal activity, a reaction
which resembles the SIA pattern observed
during normal behavior, although the elec- 
trically induced pattern can be made to con- 
tinue for a longer period. This type of effect 
has been observed following high-frequency 
stimulation of the septum (Brücke, Petsche, 
Pillat, & Deisenhammer, 1959), lateral hypo- 
thalamus, and lateral preoptic area (Torii, 
1961), possibly suggesting the existence of 
a second, antagonistic, ascending functional 
system running parallel to the RSA path- 
way from the brainstem to the hippocampus. 
This system might be responsible for nor- 
mal SIA. Activity in both systems can be 
controlled from midbrain as well as dien- 
cephalic levels. Stimulation of the dorso- 
lateral tegmentum yields RSA in the hippo- 
campus; stimulation of the ventromedial 
tegmentum leads to a flattening of the hippo- 
campal record (Torii, 1961). These effects 
probably result from an ascending activation 
of the diencephalic mechanism, since large 
hypothalamic lesions abolish RSA completely 
when the midbrain is intact; elimination of 
the midbrain does not abolish RSA
(Kawamura et al., 1961; Kawamura & Domino,
1968). 

In addition to RSA or hippocampal flat- 
tening, increased fast activity (15-50 Hz.) 
may appear in the hippocampus during
brainstem stimulation. According to Stumpf
(19652), fast activity occurs independently 
of other hippocampal patterns of activity and 


[...] by brainstem stimula-
tion [...]ruction, suggesting a
totally separate functional system.


Activation of the RSA or hippocampal
flattening (SIA?) systems has opposite
effects on somatomotor activity. Stimula-
tion of points in the amygdala, preoptic
area, and ventromedial tegmentum which
suppresses hippocampal slow activity with
increased fast activity, depresses [...]
polysynaptic spinal reflexes. [...] medial
hypothalamus, [...], and dorsolateral
tegmentum [...] hippocampal RSA when
stimulated and facilitate spinal reflexes
(Yokota & Fujimori, 1964). Analogous
effects can be demonstrated in a conscious
animal. Ito (1966) found that septal stim- 
ulation produced both behavioral arrest and 
a flattening of hippocampal activity. Bland 
and Vanderwolf (1970a) have shown that 
unilateral stimulation of the dentate gyrus 
CA-4 area suppresses hippocampal slow ac- 
tivity bilaterally and produces an immediate 
arrest of ongoing locomotion, bar pressing, 
etc., but does not affect respiration, heart 
rate, shivering, or lapping water. No seizure 
discharges were detected, and the rats moved 
about normally when stimulation was stop- 
ped. Stimulation of the hippocampus at the 
same voltage does not have a comparable 
effect, although arrest can be obtained at 
higher voltages, suggesting a spread of cur-
rent to the dentate gyrus. Therefore, the
hippocampus and dentate gyrus may have
different functional roles in the control of
voluntary movement. 

Stimulation of the paramedian hypothala-
mus produces both hippocampal RSA and
motor activity of a nonforced type as dis-
cussed above. That is, the movements pro-
duced may be running forward, turning,
jumping, etc. depending on the environ-
mental circumstances. Hypothalamic points
that fail to produce hippocampal RSA upon
stimulation also fail to produce significant
voluntary motor activity (Bland & Vander-
wolf, 1970b).

Brainstem mechanisms control both the
frequency and amplitude of hippocampal
RSA, although it has not been demonstrated
that different control systems are involved,
as the behavioral work suggests. However,
Whishaw (1970) has recently found that
the two parameters can be manipulated in-
dependently. Cooling a rat reduces RSA
frequency without affecting amplitude. Dur-
ing the period of postictal depression fol-
lowing hippocampal seizure activity, RSA
frequency is normal but amplitude is much
reduced.

The frequency and amplitude of RSA
elicited by brainstem stimulation vary, de-
pending on the site stimulated. Kawamura
and Domino (1968) have shown that RSA
produced by medial thalamic stimulation has
a lower frequency and smaller amplitude
than RSA elicited by hypothalamic stimula-
tion. The frequency of RSA is also related
to the intensity of brainstem stimulation. 
Stronger stimuli yield higher frequency RSA 
at the onset of stimulation than weaker stim- 
uli (Sailer & Stumpf, 1957; Yoshii et al., 
1966). 

If the posterior hypothalamus is stimulated 
in a freely moving animal, the onset of run-
ning behavior is accompanied by a high-fre-
quency burst of RSA very similar to that
accompanying the normal initiation of move- 
ment (Bland & Vanderwolf, 1970b). Thus, 
both the initiation of movement and the in- 
crease in RSA frequency may be due to ac- 
tivity in an ascending brainstem system. 
This conclusion is consistent with the hypo- 
thesis of a diencephalic movement trigger 
mechanism discussed above. Further, con- 
tinued performance of a movement pattern 
is associated with lower frequency RSA 
than initiation of the same movement. This
suggests that triggering of a movement is
accomplished by a burst of activity in the
diencephalic system which then continues to
discharge at a reduced rate as movement
continues. The experiments on medial 
thalamic lesions suggested a similar con- 
clusion. During avoidance testing, it was 
often observed that when a rat had once be- 
gun to run, the movements could be com- 
pleted without difficulty. 

Medial thalamic lesions appear to inter- 
fere with the initiation of voluntary types of 
movement, but the effect is relatively slight. 
Posterior hypothalamic-subthalamic area 
lesions have a more severe effect, amounting 
to chronic akinesia (Magoun, 1950). The 
more ventral lesions also prevent the ap-
pearance of RSA in the hippocampus, while
medial thalamic lesions may result only in
a slight slowing of the rhythm (Kawamura
et al., 1961). More rostrally, this relation
disappears. Septal lesions, which prevent the 
appearance of RSA in the hippocampus, or 
removal of the entire hippocampal formation, 
do not result in akinesia. The defect pro- 
duced by such lesions consists, in part, in a 
failure to organize motor activity effectively 
rather than an inability to initiate a motor
act (see below pp. 105-106). In terms of
the concepts presented here, one might say
that the hippocampal formation takes part
in the programming of an intended act but
that no actual behavior occurs until the pro- 
gram is activated by the diencephalic trigger. 
According to the views presented here, 
the presence of RSA in the hippocampus in- 
dicates an activated state. In opposition to 
this, it has often been suggested that the 
hippocampus has a predominantly inhibitory 
role with respect to behavior (Douglas, 
1967; Kimble, 1969; McCleary, 1966) and 
that the presence of RSA indicates that the 
hippocampus is inactive, that is, that a 
structure which inhibits the brainstem is 
itself inhibited (Grastyán, Lissák, Madarász,
& Donhoffer, 1959). This view is based
partly on the finding that animals with large
lesions of the hippocampal formation are
“disinhibited” (i.e., hyperactive, performing
poorly in tests of passive avoidance, tend-
ing to continue performing inappropriate
movements, etc.) and partly on the assump-
tion that regular “synchronous” activity
necessarily indicates an inactive state. The
latter assumption may be unjustified. The
mammalian olfactory bulb, for example,
generates very regular waveforms when an
air blast or strong odor is presented, sug-
gesting that strong excitation may favor
the development of synchronous activity in
a neurone population (Adrian, 1942, 1950).
Many hippocampal units fire in phase with
the RSA waveform (Green, 1964). As
RSA frequency increases during the initia-
tion of movement, such units presumably
fire more rapidly, indicating an activation of
function of some sort. Finally, RSA can
be recorded from the thalamus and hypo-
thalamus as well as the hippocampus. It
seems improbable that all these structures
would be inhibited during active movement.
The fact that “disinhibition” of behavior
occurs when the hippocampal formation is
damaged does not prove that the hippocam-
pus normally plays an inhibitory role. The
behavioral change may be partly due to re-
moval of the dentate gyrus, since both struc-
tures are usually destroyed in a surgical
lesion. Electrical stimulation of the dentate
gyrus CA-4 area suppresses voluntary move-
ment; stimulation of the hippocampus has no
obvious effect unless seizure activity occurs.
This could mean that it is the dentate
gyrus rather than the hippocampus which
has an inhibitory role. The finding that the
hippocampal pyramids generate RSA while
the dentate gyrus CA-4 area apparently gen-
erates fast activity supports the hypothesis
of a functional differentiation of the two
(also see Jackson, 1968). However, it
should be borne in mind that the methods
used in behavioral studies [...] gross to
permit demonstration of possible functional
[...] the dentate [...]ber systems [...]

There may be considerable functional dif-
ferentiation in the septal nuclei as well as
in the hippocampal formation. Donovick
(1968) found that only those septal lesions
which abolished RSA in the hippocampus
produced impairment of discrimination re-
versal learning. Lesions anywhere in the
septal area facilitated shuttle-box perform-
ance, regardless of whether RSA was abo-
lished or not. This may suggest that the
ascending RSA pathway plays a role in the
organization of [...]
discrimination reversal learning, but is not
[...] believed to be responsible for poor
shuttle-box [...]. Further, lesions of [...]
hippo- [...] with the pro-
duction of motor activity, since they depress
running and make rats “placid”
(Clody & Carlton, 1969) as well as interfere
with avoidance pe[...]
which the rat must [...]
& Thompson, 1965). Perhaps it is the more
lateral septal structures which are concerned
with inhibitory effects and whose destruction
results in ferocity, impaired passive avoid-
[...]


RELATION OF HIPPOCAMPAL RSA
TO AUTONOMIC ACTIVITY


There is evidence that hippocampal RSA
is associated with cardiovascular changes as
well as with voluntary types of movement.
Torii and Kawamura (1960) observed that
spontaneous occurrences of RSA in a
curarized cat were associated with sharp
rises in blood pressure. Hypothalamic
stimulation at points [...] hippo-
campal RSA also produced rises in blood
pressure. However, stimulation at points 
in the amygdala, septal nuclei, and rostral 
hypothalamus resulted in a flattening of the 
hippocampal record and a lowering of blood 
pressure. At a number of points in the 
brainstem, Yokota and Fujimori (1964) ob- 
served a relation between pressor effects, 
hippocampal RSA, and facilitation of spinal 
reflexes on the one hand, and depressor 
effects, flattening of the hippocampal record, 
and inhibition of spinal reflexes on the other 
hand. In an aversive-conditioning situation, 
Black and de Toledo (1971) often observed 
a relation between the appearance of RSA 
and increased heart rate in curarized dogs. 
An anatomical basis for correlations be- 
tween hippocampal RSA and cardiovascular 
activity is suggested by evidence (Ban, 
1966) that a sympathetic zone occupies a 
paramedian position throughout the hypo- 
thalamus and preoptic area, apparently 
overlapping extensively with the ascending 
RSA pathway. A parallel parasympathetic 
zone occupies a more lateral position in re- 
gions where electrical stimulation produces 
a flattening reaction in the hippocampus as 
well as depression of spinal reflexes. 
These findings are consistent with the view 
that there is a close association of somato- 
motor and cardiovascular control in the 
central nervous system (Rushmer & Smith, 
1959). Stimulation of the motor cortex pro- 
duces not only movements, but also a rise in 
systemic blood pressure and cardioaccelera- 
tion, as well as a shift in distribution of 
blood from the viscera to the somatic 
musculature. Renal blood flow, for ex- 
ample, is reduced. A similar constellation 
of effects can be obtained from points in the 
amygdala, hypothalamus, and midbrain
(Bard, 1960; Fulton, 1949; Gellhorn, 1967;
Hoff, Kell, & Carrol, 1963; Smith, Rushmer,
& Lasher, 1960). The circulatory phe-
nomena are due, at least in part, to a direct
central nervous influence on the heart and 
vessels rather than to a secondary effect of 
muscular activity, since they can be demon- 
strated in curarized animals. Thus, there 
appears to be a synkinesis between somato- 
motor and vasomotor control analogous to 
the relation between convergence of the eyes,
miosis, and contraction of the ciliary muscle.
During movement, or preparation for move- 
ment, the central nervous system apparently 
activates the vasomotor system as well as 
the respiratory and general somatic muscula- 
ture, as was suggested by Krogh and Lind- 
hard (1913, 1917) many years ago. Once 
movements have been initiated, of course, 
many other factors such as postural changes, 
muscle pumping, input from sensory recep- 
tors in the heart, vessels, joints, and muscles, 
as well as metabolic changes, will also affect
the circulation. 

An interesting aspect of the relation be-
tween hippocampal RSA and behavior is
that the central rhythm may precede move- 
ment by several seconds. As movement be- 
comes imminent, RSA frequency rises. 
Similar anticipatory changes occur in the 
respiratory and circulatory systems, as was 
pointed out by Krogh and Lindhard and 
many subsequent investigators (Mountcastle, 
1968, pp. 542-543). The heart rate tends 
to rise in anticipation of spontaneous walking 
in dogs or in anticipation of sucking move- 
ments in human infants (Gottlieb & Simner, 
1966; Perez-Cruet & Gantt, 1959).5 

A relation between movement and sym- 
pathetic activity is also indicated by the 
dilatation of the pupil which occurs in asso-
ciation with key pressing, speech, lifting
weights, etc. (Hakerem & Sutton, 1966;
Nunnally, Knott, Duchnowski, & Parker,
1967; Simpson & Paivio, 1966, 1968).
Slight mydriasis can precede actual move-
ment and can also occur when no visible
movement is performed, as when a subject 
simply imagines something. Even in this 
case, however, minimal activity may be gen- 
erated in movement-producing systems. 
According to numerous investigations, im- 
agining a visual scene or imagining the per- 
formance of some motor act is accompanied 
by tiny eye movements, muscular activity in 
the limbs, etc. (Whatmore & Kohli, 1968). 


5 Simner (1966) reported that heart rate de- 
creased prior to vocalization in the chick. Since 
chicks tend to stand still while making distress 
calls, it seems possible that the fall in heart rate
was associated with immobility rather than the
onset of vocalization. Unfortunately, movement
was not recorded.

Thus, although there is evidence of in-
creased sympathetic activity when a motion-
less animal develops RSA and begins to
move, in general there is probably no simple
relation between RSA and sympathetic ac-
tivity. The cardiovascular system has
evolved primarily to meet the metabolic re-
quirements of active tissue, and during vig-
orous motor activity, the heart rate, for
example, is high regardless of whether RSA
is present or not. Thus, heart rate is higher
during face washing than during
walking in the rat, even though face wash-
ing is not accompanied by RSA (Vander-
wolf & Vanderwart, 1970).

It is widely accepted that the cardiovas-
cular system, in co[...] auto-
nomically innervated structures, [...]
fully affected by emotional states [...]
without relation [...]. How-
ever, it seems possible that many
“emotional” effects on the autonomic [...]
related to the [...] pat-
terns of movement, but that the [...]
[...]y, occurring well in
advance of actual movement. For example,
it seems obvious that penile erection is part
of a general preparation for copulatory be-
havior, and that salivation can be a prepara-
tion for feeding. The cardiovascular activ-
ity characteristic of fear resembles the pat- 
tern seen during severe exercise, suggesting 
a preparation for rapid or prolonged running 
behavior. Urination and defecation during 
fear can be thought of in much the same way.
Perhaps these activities have evolved in rela- 
tion to fear because they are a means of re- 
ducing the abdominal contents, permitting 
freer movement of the diaphragm during 
maximal motor activity. 


AROUSAL, ATTENTION, AND SLEEP


There is evidence that in the neocortex,
electrical [...] somewhat the same way
[...] hippocampus. Clemente, Sterman, and
Wyrwicka (1964) and Roth, Sterman, and
Clemente (1967) [...] appear in the neocortex
of a cat drinking milk or grooming, but do
not occur during walking. In this laboratory,
these waveforms have been observed (in the rat
and guinea pig) in the hippocampal forma- 
tion as well as the neocortex during chewing, 
face washing, shivering, or immobility, but 
have never been observed during the per- 
formance of behavior associated with hippo- 
campal RSA. Thus, these observations are 
consistent with the general rule that hippo-
campal RSA is associated with low-voltage 
fast activity in the neocortex, while irregular 
hippocampal activity may be accompanied by 
slow waves and spindling (Green & Arduini,
1954).

It may seem contradictory that slow activ- 
ity appears in the neocortex following the 
initiation of milk drinking in cats but does 
not appear when the cats eat solid food 
(Grandstaff, 1969). The difference between 
the two behaviors is probably that head 
movements (associated with low-voltage fast 
cortical activity and hippocampal RSA)
occur almost continuously during eating in
cats but occur only occasionally during milk
drinking. Rats, unlike cats, chew their food, 
and apart from the continuous jaw move- 
ments, are often quite motionless while eat-
ing. Therefore, during chewing, spindling
and slow waves often appear in the neo-
cortex.


Another line of evidence is that the neo-
cortical surface positive dc potential, mea-
sured against a reference electrode in the
frontal bone in the rat, is large during be-
havioral immobility or face washing but
smaller during walking, rearing, or head
movement (Caspers, 1961). With the use
of a transcortical recording technique, phasic
surface positive “activation waves” have
been reported in the neocortex of the cat
during head movements but not during
scratching, licking, or yawning (Calvet, Cal-
vet, & Langlois, 1965).


[...]sappear again when [...]
(Jasper & Penfield, [...]). [...] cortex in man, like
[...] lower animals, seems to
be activated during voluntary movement but
not during the maintenance of a fixed pos-
ture. 

All these observations indicate that dur-
ing voluntary movement there is more ac- 
tivity in the ascending nonspecific projec- 
tions to the neocortex, as well as to the 
hippocampus, than is present during be- 
havioral immobility or automatic types of 
movement. A possible interpretation of this 
is that the neocortex and hippocampus play 
an active role in voluntary behavior but are 
less crucially involved in automatic behavior 
or in the maintenance of an unchanging 
posture. Such an interpretation assumes 
that neocortical slow waves and spindling 
and hippocampal LIA or spindling are 
indicative of an inactive functional state. 

Other lines of evidence also suggest that 
normal activity in the hippocampus or neo- 
cortex is less important for automatic than 
for voluntary behaviors. Local seizure ac- 
tivity produced by stimulation of the hippo- 
campus of a cat, while the animal was drink- 
ing milk, did not interrupt the ongoing lap- 
ping movements (Akert, 1961). Hippo- 
campal seizure discharges, however, do pre- 
vent the occurrence of a more voluntary act 
such as an instrumental avoidance response 
(Flynn & Wasman, 1960; MacLean, Flani-
gan, Flynn, Kim, & Stevens, 1955). Den-
tate CA-4 stimulation arrests voluntary
movement but not shivering or lapping wa-
ter (Bland & Vanderwolf, 1970a). The 
different susceptibility of automatic and 
voluntary motor activities to diencephalic 
damage has already been discussed (see 
also Dirlam, 1969; Glickman, Higgins, & 
Isaacson, 1970).

The findings discussed here are relevant 
to the classical concepts of the relation of 
“arousal” to behavior (Hebb, 1955; Linds- 
ley, 1951; Malmo, 1959). It might be sug- 
gested that behaviors such as chewing, face 
washing, and licking the back occur during 
states of low arousal or relaxation and are 
associated with hippocampal and cortical 
deactivation for this reason. However, it is 
important to distinguish cerebral activation 
from behavioral arousal (Feldman & Waller,
1962; Lindsley et al., 1950). Face wash- 
ing and licking of the back, in particular, are 
vigorous muscular activities in a rat and
can be shown to have a higher heart rate
than walking or moving the head, behaviors 
which are always associated with neocortical 
and hippocampal activation (Vanderwolf & 
Vanderwart, 1970). Also, as the work dis- 
cussed here shows, if a frightened rat or 
guinea pig freezes or chatters its teeth, or 
shivers in response to lowered body tempera- 
ture, the cortex and hippocampus show signs 
of deactivation (slow waves, spindling, and 
LIA) even though the animal would ordi- 
narily be regarded as aroused at such times. 

Hippocampal RSA is probably not asso- 
ciated with changes in attention in any way. 
Slight environmental stimuli will make a 
rat pause momentarily during the perform-
ance of such activities as lapping water or
washing the face. Presumably this indi- 
cates "attention," yet the ongoing pattern 
of hippocampal LIA usually shows no altera- 
tion whatever. Of course, RSA will appear 
if the stimulus results in a head movement, 
postural adjustment, locomotion, etc. Simi- 
lar results appear in a conditioning situation. 
RSA does not occur in dogs during the proc- 
ess of discriminating one conditioned stim- 
ulus from another, unless the animals move 
at the same time (Dalton & Black, 1968; 
Young & Black, 1970). 

It would be interesting to know whether
perceptual processes might not generally be 
more efficient when cortical mechanisms are 
not occupied with the control of voluntary 
movement. In man, it is well known that 
close attention is correlated with behavioral 
immobility, and according to J. O'Keefe 
(personal communication, 1969), cells in 
the cuneate nuclei are actually less excitable 
during the performance of behaviors asso- 
ciated with hippocampal RSA than during 
behaviors associated with LIA. 

Hippocampal RSA appears during sleep 
(activated or paradoxical sleep) as well as 
in the waking state. An interpretation of 
this is suggested by evidence that there is a 
great deal of activity in the motor system 
during activated sleep but that the spinal 
motoneurone and its associated reflex cir- 
cuitry are “switched off" by descending in- 
hibitory activity, making gross movement 
impossible. The descending motor barrages
result only in tiny twitches of the limbs.
Whishaw and Vanderwolf [...] sleep,
hippocampal RSA has a higher frequency and
amplitude during periods of twitching than in
the intervening quiet intervals. [...] activity
is triggered by diencephalic [...] during
activated sleep in much the same way as in the
waking [...].

[...] number of situations, low-frequency
RSA can be observed in the absence of
movement in waking animals. As a rat
prepares to jump out of a box, RSA is pres-
ent, but the jump occurs only when the fre-
quency of the RSA reaches 8 Hz.
In rabbits, sensory stimuli can produce long
trains of RSA without any accompanying
visible movement. Such RSA is always of
lower frequency than is seen during actual
voluntary movement (Harper, 1968), sug-
gesting that, here too, a movement (run-
ning?) was “programmed” but that the fre-
quency necessary for its initiation was not
achieved. Cats also may exhibit low-fre-
quency RSA at a time when they are not
obviously moving about (Brown, 1968).
Animal hypnosis is yet another state in
which low-frequency RSA is associated with
absence of movement, but in addition there
is a loss of the normal righting reflex.
There may be an inhibition of spinal moto-
neurones in this case, as in paradoxical sleep
(Harper, 1968; Klemm, 1966, 1969; Mc-
Bride & Klemm, 1969).


HIPPOCAMPAL ACTIVITY AND LEARNING


It has been reported that the hippocampal
EEG is altered during the course of train-
ing in a learning task. According to Grastyán
et al. (1959), the initial response to a novel
stimulus is a desynchronization of hippo-
campal activity. As conditioning proceeds,
[...] Adey, Dunlop, & Hendrix, 1960). None-
[...] and his colleagues agree that
changes in hippocampal activity do occur
[...] and that RSA does no[...]
RSA [...] a higher frequency than is
[...] early in training (Elazar &
Adey, 1967).

[...] hippocampus may play a critical role in
motor performance but is not essential to
learning in lower animals. Flynn and Was-
man (1960) showed that if conditioning
trials are presented during hippocampal
seizure activity, performance (leg flexion)
is blocked, but learning is still possible ap-
parently, since the animals perform quite
well when subsequently tested in the normal
state.

The changes in hippocampal electrical ac-
tivity during training can also be accounted
for on the hypothesis that such activity is
related to movement rather than the learn-
ing process itself. For example, whether
an animal exhibits hippocampal RSA or
not when a novel stimulus is presented de-
pends on the type of behavior produced.
Exploratory behavior is associated with
RSA; behavioral immobility usually is not.
Early in training, in the type of apparatus
(alley) used by Adey, an animal is likely
to spend part of its time standing motion-
less but [...] its head about. [...] it is
more likely to walk promptly toward the
food reward. Since head movements in an
otherwise motionless animal are associated
with lower frequency [...]
[...]ings) which occur early in acquisition (Tol-
man, 1948). After the learned response [...]
established, RSA [...]ings might [...]
reports that [...]
periods of up to 8 hours per day for several
days in succession. RSA remained virtually 
unchanged throughout this period. Further, 
RSA appears during feeding behavior when 
a rat manipulates a large piece of solid food, 
a behavior which the animal performs hun- 
dreds of times every day of its adult life. 

However, under other circumstances, 
RSA may be reduced or even seem to dis- 
appear during the course of a learning ex- 
periment. It was pointed out above that ex- 
tensive voluntary movements are associated 
with higher amplitude RSA than smaller 
movements. If the recording electrodes are 
not placed in an optimal position and orienta- 
tion with respect to the dipole of the pyra- 
midal cells, RSA may be present only occa- 
sionally during the performance of extensive 
movements and seldom or never during the 
performance of small movements. At such 
electrode placements, RSA may “disappear” 
during training as the extensive irrelevant 
movements characteristic of the early phases 
of acquisition drop out. Such “disappear- 
ance” was demonstrated by Vanderwolf and 
Whishaw (1969) who showed that RSA 
was often present at “poor” hippocampal 
placements (little RSA present) on the first 
day of training in a bar-press task. On 
subsequent days, RSA was much less com- 
mon as the rats began pressing regularly 
and stopped walking about. At "good" 
hippocampal placements, RSA did not dis- 
appear, but was reduced in amplitude as the 
rats’ behavior changed from walking and 
rearing to regular bar pressing. 

These results indicate that hippocampal 
RSA is related to the higher level control 
of movement and is not a sign of the activity 
of the learning process. A final line of 
evidence which supports this statement is 
that automatic behaviors can occur as con- 
ditioned responses in the absence of RSA. 


Posluns (1962) found that rats drugged with
chlorpromazine do not make voluntary
shock-avoidance responses although con-
ditioned vocal responses may still occur.
Hippocampal recordings taken from rats
given chlorpromazine during avoidance test-
ing (Vanderwolf, 1969a) showed that when
the rats vocalized, no RSA appeared. How-
ever, RSA was still present if the rats
walked or jumped, regardless of whether the
behavior was a conditioned response, an 
escape response, or a spontaneous movement. 
Young and Black (1970) report that LIA 
occurs in the hippocampus of rats perform- 
ing a licking response to avoid shock. 

Thus, an automatic conditioned response 
does not appear to be controlled by the 
same neural mechanisms as a voluntary con- 
ditioned response. EEG findings confirm 
the results of the studies on the effect of 
medial thalamic lesions. In fact, there is 
considerable evidence that the performance 
of automatic behaviors as conditioned re- 
sponses is less dependent on forebrain struc- 
tures than more voluntary learned responses. 
In addition to points already mentioned,
hippocampal seizure activity may have a
more severe effect on a conditioned leg
flexion response than on a conditioned res-
piratory response (MacLean et al., 1955-
1956). This is also true of seizures initiated
by application of acetylcholine to the neo- 
cortex (Fleming, 1963). Similarly, condi- 
tioned cardiac responses are still possible 
following removal of the neocortex (Bloch- 
Rojas, Toro, & Pinto-Hamuy, 1964) or cor- 
tical spreading depression (Morgenson & 
Peterson, 1966), although a behavioral 
avoidance response is impaired or abolished. 


DEFINITION OF VOLUNTARY AND 
AUTOMATIC MOVEMENT


Various theoretical schemes have recog- 
nized differences in the level of organization 
of different behaviors. A distinction be- 
tween reflexive and voluntary behavior has 
long been widely accepted (Fearing, 1930). 
Sherrington (1961, pp. 322-330) distin-
guished precurrent from consummatory acts 
and emphasized the dependence of the for- 
mer on the distance receptors and the de- 
pendence of the latter on contact receptors. 
An analogous distinction was made between 
appetitive and consummatory behavior by 
Craig (1966). It is often emphasized that 
appetitive behavior initiates a behavioral 
sequence, while consummatory behavior ter- 
minates it. 

The writings of Hughlings Jackson pro- 
vide the most extensive existing treatment 
of levels of function of the nervous system
(see Taylor, 1958). Jackson proposed that
simple reflexes and stereotyped rhythmical
movements, such as breathing, were con-
trolled by the lower levels of the nervous
system, while the more varied voluntary
movements were dependent on the operation
of higher, more recently evolved structures,
especially the cortex. Lower (more auto-
matic) motor centers could be controlled
(either excited or inhibited) by higher
centers, which could be controlled by still
higher centers, and so on, the whole form-
ing a multilevel hierarchical system. An im-
portant feature of Jackson's ideas was that
a given function might be represented many
times at different levels. Thus, one might
say that the movement of flexing a limb is
represented at the lowest [...].
At a higher level, limb flexion is
integrated with [...] to per-
mit walking, as may be observed in a decere-
brate animal. At a still higher level, limb
flexion or walking is integrated with other
sensorimotor activities to form an adaptive
behavior such as food getting.
[...] nonfunctional, the next highest center is “re-
[...] behavior is correspondingly sim-
plified and made more automatic.


For reasons of historical priority, as well
as easy relation to present day neurological
and neurophysiological concepts, it has
seemed advisable to adopt [...]
of “voluntary” and “automatic” to refer to
motor organization of different degrees of
complexity. In opposition to this, it might
be argued, for example, that behaviors asso-
ciated with hippocampal RSA should be
called “appetitive” while those associated
with hippocampal LIA are “consummatory.”
However, this type of distinction is often
difficult to apply and is not easily related to
questions of physiological mechanism.
Threat behaviors such as piloerection and
[...] since they precede fighting.
[...] Hughlings Jackson's terminology
[...] everyone under-
[...] meaning of the term “voluntary”
[...] way, it is difficult to define
objectively. It seems probable that there
is no single identifying characteristic. In
common usage, voluntary acts are those 
controlled directly by an intention or idea, 
whereas a reflexive act is one which is elic-
ited and controlled by some definite stimulus
(see Hebb, 1966). However, such activ- 
ities as salivation or penile erection can be 
aroused solely by imagination, but are none- 
theless not fully voluntary since they can- 
not be performed readily on verbal command 
and become totally unavailable during fear. 

One factor suggested by the preceding examples may be that motor patterns differ in the degree to which their occurrence is dependent on the presence of a particular motive state. When someone “feels cold,” shivering movements occur and, when severe, cannot be suppressed (Denny-Brown, Gaylor, & Uprus, 1935; Uprus, Gaylor, & [...]). Shivering does not occur during the state of “feeling warm” and cannot then be made to [...] voluntary effort. [...] is depressed [...] yet, on occasions relate[...] tion occurs and cannot [...] produce when some other mo[...] In contrast, a voluntary [...] easily brought [...] of a number [...] For example, in man, walking or manipulative behavior can be brought into the service of food getting, [...], etc., but laughing [...] not equivalently available to all motive states. Everyone knows that during weeping it is all but impossible to prevent sobbing, keep the corners of the mouth up, etc., and also that during a gay mood, these motor patterns cannot be reproduced convincingly. Laughter is much the same. Giggling can be almost uncontrollable, especially in young people. On the
other hand, attempts to feign such behaviors 
when one is not amused are described as a 
“strained smile," “forced laugh," etc. 

Such phenomena suggest a dual-level con- 
trol of facial expression in man. There ap- 
pear to be a number of species-specific pat- 
terns of emotional expression which develop 
without specific training or imitation (Dar-
win, 1965; Goodenough, 1932; Thompson,
1941), although the occasions on which a 
given pattern occurs are obviously very much 
influenced by learning (Ekman, Sorenson, 
& Friesen, 1969). With increasing age 
and experience (Fulcher, 1942), the facial 
musculature comes under the control of an 
additional system which permits voluntary 
movements, including acting and the feign- 
ing or concealment of emotions which is so 
characteristic of man, but also occurs in 
chimpanzees (Hebb & Thompson, 1954). 
The two systems can be dissociated by brain 
damage. Patients in whom the cortical mo- 
tor pathway is believed to be bilaterally dam- 
aged may laugh or cry uncontrollably, with 
abundant facial movement, yet be incapable 
of showing the teeth upon command. A 
hemiplegic patient may retain full emotional 
expression on both sides of the face although 
voluntary facial movements are largely re- 
stricted to one side. In contrast, following 
brainstem damage, voluntary facial move- 
ment may be preserved while emotional ex- 
pression is lost (Brown, 1967; Ironside, 
1956; Wilson, 1924). Thus, there appears 
to be a subcortical mechanism for “genuine” 
emotional expression, but when facial move- 
ments are to be controlled by some other 
motive, such as the desire to comply with a verbal request, a cortical mechanism is brought into play.

This type of dual control is not restricted to “emotional expression” as conventionally understood. Shivering can be mimicked to some extent by a normal person. In cases of hemiplegia, such voluntary control is presumably lost unilaterally, although the shivering in response to cold is normal on both sides of the body (Uprus et al., 1935). It seems possible that a great many motor patterns in nonhuman animals are of the type which is restricted to one or a small
number of motive states. Breland and Bre- 
land (1961, 1966) report a large number 
of observations on the difficulty of training 
a hungry pig not to root or a chicken not
to scratch prior to the delivery of food in 
operant training situations. Such a require- 
ment may be analogous to asking a hungry 
man not to salivate when food is presented. 
Further, Breland and Breland did not suc- 
ceed in teaching a goat to bleat or a cow 
to run in order to obtain food, perhaps be- 
cause these behaviors are not a part of nor- 
mal feeding behavior in these animals. 

Kellogg (1968) reports that it is very 
difficult to train a chimpanzee to vocalize to 
obtain food even though the animal has a 
wide repertoire of “spontaneous” calls 
(mostly linked to various emotional states). 
Nonetheless, the chimpanzee is very teach- 
able in other respects, and Gardner and 
Gardner (1969) have successfully taught 
one to communicate using more than 30 signs 
of American sign language (the language 
used by the deaf in North America). 

Bolles (1969) found that rats do not 
easily learn to rear to avoid shock, although 
running is rapidly acquired. In contrast, it
is well known that rats can be quickly taught 
to rear to obtain food or water, even in 
classroom demonstrations. Grooming be- 
havior in rats, unlike head movements or 
locomotion, is not susceptible to punishment 
or escape training in which noise is used as 
a reinforcing stimulus (Bolles & Seelbach, 
1964). 

As another type of example, it appears to 
be common knowledge among farmers and 
ranchers that horses in winter will dig 
through snow with their forefeet to reach 
the buried grass. Cattle do not appear to do this and do not fare as well unless fed by man. Nonetheless, cattle can easily acquire simple instrumental responses (such as pumping water for themselves by pressing a panel with their muzzles) and are quite capable of making the movement of pawing the ground, since bulls do this as part of their threat behavior. Apparently, the movements of pawing cannot be easily connected with the central representation of hunger in cattle.

In conclusion, it seems likely that many motor patterns in animals resemble shivering, emotional expression, penile erection, etc., in man in that certain patterns will be
performed whenever a given motive state is 
present in sufficient intensity, but not at 
other times. The actual structure of motor 
control in different species is almost totally 
unknown. Breland and Breland (1966, 
p. 87) suggest that higher animals (por- 
poises as well as primates) are more sus- 
ceptible to “cross-drive conditioning” than 
lower forms.

Many of the phenomena of drive-linked 
behavior might be explained by assuming 
that different motive states each possess a 
unique cluster or constellation of relatively 
automatic somatic and autonomic effector 
activity, adapted to the particular behavior 
which is to be performed. A given com-
ponent of such a reaction cluster may be
quite rigidly linked to the total cluster, as
shown by consistent appearance when the
drive state is sufficiently intense and failure
to appear in association with other drive 
states. Such complex reaction patterns are 
sometimes regarded as reflexes. In Pav- 
lovian physiology, the readiness of an animal 
to take food, to salivate, etc., is regarded as 
an indication of the excitability of a feeding 
reflex or feeding center. The term “center”
in this instance does not refer to a particular 
locus but to a multicomponent system orga-
nized at all levels of the neuraxis (Bykov,
1958, p. 525). Elaborations of an essen-
tially similar idea are found in ethological
theory and have proved extremely useful in 
the analysis of instinctive behavior (Craig,
1966; Tinbergen, 1951). The concept of 
activation of an innate response system can 
also be applied to many conditioned behaviors. It has long been known that when a neutral stimulus is repeatedly paired with electric shock, for example, the “conditioned response” includes a vast constellation of visceral and somatic phenomena which resemble the response to the shock itself. However, voluntary types of learned behavior cannot be interpreted in this way. An innate feeding or defensive reflex cannot include all the motor patterns (walking, jumping, manipulation, etc.) which animals will perform to obtain food or avoid
electric shock. Perhaps, when an animal 
learns to perform an act such as running to 
obtain food, the central representation of the 
running pattern becomes coupled to the 
central representation of feeding in some way 
and is controlled by it, so that temporarily 
the running response behaves in much the 
same way as an innately coupled motor pat-
tern such as salivation. The comparisons
presented by Kimble (1961) show that run- 
ning or bar pressing for food follow much 
the same rules of acquisition, generalization, 
extinction, spontaneous recovery, etc., as a
salivary conditioned response. If the feed-
ing system (including salivation, chewing,
etc.) is activated artificially by hypothalamic 
stimulation in a satiated animal, a previously
learned voluntary motor pattern will also ap- 
pear (Maire, 1965; Miller, 1958). Such 
facts suggest that the two types of response 
share the same sensory and motivational
control mechanisms. The control mechanisms 
cannot be identical however, since automatic 
behaviors, occurring as conditioned re- 
sponses, are often less affected by forebrain 
damage than voluntary behaviors (see above, 
p. 101) and may occur independently of the 
RSA system. According to the point of 
view advanced here, the difference lies in 
the greater complexity of the motor orga- 
nization of voluntary behaviors, one aspect 
of which is dependence on forebrain control 
systems.

Some of the sources of this complexity have already been suggested. Some sort of switching circuitry must be necessary to control voluntary movement patterns if they are to be connected with different drives on different occasions. The more automatic components of the cluster of motor activities associated with a drive could be interconnected more simply, since little flexibility is required. Complex circuitry must also be required to achieve the varying sequential pattern of voluntary movement. [...] given movement of the hand [...] or followed by any of a very [...] movements, but it is difficult to produce any rearrangement whatever of some automatic sequences of muscular contractions, such as swallowing. The programming circuitry which permits voluntary movements to be brought under the control of a drive and performed in an effective sequence is presumably located largely in the cerebral hemispheres, in the neocortex, hippocampal formation, etc. Presumably, a program does not result in actual behavior unless the diencephalic movement trigger is also activated, the effect of the two summating to produce overt motor activity.

According to this interpretation, the pres- 
ence of RSA in the hippocampus during 
walking, for example, may indicate the acti- 
vation of a cerebral mechanism which tem- 
porarily couples the central representation 
of walking to the particular motive system 
dominant at the moment and permits regula-
tion of rate, direction, etc. An instant later,
running or head movement might be so
coupled. Analysis of stimulus input and 
activation of memory engrams might be in 
progress as well, but these processes do not 
seem to require activation of the RSA sys- 
tem. During an automatic act such as 
shivering, RSA does not occur (one may
speculate) because shivering is innately 
coupled to a cold defense motive system at 
a brainstem level, and the cerebral mecha- 
nism becomes superfluous. 

However, not all movement patterns which 
occur in the absence of RSA are restricted 
to a single motive state. Licking, for ex-
ample, occurs during feeding, grooming,
sexual, and maternal behavior. Rats can 
learn to lick a water spout to avoid shock 
(Young & Black, 1970). Biting and gnaw- 
ing seem much the same. Despite this ap- 
parent flexibility, cerebral mechanisms are 
not activated during continuous licking or 
biting, since LIA occurs in the hippocampus 
and slow waves and spindling may occur in 
the neocortex. In explanation of this, it is 
possible that both licking and biting com- 
prise a number of distinct motor patterns, 
each of which is innately coupled to a par- 
ticular motive state. Biting the fur during 
grooming, for example, occurs at a much 
faster rhythm than biting and chewing food. 
Another possibility is that acts like licking and biting are not under the direct control of the cerebrum at all, but are operated indirectly, as reflexes.


An example of the indirect control of a 
reflex by a higher level mechanism is swal- 
lowing in man. Movements of the tongue 
(relatively voluntary) press food, saliva, etc., 
into the pharyngeal afferent zone of the swal- 
lowing reflex. Swallowing results. If the 
mouth is completely dry and empty, or if
the pharyngeal surface is painted with co- 
caine, afferent stimulation is reduced and 
swallowing becomes difficult or impossible 
(Pommerenke, 1928). This type of control 
of reflex activity may occur fairly widely. 
MacDonnell and Flynn (1966) have shown
that the “killing bite” produced by hypo- 
thalamic stimulation in the cat is abolished 
if afferents from the lips are sectioned. Fur- 
ther, semicomatose rats with medial thalamic 
lesions (pp. 84-85) often lick and bite vig-
orously at a water spout applied to the lips
at a time when they do not move the head 
in pursuit of the spout, even when it is 
moved only a few millimeters away. Chronic 
mesencephalic rats exhibit similar reflex 
licking and biting (Woods, 1964). Thus, 
when a normal rat licks or bites some object, 
it may do so by voluntary positioning of the
body and head in such a way that the lower 
level automatisms are activated reflexively. 
If so, training a rat to lick water to avoid 
shock involves training voluntary movements 
of the head and limbs. Licking occurs auto- 
matically when the water spout is contacted. 

The interpretations offered here may help account for the different sensitivity of voluntary and automatic behavior patterns to many types of forebrain damage. An automatic motor pattern may be coordinated largely by brainstem mechanisms in a normal animal. Presumably, it can be activated relatively directly by sensory input, and therefore is minimally dependent on the cerebral hemispheres. In a decerebrate rat, walking and head movements as well as licking and biting occur automatically (Woods, 1964). However, a decerebrate rat does not walk toward a food pellet and bite off a piece, presumably because the mechanisms that bring movement under the joint control of hunger and sensory input are absent. More restricted lesions may sometimes exert their effects by disrupting the control drive states normally exert on voluntary movement. For
example, Kimble, Rogers, and Hendrickson 
(1967) found that following hippocampal 
lesions, parturient female rats did not gather 
their pups into the nest and hover over 
them as normal females do. The rats still 
possessed all the components of maternal 
behavior, but the total pattern was dis- 
ordered. One rat, for example, would exe- 
cute normal hovering behavior some dis- 
tance removed from her pups. It is con- 
ceivable that such behavior occurs because 
the maternal drive is no longer able to con- 
trol activity in the locomotor system and so 
bring the rat to its nest and pups when the 
hovering response has a low threshold.


OPERANT AND CLASSICAL CONDITIONING 


A common distinction between voluntary 
and involuntary motor activities is based on 
the differences between smooth and striped 
muscles and their respective innervations.
Since 1800 or even earlier (Fulton, 1949), 
what is now called the autonomic system has 
been associated with involuntary vegetative 
activities, while the neural systems control-
ling striped muscles have been associated
with voluntary movement. A modern ver- 
sion of this idea is the suggestion that auto- 
nomic activities are susceptible to classical 
(Pavlovian) conditioning, while somatic ac- 
tivities are conditionable by instrumental or 
operant procedures.

Such ideas are not supported by present 
day knowledge. Neurological as well as 
everyday experience indicates that some so-
matic motor activities (e.g., shivering, sneez-
ing, etc.) are not easily subjected to volun-
tary control. On the other hand, various
autonomic activities such as piloerection or
heart rate can sometimes be controlled in a 
voluntary manner in man or by operant 
training in animals (Kimmell, 1967; Linds- 
ley & Sassaman, 1938; Miller, 1969). 

It seems probable that operant control of autonomic activity can often be understood in the context of control of movement in general. [...] somatic or autonomic reflexes in iso[...] 1949, p. vii), and cardio[...]
In conditioning situations, increases in heart
rate are associated with movement, decreases 
in heart rate with immobility (Obrist, 1968 ; 
Obrist & Webb, 1967; Obrist, Webb, & 
Sutterer, 1969 ; Roberts, 1970) in both ap- 
petitive and aversive situations. Behavioral 
experiments, apparently, have not established 
that emotion, motivation, attention, etc., have 
any effect on heart rate independent of their 
effects on movement.

It is sometimes assumed that the effect of movement can be eliminated in experiments on conditioning heart rate, blood pressure, etc., simply by curarizing the experimental subjects. However, as pointed out above, the motor system can probably alter heart rate or blood pressure directly without prior muscular activity. In fact, the occurrence of hippocampal RSA in a curarized animal is accompanied by increased heart rate and other [...] ity. Now, operant control of RSA can be achieved in curarized dogs, and if the animals [...] normal state, it is [...] have learned to struggle, [...] in response to the CS [...] Dalton, 1968). Heart [...] and renal blood flow can all be altered by operant procedures [...] all these variables [...] stimulation of fore[...] it seems possible that [...] the conditioning phenomena [...] on the curarized animal [...] it should be remembered [...] for example, in the absence of any obvious somatic motor activity. However, the relation to motor activity might still be detectable by hippocampal recording.

Traditional distinctions between operant and classical conditioning now seem difficult to maintain. According to Hebb (1956), even the classical salivary conditioned response should be regarded as an operant, since an animal must [...] salivate in order to eat. Similarly, the leg flexion response in classical aversive conditioning may be a type of voluntary trial-and-error behavior.

It is sometimes suggested that operant
and classical conditioning should be dis- 
tinguished on the basis of the procedures 
used. This type of distinction seems ir-
relevant. A valid classification should be
based on the biological properties of the re- 
sponses considered. Further, since the dis- 
tinction is purely logical, there is no reason 
why it should not be continued indefinitely, 
dividing and subdividing learned behavior 
into many types, all based on minor pro- 
cedural differences in training. 

On the other hand, consideration of the 
response required, and a distinction between 
higher and lower levels of control, may often 
aid in understanding what actually happens 
in a conditioning situation. For example, 
during any conditioning procedure, some 
autonomic motor activities may appear as 
part of the response cluster associated with 
a drive. Others may occur in preparation 
for, and as an accompaniment of, voluntary 
somatomotor activity. Thus, a rat trained 
to run an alley for food might show in- 
creased heart rate in anticipation of running 
(voluntary), and salivation in anticipation
of eating (drive response cluster). If the 
animal had learned to sit motionless to get 
food, the heart rate would presumably be 
lower but salivation would still occur. Dur-
ing eating, or subsequent face washing,
heart rate would also be elevated, but this 
presumably would be due to lower level 
automatic regulation, since the RSA sys- 
tem would not be active. 

During learning, voluntary motor activity 
may become coupled to an active drive state 
and may also be brought under the control 
of specified sensory stimuli. How all this is 
accomplished is almost a total mystery. The 
suggestion of Glickman and Schiff (1967) 
that activation of species-specific motor pat- 
terns is crucial to reinforcements seems read- 
ily compatible with the proposals made here. 
A discussion of possible neurological me-
chanisms of reinforcement is provided by
Milner (1970). 

This discussion has emphasized that while 
different drives may control voluntary move-
ment via a hippocampal mechanism, hippo-
campal RSA is not, of itself, a correlate of
motivational or reinforcement processes. 
Evidence supporting the latter conclusion in- 
cludes the following: (a) RSA is not pres- 
ent during reinforcing activities such as 
chewing, licking, etc., (b) animals trained
to remain motionless by an avoidance pro-
cedure or by the use of intracranial stimula-
tion do not exhibit RSA (Dalton & Black,
1968; Paxinos & Bindra, 1970), and (c)
brainstem points which produce hippocampal
RSA when stimulated can be neutral, re-
warding, or punishing in tests of self-stim-
ulation (Ito, 1966; Kramis & Routtenberg,
1969; Routtenberg & Kramis, 1968). In all
these situations RSA remains correlated with 
voluntary movement, however. 


REFERENCES 


Adey, W. R. Recent studies of the rhinencephalon in relation to temporal lobe epilepsy and behavior disorders. International Review of Neurobiology, 1959, 1, 1-46.

Adey, W. R. EEG studies of the hippocampal system in the learning process. In P. Passouant (Ed.), Physiologie de l'hippocampe. Paris: Éditions du Centre National de la Recherche Scientifique, 1962.

Adey, W. R. Hippocampal states and functional relations with cortico-subcortical systems in attention and learning. In W. R. Adey & T. Tokizane (Eds.), Progress in brain research: Structure and function of the limbic system. Amsterdam: Elsevier, 1967.

Adey, W. R., Dunlop, C. W., & Hendrix, C. E. Hippocampal slow waves: Distribution and phase relationships in the course of approach learning. American Medical Association Archives of Neurology, 1960, 3, 74-90.

Adrian, E. D. Olfactory reactions in the brain of the hedgehog. Journal of Physiology, 1942, 100, 459-473.

Adrian, E. D. The electrical activity of the mammalian olfactory bulb. Electroencephalography and Clinical Neurophysiology, 1950, 2, 377-388.

Akert, K. Diencephalon. In D. E. Sheer (Ed.), Electrical stimulation of the brain. Austin: University of Texas Press, 1961.

Anderson, N. H. Variation of CS-US interval in long-term avoidance conditioning in the rat with wheel turn and with shuttle tasks. Journal of Comparative and Physiological Psychology, 1969, 68, 100-106.

Ban, T. The septo-preoptico-hypothalamic system and its autonomic function. In T. Tokizane & J. P. Schadé (Eds.), Progress in brain research. Correlative neurosciences. Part A: Fundamental mechanisms. Amsterdam: Elsevier, 19[...]




Bard, P. Anatomical organization of the central nervous system in relation to control of the heart and blood vessels. Physiological Reviews, 1960, 40, Suppl. No. 4, 3-26.

Black, A. H. The effects of CS-US interval on avoidance conditioning in the rat. Canadian Journal of Psychology, 1963, 17, 174-182.

Black, A. H., & de Toledo, L. The relationship among classically conditioned responses: Heart rate and skeletal behavior. In A. H. Black & W. F. Prokasy (Eds.), Classical conditioning II: Current research and theory. New York: Appleton-Century-Crofts, 1971, in press.

Black, A. H., Young, G. A., & Batenchuk, C. Avoidance training of hippocampal theta waves in Flaxedilized dogs and its relation to skeletal movement. Journal of Comparative and Physiological Psychology, 1970, 70, 15-24.

Black, S. L., & Vanderwolf, C. H. Thumping behavior in the rabbit. Physiology and Behavior, 1969, 4, 445-449.

Bland, B. H., & Vanderwolf, C. H. Effects of dentate gyrus stimulation on behavior and hippocampal electrical activity. Proceedings of the Canadian Federation of Biological Societies, 1970, 13, 47. (a)

Bland, B. H., & Vanderwolf, C. H. Effects of hypothalamic stimulation on behavior and hippocampal EEG. Paper presented at the meeting of the Eastern Psychological Association, Atlantic City, N. J., April 1970. (b)

[...] phénomènes [...] comportement par stimulation thalamique et réticulaire chez le rat non-anaesthésié. Psychologie Française, 1956, [...]

Bloch-Rojas, S., Toro, A., & Pinto-Hamuy, T. Cardiac versus somatomotor conditioned responses in neodecorticate rats. Journal of Comparative and Physiological Psychology, 1964, 58, 233-236.

Bohus, B., & de Wied, D. Avoidance and escape behavior following medial thalamic lesions in rats. Journal of Comparative and Physiological Psychology, 1967, 64, 26-29.

Bolles, R. C. Avoidance and escape learning: Simultaneous acquisition of different responses. Journal of Comparative and Physiological Psychology, 1969, 68, 355-358.

Bolles, R. C., & Seelbach, S. E. Punishing and reinforcing effects of noise onset and termination for different responses. Journal of Comparative and Physiological Psychology, 1964, 58, 127-131.

Bowsher, D. The termination of secondary somatosensory neurons within the thalamus of Macaca mulatta: An experimental degeneration study. Journal of Comparative Neurology, 1961, 117, 213-222.

Breland, K., & Breland, M. The misbehavior of organisms. American Psychologist, 1961, 16, 681-684.


Breland, K., & Breland, M. Animal behavior. New York: Macmillan, 1966.

Brown, B. B. Frequency and phase of hippocampal theta activity in the spontaneously behaving cat. Electroencephalography and Clinical Neurophysiology, 1968, 24, 53-62.

Brown, J. W. Physiology and phylogenesis of emotional expression. Brain Research, 1967, 5, 1-14.

Brücke, F., Petsche, H., Pillat, B., & Deisenhammer, E. Die Beeinflussung der “Hippocampus-arousal-Reaktion” beim Kaninchen durch elektrische Reizung im Septum. Pflügers Archiv für die Gesamte Physiologie, 1959, 269, 319-338.

Brush, F. R., Brush, E. S., & Solomon, R. L. Traumatic avoidance learning: The effects of CS-US interval with a delayed-conditioning procedure. Journal of Comparative and Physiological Psychology, 1955, 48, 285-293.

Buser, P. Subcortical controls of pyramidal activity. In D. P. Purpura & M. D. Yahr (Eds.), The thalamus. New York: Columbia University Press, 1966.

Bykov, K. M. (Ed.) Textbook of physiology. Moscow: Foreign Languages Publishing House, 1958.

Calvet, J., Calvet, M. C., & Langlois, J. M. Diffuse cortical activation waves during so-called desynchronized EEG patterns. Journal of Neurophysiology, 1965, 28, 893-907.

Cardo, B. Rapports entre le niveau de vigilance et le conditionnement chez l'animal. Journal de Physiologie, 1961, 53, 1-212.

Cardo, B. Effets de la stimulation du noyau parafasciculaire thalamique sur l'acquisition d'un conditionnement d'évitement chez le rat. Physiology and Behavior, 1967, 2, 245-248.

Caspers, H. Changes of cortical D.C. potentials in the sleep-wakefulness cycle. In G. E. W. Wolstenholme & M. O'Connor (Eds.), The nature of sleep. London: Churchill, 1961.

[...] Narcolepsy. American Medical Association Archives of Neurology and Psychiatry, 1931, 26, 50-101.

Church, R. M. [...] Journal of Comparative and Physiological Psychology, 1956, 49, 301-308.

Clemente, C. D., Sterman, M. B., & Wyrwicka, W. Post-reinforcement EEG synchronization during alimentary behavior. Electroencephalography and Clinical Neurophysiology, 1964, 16, [...]

Clody, D. E., & Carlton, P. L. Behavioral effects [...] of the medial septum of rats. Journal of Comparative and Physiological Psychology, [...]

Corazza, R., & Parmeggiani, P. L. Central course of the afferent system eliciting the theta rhythm upon sciatic stimulation. Helvetica Physiologica et Pharmacologica Acta, 1963, 21, C10-C12.

Craig, W. Appetites and aversions as constituents of instincts. In T. Verhave (Ed.), The experimental analysis of behavior. New York: Appleton-Century-Crofts, 1966.

Dalton, A. Hippocampal electrical activity in operant conditioning in the dog. Unpublished doctoral dissertation, McMaster University, Hamilton, Ontario, 1968.

Dalton, A., & Black, A. H. Hippocampal electrical activity during the operant conditioning of movement and refraining from movement. Communications in Behavioral Biology, 1968, 2, 267-273.

Daniels, L. E. Narcolepsy. Medicine, 1934, 13, 1-122.

Darwin, C. The expression of the emotions in man and animals. Chicago: University of Chicago Press, 1965. (Originally published: London: Murray, 1872.)

Delacour, J., Albe-Fessard, D., & Libouban, S. Rôle chez le rat de deux noyaux thalamiques dans le conditionnement instrumental. Neuropsychologia, 1966, 4, 101-112.

Delacour, J., & Santacana, M. P. Rôle du thalamus médian dans l'établissement et la rétention de conditionnements défensifs classiques et instrumentaux. Neuropsychologia, 1967, 5, 237-252.

Denny-Brown, D., Gaylor, J. B., & Uprus, V. Note on the nature of the motor discharge in shivering. Brain, 1935, 58, 233-237.

Dirlam, D. K. The effects of septal, thalamic, and tegmental lesions on general activity in the hooded rat. Canadian Journal of Psychology, 1969, 23, 303-314.

Donovick, P. J. Effect of localized septal lesions on hippocampal EEG activity and behavior in rats. Journal of Comparative and Physiological Psychology, 1968, 66, 569-578.

Doty, R. W., Beck, E. C., & Kooi, K. A. Effect of brain-stem lesions on conditioned responses of cats. Experimental Neurology, 1959, 1, 360-385.

Douglas, R. J. The hippocampus and behavior. Psychological Bulletin, 1967, 67, 416-442.

Dovcras, R. J, & RarHELSON, A. C. Septal 
lesions and activity. Journal of Comparative 
and. Physiological Psychology, 1966, 62, 465-467. 

Erernerc, E., Wate, J. C., & Brazier, M. A. B. 
The hippocampal arousal pattern in rabbits. 
Experimental Neurology. 1959, 1, 483-490. 

EKMAN, P., SORENSON, E. R., & FRIESEN, W. V. 
Pan-cultural elements in facial displays of emo- 
tion. Science, 1969, 164, 86-88. 

ELAZAR, Z, & Avery, W. R. Spectral analysis of 
low frequency components in the electrical ac- 
tivity of the hippocampus during learning. 

zlectroencephalography and Clinical Neuro- 

phys logy, 1967, 23, 225-240. 

Enproczt, E, YANG, T. L, Lissák, K, & MeD- 

"The effect of stimulation of the brain 


cyest, P. 


stem on conditioned reflex activity and on be- 
havior. Acta Physiologica Academiae Scien- 
tarum Hungaricae, 1959, 16, 291-297. 

Fearne, F. Reflex action: A study in the history 

of physiological psychology. Baltimore: Wil- 

liams & Wilkins, 1930. 

MAN, S. M, & WALLER, H. J. Dissociation 
electrocortical activation and behavioural 
arousal. Nature, 1962, 196, 1320-1322. 

Fleming, D. E. Response arrest associated with
the corticifugal projection of chemically induced 
local seizure discharge. Electroencephalography 
and Clinical Neurophysiology, 1963, 15, 84-90. 

Flynn, J. P., & Wasman, M. Learning and
cortically evoked movement during propagated
hippocampal after discharges. Science, 1960, 131,
1607-1608.

Fulcher, J. S. “Voluntary” facial expression in
blind and seeing children. Archives of Psychol- 
ogy, 1942, 38, 1-49. 

Fulton, J. F. Physiology of the nervous system.
New York: Oxford University Press, 1949.

Gastaut, H., Jus, A., Jus, C., Morrell, F.,
Storm van Leeuwen, W., Dongier, S., Naquet,
R., Regis, H., Roger, A., Bekkering, D., Kamp,
A., & Werre, J. Étude topographique des
réactions électroencéphalographiques conditionées
chez l'homme. Electroencephalography and
Clinical Neurophysiology, 1957, 9, 1-34.

Gardner, R. A., & Gardner, B. T. Teaching
sign language to a chimpanzee. Science, 1969, 
165, 664-672. 

Gellhorn, E. Principles of autonomic-somatic
integrations. Minneapolis: University of Min- 
nesota Press, 1967. 

Glickman, S. E., & Schiff, B. B. A biological
theory of reinforcement. Psychological Review, 
1967, 74, 81-109. 

Glickman, S. E., Higgins, T., & Isaacson, R. L.
Some effects of hippocampal lesions on the be- 
havior of Mongolian gerbils. Physiology and 
Behavior, 1970, 5, 931-938. 

Goodenough, F. L. Expression of the emotions
in a blind-deaf child. Journal of Abnormal Psy- 
chology, 1932, 27, 328-333. 

Gottlieb, G., & Simner, M. L. Relationship
between cardiac rate and nonnutritive sucking in
human infants. Journal of Comparative and
Physiological Psychology, 1966, 61, 128-131.

Grandstaff, N. W. Frequency analysis of EEG
during milk drinking. Electroencephalography 
and Clinical Neurophysiology, 1969, 27, 57-65. 

Grastyán, E., Lissák, K., Madarász, I., &
Donhoffer, H. Hippocampal electrical activity
during the development of conditioned reflexes.
Electroencephalography and Clinical
Neurophysiology, 1959, 11, 409-430.

Green, J. D. The hippocampus. Physiological
Reviews, 1964, 44, 561-608.

Green, J. D., & Arduini, A. A. Hippocampal
electrical activity in arousal. Journal of
Neurophysiology, 1954, 17, 533-557.


Green, J. D., Maxwell, D. S., Schindler, W. J.,
& Stumpf, C. Rabbit EEG “theta” rhythm: Its
anatomical source and relation to activity in
single neurons. Journal of Neurophysiology,
1960, 23, 403-420.

Gross, C. G., Chorover, S. L., & Cohen, S. M.
Caudate, cortical, hippocampal, and dorsal
thalamic lesions in rats: Alternation and
Hebb-Williams maze performance. Neuropsychologia,
1965, 3, 53-68.

Hakerem, G., & Sutton, S. Pupillary response at
visual threshold. Nature, 1966, 212, 485-486. 
Harper, R. Behavioral and electrophysiological 
studies of sleep and animal hypnosis. Unpublished
doctoral dissertation, McMaster University,
Hamilton, Ontario, 1968.

Hatch, A., Wiberg, G. S., Balazs, T., & Grice,
H. C. Long-term isolation stress in rats.
Science, 1963, 142, 507.

Hebb, D. O. Drives and the C. N. S. (conceptual
nervous system). Psychological Review, 1955,
62, 243-254.

Hebb, D. O. The distinction between “classical”
and “instrumental.” Canadian Journal of
Psychology, 1956, 10, 165-166.

Hebb, D. O. A textbook of psychology.
Philadelphia: Saunders, 1966.

Hebb, D. O., & Thompson, W. R. The social
significance [of animal studies]. In G. Lindzey
(Ed.), Handbook of social psychology.
Cam[bridge, Mass.]: Addison-Wesley, [...]

[Hodes, R.,] Peacock, S. M., & Heath, R. G.
[Influence of] the forebrain on somatomotor
activity. I. Inhibition. Journal of Comparative
Neurology, 1951, 94, 381-408.

Hoff, E. C., Kell, J. F., Jr., & Carroll, M. N., Jr.
Effects of cortical stimulation and lesions
on cardiovascular function. Physiological
Reviews, 1963, 43, 68-114.

Ingram, W. R., Barris, R. W., Fisher, C., &
Ranson, S. W. Catalepsy: An experimental
study. American Medical Association Archives
of Neurology and Psychiatry, 1936, 35,
1175-1197.

Ironside, R. Disorders of laughter due to brain
lesions. Brain, 1956, 79, 589-609.

Ito, M. Hippocampal electrical correlates of
self-stimulation in the rat. Electroencephalography
and Clinical Neurophysiology, 1966, 21, 261-268.

Jackson, W. J. A comment on “The hippocampus
and behavior.” Psychological Bulletin, 1968,
69, 20-22.

Jasper, H., & Penfield, W. Electrocorticograms
in man: Effect of voluntary movement upon the
electrical activity of the precentral gyrus.
Archiv für Psychiatrie und Zeitschrift
Neurologie, 1949, 183, 163-174.

Kawamura, H., & Domino, E. F. [...]

Kawamura, H., [...], & Tokizane, T. [...]
brain stem lesions on the electrical
activities of the limbic system and
neocortex. Japanese Journal of Physiology, 1961,
11, 564-575.

Kellogg, W. N. Communication and language in
the home-raised chimpanzee. Science, 1968, 162,
423-427.

Kenyon, J. The effect of septal lesions upon
motivated behavior in the rat. Unpublished
doctoral dissertation, McGill University,
Montreal, Quebec, 1962.

Kimble, D. P. Possible inhibitory functions of
the hippocampus. Neuropsychologia, 1969, 7,
235-244.

Kimble, D. P., Rogers, L., & Hendrickson, C. W.
Hippocampal lesions disrupt maternal, not
sexual behavior in the albino rat. Journal of
Comparative and Physiological Psychology,
1967, 63, 401-407.

Kimble, G. A. Hilgard and Marquis' conditioning
and learning. New York: Appleton-Century-Crofts,
1961.

Kimmel, H. D. Instrumental conditioning of
autonomically mediated behavior.
Psychological Bulletin, 1967, 67, 337-345.

King, F. A. Effects of septal and amygdaloid
lesions on emotional behavior and conditioned
avoidance responses in the rat. Journal of
Nervous and Mental Disease, 1958, 126, 57-63.

Klemm, W. R. Electroencephalographic-behavioral
dissociations during animal hypnosis.
Electroencephalography and Clinical
Neurophysiology, 1966, 21, 365-372.
Klemm, W. R. [...] immobility re[flex ...]
EEG and multiple [...]

[...] between limbic sys[tem ...] behavior in
rats. Journal of Comparative and Physiological
Psychology, 1970, 70, 482-492.
Kramis, R. C., & Routtenberg, A. Rewarding
brain stimulation, hippocampal activity, and
foot-stomping in the gerbil. Physiology and
Behavior, 1969, 4, 7-11.
Krogh, A., & [Lindhard, J.] The regulation of
respiration [...] during the initial [...]
[muscul]ar work. Journal of Physiology,
1913, 47, 112-136.

[...] A comparison be[tween ...]y induced
mus[cular ...] Physiology, 1917, [...]
Lashley, K. S. The problem of serial order in
behavior. In L. A. Jeffress (Ed.), Cerebral
mechanisms in behavior. New York: Wiley,
1951.


[...] The cholinergic [...]
the hippocampal [...]
nuclei of the ascending [...]
system, and the subfornical
organ [and] supraoptic crest. Brain, 1967, 90, [...]


Lhermitte, F., Gauthier, [...], Marteau, R., &
Chain, F. Troubles de la conscience et
mutisme akinétique. Étude anatomo-clinique d'un
ramollissement paramédian, bilatéral du
pédoncule cérébral et du thalamus. Revue
Neurologique, 1963, 109, 115-131.

Lindsley, D. B. Emotion. In S. S. Stevens
(Ed.), Handbook of experimental psychology. 
New York: Wiley, 1951. 

Lindsley, D. B., & Sassaman, W. H. Autonomic
activity and brain potentials associated with 
“voluntary” control of the pilomotors (MM. 
arrectores pilorum). Journal of Neurophysiol- 
ogy, 1938, 1, 342-349. 

Lindsley, D. B., Schreiner, L. H., Knowles,
W. B., & Magoun, H. W. Behavioral and EEG
changes following chronic brain stem lesions in
the cat. Electroencephalography and Clinical
Neurophysiology, 1950, 2, 483-498.

Lopes da Silva, F. H., & Kamp, A. Hippocampal
theta frequency shifts and operant
behavior. Electroencephalography and Clinical
Neurophysiology, 1969, 26, 133-143.

Low, L. A., & Low, H. I. Effects of CS-US
interval length upon avoidance responding. Jour- 
nal of Comparative and Physiological Psychol- 
ogy, 1962, 55, 1059-1061. 

MacDonnell, M. F., & Flynn, J. P. Control of
sensory fields by stimulation of hypothalamus. 
Science, 1966, 152, 1406-1408. 

MacLean, P. D., Flanigan, S., Flynn, J. P.,
Kim, C., & Stevens, J. R. Hippocampal
function: Tentative correlations of conditioning,
EEG, drug, and radioautographic studies. Yale 
Journal of Biology and Medicine, 1955-1956, 
28, 380-395. 


Magoun, H. W. Caudal and cephalic influences
of the brain stem reticular formation.
Physiological Reviews, 1950, 30, 459-474.

Magoun, H. W. The waking brain. Springfield,
Ill.: Charles C Thomas, 1963.

Maire, F. W. Cited in T. C. Ruch & H. D. Patton
(Eds.), Physiology and biophysics.
Philadelphia: Saunders, 1965, p. 501.

Malmo, R. B. Activation: A neuropsychological
dimension. Psychological Review, 1959, 66, 
367-386. 

Martin, J. P. The basal ganglia and posture. 
London: Pitman, 1967. 

McBride, R. L., & Klemm, W. R. Mechanisms
of the immobility reflex (“animal hypnosis”)
I. Influences of repetition of induction,
restriction of auditory-visual input, and
destruction of brain areas. Communications in
Behavioral Biology, 1969, 3, 33-41.

McCleary, R. A. Response specificity in the
behavioral effects of limbic system lesions in the
cat. Journal of Comparative and Physiological 
Psychology, 1961, 54, 605-613. 

McCleary, R. A. Response-modulating functions
of the limbic system: Initiation and suppression.
Progress in Physiological Psychology, 1966, 1, 
209-272. 

McNew, J. J., & Thompson, R. Role of the limbic
system in active and passive avoidance
conditioning in the rat. Journal of Comparative and
Physiological Psychology, 1966, 61, 173-180. 


Miller, N. E. Central stimulation and other new
approaches to motivation and reward.
American Psychologist, 1958, 13, 100-108.

Miller, N. E. Learning of visceral and glandular
responses. Science, 1969, 163, 434-445. 

Milner, P. Physiological psychology. New York:
Holt, Rinehart & Winston, 1970. 

Mitchell, C. L., & Kaelber, W. W. Unilateral
vs. bilateral medial thalamic lesions and reactiv- 
ity to noxious stimuli. American Medical Asso- 
ciation Archives of Neurology, 1967, 17, 653-660. 

Mogenson, G. J., & Peterson, R. J. Effects of
spreading cortical depression on cardiac and 
somatomotor conditioned responses. Canadian 
Journal of Physiology and Pharmacology, 1966, 
44, 39-45. 

Monnier, M., Hösli, L., & Krupp, P. Moderating
and activating systems in medio-central
thalamus and reticular formation. In R.
Hernández-Peón (Ed.), The physiological basis of
mental activity. Electroencephalography and
Clinical Neurophysiology, 1963, Suppl. No. 24,
97-112.

Mountcastle, V. B. Medical physiology. Vol.
1. Saint Louis: Mosby, 1968. 

Nunnally, J. C., Knott, P. D., Duchnowski,
A., & Parker, R. Pupillary response as a
general measure of activation. Perception and
Psychophysics, 1967, 2, 149-155.

Obrist, P. A. Heart rate and somatic-motor
coupling during classical aversive conditioning in
humans. Journal of Experimental Psychology, 
1968, 77, 180-193. 

Obrist, P. A., & Webb, R. A. Heart rate during
conditioning in dogs: Relationship to somatic- 
motor activity. Psychophysiology, 1967, 4, 7-34. 

Obrist, P. A., Webb, R. A., & Sutterer, J. R.
Heart rate and somatic changes during aversive
conditioning and a simple reaction time test.
Psychophysiology, 1969, 5, 696-723.

Olds, M. E., & Olds, J. Approach-avoidance
analysis of rat diencephalon. Journal of Com- 
parative Neurology, 1963, 120, 259-295. 

Olton, D. S., & Isaacson, R. L. Effects of
lateral and dorsomedial thalamic lesions on
retention of active avoidance tasks. Journal of
Comparative and Physiological Psychology,
1967, 64, 256-261.

Paxinos, G., & Bindra, D. Rewarding intracranial
stimulation, movement, and the hippocampal
theta rhythm. Physiology and Behavior,
1970, 5, 227-231.

Peacock, S. M., Jr., & Hodes, R. Influence of
the forebrain on somatomotor activity: II.
Facilitation. Journal of Comparative Neurology,
1951, 94, 409-426.

Perez-Cruet, J., & Gantt, W. H. Relation
between heart rate and “spontaneous” movements.
Bulletin of the Johns Hopkins Hospital, 1959,
105, 315-321.

Petsche, H., Stumpf, C., & Gogolák, G. The
significance of the rabbit's septum as a relay
station between the midbrain and the hippocampus.
I. The control of hippocampal arousal activity
by the septum cells. Electroencephalography
and Clinical Neurophysiology, 1962, 14, 202-211.

Pommerenke, W. T. A study of the sensory areas
eliciting the swallowing reflex. American
Journal of Physiology, 1928, 84, 36-41.

Pompeiano, O. The neurophysiological
mechanisms of the postural and motor events during
desynchronized sleep. Research Publications,
Association for Research in Nervous and
Mental Disease, 1967, 45, 351-423.

Posluns, D. An analysis of chlorpromazine-induced
suppression of the avoidance response.
Psychopharmacologia, 1962, 3, 361-373. 

Posluns, D., & Vanderwolf, C. H. Improved
avoidance performance following exploratory
movement. Canadian Journal of Psychology,
1970, 24, 499-504.

Radulovački, M., & Adey, W. R. The hippocampus
and the orienting reflex. Experimental
Neurology, 1965, 12, 68-83.

Replogle, A. Motivational effects of lesions in
the thalamus. Unpublished master's thesis,
McGill University, 1960.

Rich, I., & Thompson, R. Role of the
hippocampo-septal system, thalamus, and hypothalamus
in avoidance conditioning. Journal of
Comparative and Physiological Psychology, 1965, 59,
66-72.

[...] criminative classical [...]
Paper presented at the meeting of the Eastern
Psychological Association, [Atlantic City,]
N. J., April 1970.

Roberts, W. W. Fear-like behavior elicited from
dorsomedial thalamus of cat. Journal of
Comparative and Physiological Psychology, 1962, 55,
191-197.

Roberts, W. W., & Carey, R. J. Effect of
dorsomedial thalamic lesions on fear in cats. Journal
of Comparative and Physiological Psychology,
1963, 56, 950-958.

Roth, S. R., Sterman, M. B., & Clemente, C. D.
Comparison of EEG correlates of reinforcement,
internal inhibition and sleep.
Electroencephalography and Clinical Neurophysiology, 1967, 23,
509-520.

Routtenberg, A., & Kramis, R. C. Hippocampal
correlates of aversive midbrain stimulation.
Science, 1968, 160, 1363-1365.

Rushmer, R. F., & Smith, O. A., Jr. Cardiac
control. Physiological Reviews, 1959, 39, 41-68.

Sailer, S., & Stumpf, C. Beeinflussbarkeit der
rhinencephalen Tätigkeit des Kaninchens.
Naunyn-Schmiedeberg's Archiv für die
Experimentelle Pathologie und Pharmakologie, 1957,
231, 63-77.

Sainsbury, R. S. Hippocampal activity during
natural behavior in the guinea pig. Physiology
and Behavior, 1970, 5, 317-324.

Schrier, A. M. Effects of CS-US interval on
avoidance conditioning of cats. Journal of
Genetic Psychology, 1961, 98, 203-210.


Schwartz, M. Conditioned stimulus variables in
avoidance learning. Journal of Experimental 
Psychology, 1958, 55, 347-351. 

Sherrington, C. S. The integrative action of the
nervous system. New Haven, Conn.: Yale
University Press, 1961. (Originally published:
London: Scribner, 1906.)

Simner, M. L. Relationship between cardiac rate
and vocal activity in newly hatched chicks.
Journal of Comparative and Physiological
Psychology, 1966, 61, 496-498.

Simpson, H. M., & Paivio, A. Changes in pupil
size during an imagery task without motor
response involvement. Psychonomic Science, 1966,
5, 405-406.


Simpson, H. M., & Paivio, A. Effects on pupil
size of manual and verbal indicators of cognitive
task fulfillment. Perception and Psychophysics,
1968, 3, 185-190.

Smith, B. M. The polygraph. Scientific
American, 1967, 216, 25-31.

Smith, O. A., Jr., Rushmer, R. F., & Lasher,
E. P. Similarity of cardiovascular responses to
exercise and to diencephalic stimulation.
American Journal of Physiology, 1960, 198, 1139-1142.

Sommer-Smith, J. A., Powarzynski, [...], Stirner,
A., & Grünberg, V. Décharges cellulaires du [...]
Neurologica Latinoamerica, [...]

Storm van Leeuwen, W., Kamp, A., [...]
& Tielen, A. M. Rel[...]
dogs and electrical activities in [...]
the brain. In E. A. [...]
in brain research. [...]
Elsevier, 1967.

Stumpf, C. Drug action on the electrical activity
of the hippocampus. International Review of
Neurobiology, 1965, 8, 77-138. (a) 

Stumpf, C. The fast component in the electrical
activity of the rabbit's hippocampus.
Electroencephalography and Clinical Neurophysiology,
1965, 18, 477-486. (b)

Taylor, J. (Ed.) Selected writings of John Hughlings
Jackson. [London]: Staples Press, 1958.

Thompson, J. Development of facial expression
in blind and seeing children. Archives of
Psychology, [...] No. 264, 1-47.

Thompson, R. Thalamic structures critical for
[...] conditioned response
in rats. Journal of Comparative and Physiological
Psychology, 1963, 56, 261-267.

Tinbergen, N. The study of instinct. London:
Oxford University Press, 1951.

Tizard, B., & Margerison, J. H. Psychological
functions during wave-spike discharge. British
Journal of Social and Clinical Psychology, 1963,
3, 6-15.

Tolman, E. C. Cognitive maps in rats and men.
Psychological Review, 1948, 55, 189-208.

Torii, S. Two types of pattern of hippocampal
electrical activity induced by stimulation of
hypothalamus and surrounding parts of rabbit's
brain. Japanese Journal of Physiology, 1961,
11, 147-157.

Torii, S., & Kawamura, H. Effects of amygdaloid
stimulation on blood pressure and electrical
activity of hippocampus. Japanese Journal of 
Physiology, 1960, 10, 374-384. 

Uprus, V., Gaylor, G. B., & Carmichael, E. A.
Shivering: A clinical study with especial
reference to the afferent and efferent pathways.
Brain, 1935, 58, 220-237. 

Vanderwolf, C. H. Medial thalamic functions in
voluntary behaviour. Canadian Journal of 
Psychology, 1962, 16, 318-330. 

Vanderwolf, C. H. The effect of medial thalamic
lesions on previously established fear-motivated
behaviour. Canadian Journal of Psychology,
1963, 17, 183-187. (a)

Vanderwolf, C. H. Improved shuttle-box
performance following electroconvulsive shock.
Journal of Comparative and Physiological
Psychology, 1963, 56, 983-986. (b)

Vanderwolf, C. H. Effect of combined medial
thalamic and septal lesions on active avoidance 
behavior. Journal of Comparative and Physio- 
logical Psychology, 1964, 58, 31-37. 

Vanderwolf, C. H. Warm-up effects in the
avoidance performance of rats with medial 
thalamic lesions. Animal Behaviour, 1966, 14, 
425-429. 

Vanderwolf, C. H. Behavioural correlates of
“theta” waves. Proceedings of the Canadian
Federation of Biological Societies, 1967, 10,
41-42. (a)

Vanderwolf, C. H. Effects of experimental
diencephalic damage on food hoarding and shock
avoidance behavior in the rat. Physiology and 
Behavior, 1967, 2, 399-402. (b) 

Vanderwolf, C. H. Recovery from large medial
thalamic lesions as a result of electroconvulsive
therapy. Journal of Neurology, Neurosurgery,
and Psychiatry, 1968, 31, 67-72.

Vanderwolf, C. H. Effects of chlorpromazine on
avoidance behavior and on hippocampal electrical 
activity. Paper presented at the meeting of the 
Canadian Physiological Society, St. Adele, Que- 
bec, January 1969. (a) 

Vanderwolf, C. H. Effects of medial thalamic
damage on initiation of movement and learning. 
Psychonomic Science, 1969, 17, 23-25. (b) 

Vanderwolf, C. H. Hippocampal electrical activity
and voluntary movement in the rat.
Electroencephalography and Clinical Neurophysiology,
encephalography and Clinical Neurophysiology, 
1969, 26, 407-418. (c) 

Vanderwolf, C. H., & Heron, W. Electroencephalographic
waves with voluntary movement:
Study in the rat. American Medical Association
Archives of Neurology and Psychiatry,
1964, 11, 379-384.

Vanderwolf, C. H., & Vanderwart, M. L. Relations
of heart rate to motor activity and
arousal in the rat. Canadian Journal of
Psychology, 1970, 24, 434-441.

Vanderwolf, C. H., & Whishaw, I. Q. Is
hippocampal electrical activity altered by training?
Paper presented at the meeting of the
Psychonomic Society, St. Louis, Missouri, November
1969.

Wahlsten, D., Cole, M., Sharp, D., & Fantino,
E. Facilitation of bar-press avoidance by han- 
dling during the intertrial interval. Journal of 
Comparative and Physiological Psychology, 1968, 
65, 170-175. 

Warren, J. M., & Akert, K. Impaired problem
solving by cats with thalamic lesions. Journal 
of Comparative and Physiological Psychology, 
1960, 53, 207-211. 

Whatmore, G. B., & Kohli, D. R. Dysponesis:
A neurophysiologic factor in functional disorders. 
Behavioral Science, 1968, 13, 102-124. 

Whishaw, I. Q. The influence of body temperature
on hippocampal electrical activity in freely
moving rats. Paper presented at the meeting 
of the Eastern Psychological Association, Atlan- 
tic City, N. J., April 1970. 

Whishaw, I. Q., & Vanderwolf, C. H. Hippocampal
correlates of movement. Proceedings of
the Canadian Federation of Biological Societies, 
1970, 13, 48. 

Wilson, S. A. K. Some problems in neurology.
II. Pathological laughing and crying. Journal
of Neurology and Psychopathology, 1924, 4,
299-333.

Wilson, S. A. K. The narcolepsies. Brain, 1928,
51, 63-109.

Wishart, T. The effects of lesions of the
hippocampus, septum, and medial thalamus on a
passive avoidance task. Unpublished master's
thesis, University of Western Ontario, 1968.

Woods, J. W. Behavior of chronic decerebrate
rats. Journal of Neurophysiology, 1964, 27,
634-644.

Yokota, T., & Fujimori, B. Effects of brain-stem
stimulation upon hippocampal electrical
activity, somatomotor reflexes and autonomic
functions. Electroencephalography and Clinical
Neurophysiology, 1964, 16, 375-383.

Yoshii, N., Shimokochi, M., Miyamoto, K., &
Ito, M. Studies on the neural basis of
behavior by continuous frequency analysis of EEG.
In T. Tokizane & J. P. Schadé (Eds.), Progress
in brain research. Correlative neurosciences:
Fundamental mechanisms. Amsterdam:
Elsevier, 1966.

Young, G. A., & Black, A. H. The relationships
between electrical activity of the hippocampus
and overt skeletal behavior in the dog and rat.
Paper presented at the meeting of the Eastern
Psychological Association, Atlantic City, N. J.,
April 1970.
(Received June 2, 1970) 


TWO 


Psychological Review
1971, Vol. 78, No. 2, 114-121


Cam 
1968 
1963 


eta; 
publ 
deve 


the tilt aftereffe 
in terms of th 


ing possible analogou 


VISUAL FEATURE-ANALYZERS AND AFTEREFFECTS OF TILT AND CURVATURE 1


MAX COLTHEART 2


University of Waterloo


satiation.” 


ly (e.g. Hubel & Wiese 
and 

pbell & Kulik 

Sev 


Je 


; 


Psychophy 
iled explanatio 
ished, 


lop an ex 


ner, 1937). 
fic contour 


forms of aftereffect. 


The 


Weis 


(a) The 
the cat 


TATE 


relevant 
els findings 


aspects of Hubel 
are, briefly, 

re exist units in the visu 
and the monkey which 


and 
as follows: 
al cortex of 
respond to 


1 The preparation of this paper was supported by
Grant No. APA-0350 from the National Research
[...]. I am indebted to Charles S. [...]
[...] suggestions and for criticisms
[...] draft.

2 Requests for reprints should be sent to Max
Coltheart, [...] Psychology, University
[of Waterloo, ...] Canada.


stimulation. 
is in a state 
1 it recovers 
8 normal sensi- 
it responds best 
apPropriate orienta- 
preferred orientation"). 
Strongly, to 
€nt orienta. 


nulation, the unit 
ty, from whic] 
regainin 


n and the 
No response 

ce exceeds a certain 
° in the cat, +5° to 10° 
(Hubel & Wiesel, 1968, p. 
i ue for humans 
below), but all 


data that when a 
g inspected, a 
nits will be 


Most strongly, 
Strongly, their 


[...] degree of departure of the stimulus orientation
from the unit's preferred orientation. We [...]
yielded by this pattern of response is given [...]
to which the unit is responding above its
maintained discharge level.

Let us assume, for expository purposes,
that for humans the range of orientations to
which a single unit will respond is ±20°
about its preferred orientation (as suggested [...]
figure, is now inspected, units from −20°
to +20° will be stimulated. Units with
preferred orientations from −20° to −10° will
respond with normal vigor, since they were
not stimulated during inspection of the I
figure, and hence will not have adapted.
However, units with preferred orientations
from −10° to +20° will be in a state of
reduced sensitivity because they were
stimulated throughout the inspection period.
They will therefore respond less strongly to
the vertical stimulus than they normally would.
As a consequence, the weighted average response
to the vertical line will be biased in the
negative direction, instead of being 0°, as would
normally be the case. Therefore, the vertical
line will be perceived as having a
counterclockwise tilt; that is, there will be a
negative aftereffect.
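
The weighted-average argument above can be illustrated with a small numerical sketch. It is only a sketch under stated assumptions, not the author's model: the linear fall-off of a unit's response, the halved sensitivity of adapted units (`ADAPTED_GAIN`), the one-degree spacing of units, and the +10° inspection line used in the example are all hypothetical choices made for illustration; only the ±20° range comes from the text.

```python
HALF_WIDTH = 20      # range of orientations a unit responds to (+/-20 deg, from the text)
ADAPTED_GAIN = 0.5   # hypothetical sensitivity of an adapted unit (1.0 = unadapted)


def weighted_average(test_deg, inspect_deg=None):
    """Response-weighted mean preferred orientation of the units stimulated
    by a test line at test_deg (integer degrees), optionally after prolonged
    inspection of a line at inspect_deg."""
    num = 0.0
    den = 0.0
    for offset in range(-HALF_WIDTH, HALF_WIDTH + 1):
        pref = test_deg + offset  # preferred orientation of one unit
        # Assumed tuning: response falls off linearly as the stimulus
        # departs from the unit's preferred orientation.
        response = 1.0 - abs(offset) / (HALF_WIDTH + 1)
        # Units within +/-HALF_WIDTH of the inspected line were stimulated
        # during inspection, so they now respond with reduced sensitivity.
        if inspect_deg is not None and abs(pref - inspect_deg) <= HALF_WIDTH:
            response *= ADAPTED_GAIN
        num += response * pref
        den += response
    return num / den


if __name__ == "__main__":
    print("vertical line, no inspection:", weighted_average(0))
    print("vertical line after +10 deg inspection:", weighted_average(0, inspect_deg=10))
```

With these assumptions the unadapted average for a vertical line is 0°, while after inspection of a +10° line the average becomes negative, which is the direction of aftereffect the paragraph describes. The size of the shift depends entirely on the assumed gain and tuning shape.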


COMPARISON WITH OTHER THEORIES 


There are two major theories of the tilt 
aftereffect: the normalization theory (Gib- 
son & Radner, 1937) and the satiation theory 
(Kohler & Wallach, 1944). According to 
the normalization theory, the vertical and 
the horizontal are norms of visual space, and 
the effect of viewing a line which departs 
somewhat from one of these norms is to 
shift the norm in the direction of the
inspected line. A line which formerly
coincided with this norm will now appear to
depart from it in a direction opposite to the
direction of tilt of the I figure. Thus a nega- 
tive aftereffect will occur. 

Difficulties with this theory have been 
pointed out by Morant and Harris (1965). 
The main difficulty is that inspection of an 
I figure which coincides with a norm (i.e.,
is vertical or horizontal) generates an
aftereffect (Kohler & Wallach, 1944, pp.
309-311; confirmed by Templeton, Howard, &
Easting, 1965). This is clearly inconsistent 
with the normalization theory because this 
kind of I figure could not distort the norms 
of visual space. On the other hand, this 
effect would be expected to occur if the tilt 
aftereffect is a consequence of the adaptation 
of a range of orientation-specific units, be- 
cause this adaptation would be produced by 
I figures of any orientation. 

According to the satiation theory, pro- 
tracted inspection of a visual stimulus leads 
to “satiation” of the area of the visual cortex 
to which this stimulus is projected. Stimuli 
subsequently projected to adjacent cortical 
areas will be displaced from the satiated 
area. This theory, too, suffers from an
insuperable difficulty: the “indirect effect.”
Viewing a line tilted slightly clockwise from
the vertical will make an objectively vertical
line appear to have anticlockwise tilt (the
“direct effect”); it will also make an
objectively horizontal line appear to be slightly
tilted counterclockwise from the horizontal
(the “indirect effect”). This indirect effect,
discovered by Gibson and Radner (1937) 
and verified by Morant and Harris (1965), 
cannot be explained by the satiation theory. 

The indirect effect can be understood in
terms of the analysis being proposed in this
paper, by virtue of a property exhibited by
one class of orientation-sensitive unit, the
“higher order hypercomplex cell” (Hubel
& Wiesel, 1965, 1968). This type of unit
has two preferred orientations, 90° apart.
Viewing a +10° line would adapt such units
with preferred orientations from −10° to
+30° and +80° to +120°. Now, when a
+90° line is viewed, the adapted state of the
+80° to +120° units will bias the weighted
average response in an [anticlockwise]
direction, thereby yielding an “indirect effect.”
Clearly, units of this type cannot be the
only units involved in tilt detection;
otherwise we could not discriminate 0° from 90°.
Simple cells, complex cells, and lower order
hypercomplex cells, which all have single
preferred orientations, must also be involved.
This means that the adaptation of several
classes of cell contributes to the direct
effect, whereas the indirect effect is produced
by the adaptation of only [...]
It is not surprising, therefore, that
[the indirect effect is smaller than the direct]
effect, a result which embarrasses the
normalization theory. [...]
direct effect to a [...] between
the vertical and horizontal norms. If [...]
the norms is displaced, the other must be too.
He then sug[gested ...] indirect effect is
smaller [...], there being
“a certain amount of ‘play’ or ‘lost motion’
between them [Gibson & Radner, 1937,
p. 464].” This explanation is patently ad
hoc.
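
The reasoning about the indirect effect can be made concrete with a short numerical sketch. It rests on assumptions that are not in the text: two equally numerous populations of units (one with a single preferred orientation, one with two preferred orientations 90° apart), linear tuning, one-degree spacing, and a hypothetical halving of sensitivity in adapted units (`ADAPTED_GAIN`). Only the ±20° range, the +10° inspection line, and the 0° and 90° test lines are taken from the example above.

```python
HALF_WIDTH = 20      # range of orientations a unit responds to (+/-20 deg, from the text)
ADAPTED_GAIN = 0.5   # hypothetical sensitivity of an adapted unit (1.0 = unadapted)


def angular_gap(a, b, period):
    """Smallest separation between orientations a and b when orientations
    repeat every `period` degrees."""
    d = (a - b) % period
    return min(d, period - d)


def aftereffect(test_deg, inspect_deg):
    """Shift (in degrees) of the response-weighted mean orientation for a
    test line, pooling single-orientation and dual-orientation units."""
    num = 0.0
    den = 0.0
    # period 180: ordinary units with one preferred orientation.
    # period 90: units with two preferred orientations 90 deg apart, which
    # are therefore also adapted by a line 90 deg away from the test region.
    for period in (180, 90):
        for offset in range(-HALF_WIDTH, HALF_WIDTH + 1):
            pref = test_deg + offset
            response = 1.0 - abs(offset) / (HALF_WIDTH + 1)  # assumed linear tuning
            if angular_gap(pref, inspect_deg, period) <= HALF_WIDTH:
                response *= ADAPTED_GAIN  # unit was stimulated during inspection
            num += response * pref
            den += response
    return num / den - test_deg


if __name__ == "__main__":
    print("direct effect (test 0 deg):", aftereffect(0, 10))
    print("indirect effect (test 90 deg):", aftereffect(90, 10))
```

Under these assumptions both shifts are negative, and the indirect shift is the smaller of the two, because at 90° only the dual-orientation population is adapted while the unadapted single-orientation units dilute the bias. The relative sizes depend on the assumed proportions of the two populations.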


[...]ation theory nor [...] consistent with what
[...]fect in terms of the [...] units in
the visual system is capable of encompassing
the major features of the tilt aftereffect. The
direct relationship between aftereffect
magnitude and duration of inspection of the
I figure, and the inverse relationship between
aftereffect magnitude and time since I-figure
offset, are obviously deducible from this
analysis. So is the “distance paradox”: In
general, the magnitude of the tilt aftereffect
is inversely related to the angular separation
between the I figure and the T figure,
except when the two figures have almost
identical orientations, [when] the aftereffect is
[...] increase in the angular [...]
two figures can produce a larger a[ftereffect].
This is to be expected [...]
I figure and T figure al[...]
distribution of adapted units is a[...]
symmetrical about the T [...]
when the two figures differ [...]
will a sizable [...]
[...]ating the magnitude of the tilt aftereffect
to the difference between T-figure orientation
and I-figure orientation has been described in
detail (e.g., Morant & Harris, 1965). If the
selectivity of orientation-specific units (i.e.,
the range of orientations capable of stimulating
a given unit) were known, then it would
be possible to predict the exact form of the
angular function of the tilt aftereffect, rather
than [...]icting its curvilinearity.
This would be a powerful test of any theory
attempting to explain the tilt aftereffect in
terms of the adaptation of orientation-specific
units. Unfortunately, precise physiological
[...] selectivity in humans are as [...]
attempts to estimate it [...]
data have yielded very [...]
Consequently it would be
premature to attempt to predict the function
relating aftereffect magnitude to I-figure
orientation using data from experiments
directly investigating orientation selectivity.


TWO DIFFICULTIES


An analysis of the tilt aftereffect in terms
of the adaptation of orientation-specific units
appears to offer considerable promise.
Difficulties for such an analysis are raised by
the distinction between adaptation and
aftereffect, and also by Day and Wade's (1969)
conclusion that it is orientation with respect
to gravity, not to the retina, that is crucial
for the tilt aftereffect.

[...] Parlee (1969), using single lines for both test
and adapting stimuli, found that even when the two
[...]red in orientation by 90°, some contour-[...]

Two sorts of effect are noted during ex- 
perimental studies of the tilt aftereffect. 
An aftereffect is manifested during the post- 
inspection presentation of the T figure, and 
is measured by a comparison of preinspection 
and postinspection responses to the T figure. 
Adaptation (which will be called
“neutralization,” following Taylor, 1962, in view
of the variety of uses of the term
“adaptation”) is manifested during the inspection
period by a progressive change over time in 
the appearance of the I figure, which appears 
to approach verticality if it was originally 
close to the vertical, or horizontality if it 
was originally close to the horizontal (Gib- 
son, 1934; Gibson & Radner, 1937; Morant 
& Mistovich, 1960; Vernon, 1934). 

It might seem obvious that neutralization 
and aftereffect are merely two facets of a 
single process, as Gibson originally assumed;
he still holds this view (Gibson, 1966, p.
309). However, the analysis of the tilt
aftereffect in terms of orientation-specific 
units applies only to aftereffects, not to neu- 
tralization. There is no way to make such 
an analysis predict changes in the appear- 
ance of the I figure while it is being con- 
tinuously observed. The I figure will adapt 
units equally on both sides of its own 
orientation, so no bias will be introduced. 
This does not indicate that the proposed 
analysis is deficient, however, because the 
aftereffect and neutralization may well be 
quite separate effects. They behave
differently in several ways: for example, an I
figure of any orientation produces an
aftereffect, whereas vertical, horizontal, or 45°
I figures do not produce neutralization.
Second, neutralization is not retinally lo- 
calized, whereas the magnitude of the after- 
effect declines markedly as the retinal loca- 
tion of the I and T figures are moved apart
(Morant & Mikaelian, 1960).* Consequently, 
as Taylor (1962) has pointed out, the two 
phenomena require different explanations, 
and a single theory cannot explain both.
Indeed, Morant and Harris (1965) concluded
that tilt experiments involve two underlying
processes, satiation and normalization. They
suggested that neutralization and the
"indirect effect" reflect normalization, whereas
the aftereffect is typically the algebraic sum
of normalization and satiation effects.

* Since this was written, Muir and Over (1970)
have reported a failure to replicate Morant and
Mikaelian's results.
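
The claim that an inspection figure adapts units equally on both sides of its own orientation, and so biases only test figures at other orientations, can be illustrated with a small population sketch. Everything specific here is an assumption made for illustration and is not taken from the text: Gaussian tuning curves, a multiplicative loss of gain centred on the adapting orientation, a vector-average read-out, and the parameter values.

```python
import math


def tuning(pref_deg, stim_deg, width_deg=15.0):
    """Assumed Gaussian tuning on orientation difference (period 180 deg)."""
    d = (stim_deg - pref_deg + 90.0) % 180.0 - 90.0
    return math.exp(-0.5 * (d / width_deg) ** 2)


def perceived(test_deg, adapt_deg=None, strength=0.5, width_deg=15.0):
    """Decode orientation as a response-weighted vector average of the
    preferred orientations of a bank of units spaced 1 deg apart."""
    sx = sy = 0.0
    for pref in range(-90, 90):
        gain = 1.0
        if adapt_deg is not None:
            # Assumption: adaptation lowers gain most for units tuned
            # to the adapting (I figure) orientation.
            gain = 1.0 - strength * tuning(pref, adapt_deg, width_deg)
        r = gain * tuning(pref, test_deg, width_deg)
        # Angles are doubled because orientation repeats every 180 deg.
        sx += r * math.cos(math.radians(2 * pref))
        sy += r * math.sin(math.radians(2 * pref))
    return 0.5 * math.degrees(math.atan2(sy, sx))


def aftereffect(test_deg, adapt_deg):
    """Shift in decoded test orientation produced by adaptation."""
    return perceived(test_deg, adapt_deg) - perceived(test_deg)
```

Under these assumptions `aftereffect(0.0, 10.0)` is negative (the vertical test is displaced away from the adapting orientation), while `aftereffect(0.0, 0.0)` is essentially zero, which is the sense in which such an analysis predicts an aftereffect but no drift in the appearance of the I figure itself.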

It is only the aftereffect which is here 
attributed to the adaptation of orientation- 
specific units. A possible neurophysiological 
mechanism for the neutralization effect re- 
mains to be discovered. One such mecha- 
nism might involve units which signal, not 
orientation, but departure from the vertical 
or horizontal. If response rate were pro- 
portional to departure from the “norm,” and 
if adaptation reduced response rate, then
neutralization effects would occur during 
prolonged viewing. Whether this type of 
unit contributes to the aftereffect, or is 
merely responsible for neutralization, re- 
mains to be determined. Templeton et al. 
(1965) found that the aftereffect obtained 
with a vertical I figure was significantly 
smaller than the aftereffect obtained with a 
+10° I figure. Differences between their 
two conditions make this comparison a dubi- 
ous one, but if the result is accepted, it might 
be interpreted as being due to the fact that
when the I figure is vertical, only
orientation-specific units are adapted, whereas
both orientation-specific units and
“norm-deviation” units are adapted by an
oblique I figure. This interpretation is quite
consistent with the view that the mechanisms
underlying neutralization are not identical
with the mechanisms underlying the aftereffect.5

5 It might be suggested that differences in the
properties of neutralization and aftereffect could
be explained in terms of differences in the
sensitivity curves of units having different preferred
orientations. Such differences were postulated by
Andrews (1967), who inferred from his data that
units maximally sensitive to vertical, horizontal,
or 45° contours have symmetrical sensitivity
distributions, whereas other orientation-specific units
have skewed sensitivity distributions. Perhaps,
then, units with symmetrical sensitivity
distributions would not, while adapting out, lead to a
drift of apparent orientation, but would still
produce an aftereffect. How this unitary explanation
could be worked out in detail is not evident, and
at any rate it fails to account for the fact that
neutralization is not retinally localized, whereas the
aftereffect is (Morant & Mikaelian, 1960).
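
The suggested "norm-deviation" unit can be put in a minimal numerical form. This is only a sketch under assumptions that are not in the text: the signalled tilt is taken to be the true tilt multiplied by a gain, and the gain is assumed to decay exponentially toward a floor during inspection. The time constant and floor are arbitrary illustrative values.

```python
import math


def apparent_tilt(true_tilt_deg, t_seconds, tau=30.0, floor=0.5):
    """Tilt signalled by a hypothetical unit whose response rate is
    proportional to departure from the vertical and whose gain falls
    with continued inspection (assumed exponential decay)."""
    gain = floor + (1.0 - floor) * math.exp(-t_seconds / tau)
    return true_tilt_deg * gain
```

With these assumptions a 10° line is signalled as 10° at the start of inspection and as progressively less tilted thereafter, whereas a vertical line (0°) shows no change, in line with the observation that vertical I figures do not produce neutralization.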

The second difficulty for this analysis is 
raised by the work of Day and Wade (1969),
who used body tilt in order to separate ret- 
inal and gravitational orientation. They 
found a tilt aftereffect when the I and T fig- 
ures were retinally identical, but differed 
in gravitational orientation. This finding 
poses a problem for any analysis that is 
based on units sensitive to retinal orienta- 
tion, as is considered to be the case for the 
type of unit studied by Hubel and Wiesel 
(1962, 1965, 1968). Fortunately, recent 
physiological evidence (Horn & Hill, 1969; 
Spinelli, 1970) has revealed that there are 
visual-system units which respond to particular
gravitational orientations. In these
experiments, receptive-field orientation in the
cat was studied during body tilt. For a small
number of orientation-specific units, the
receptive-field axis did not tilt as much as the
cat’s retina did. In fact, some units
exhibited almost no change in receptive-field
axis (relative to the environment) with a
head tilt of 30°; that is, these units were
responding almost entirely to gravitational
orientation rather than retinal orientation.

There seems to be no doubt that a difference
between gravitational orientations of
I and T figures is sufficient to generate an
aftereffect, which could be mediated by the
gravitation-sensitive units mentioned above.
However, Day and Wade also argued that a
difference in gravitational orientation is
necessary for a tilt aftereffect, since they
found no aftereffect when the I and T figures
differed in retinal orientation but had
the same gravitational orientation. This
second conclusion is difficult to accept, and
further work is required. Day and Wade
did obtain aftereffects in the predicted
direction in two experiments when only retinal
orientation was varied (though the effects
were not significant); moreover, the
aftereffect was much greater when both retinal
and gravitational orientation were varied
than when only the latter was varied.
Moreover, as they mention, ocular countertorsion
(which occurs when the body is tilted)
would enhance any aftereffect attributable to
gravitational orientation and reduce any
aftereffect attributable to retinal orientation.
Clearly this is an issue which needs further
investigation. The existence of an aftereffect
due solely to differences in gravitational
orientation, at any rate, is not inconsistent
with the view that tilt aftereffects are
due to the adaptation of orientation-specific
units, since some such units are responsive
to gravitational orientation.


OTHER FORMS OF AFTEREFFECT

There are two other aftereffects whose
properties are similar to those of the tilt
aftereffect: the curvature aftereffect (Carlson,
1963; Gibson, 1933) and the “third-dimensional”
or median-plane tilt aftereffect
(Bergman & Gibson, 1959; Kohler &
Emery, 1947; Wenderoth, 1967; Wenderoth,
Rodger, & Curthoys, 1968). The resemblances
between these aftereffects and the
tilt aftereffect suggest that it might be
fruitful to attempt to analyze them in the same
way as the tilt aftereffect.

Prolonged inspection of a curved line
makes a straight line appear to be curved
in the opposite direction. This hints at the
existence of units sensitive to direction or
degree of curvature. For example, an
analysis of the curvature aftereffect
precisely analogous to that given for the tilt
aftereffect could be developed starting from
the postulate that units exist which respond
best to contours having a particular degree
of curvature, but which also respond (though
less strongly) to contours whose curvature
approximates the preferred curvature. One
prediction entailed by this analysis is that
an extremely curved I figure should produce
a smaller curvature aftereffect than an I
figure of lesser curvature, assuming a
straight-line T figure. This was found by Bales and
Follansbee (1935).

Blakemore and Sutton [...] that a different
form of aftereffect, the displacement
aftereffect (Kohler & [...] be explicable in
[...] specific units.

It is not difficult to imagine how a set of
orientation-specific cells could be wired up
to a higher order unit in such a way as to
produce a unit maximally sensitive to
contours having a particular degree of
curvature. In fact, Hubel and Wiesel have
occasionally made guarded references to an
interpretation of some of the units they 
have found as curvature detectors (Hubel 
& Wiesel, 1965, p. 286; 1968, p. 238). Fur- 
thermore, one class of unit described in the 
frog by Lettvin, Maturana, Pitts, and 
McCulloch (1961) has as its preferred 
stimulus a curved contour. 

The median-plane tilt aftereffect occurs
after inspection of a line tilted toward or 
away from the observer in his median plane: 
an objectively vertical line will now appear 
tilted in the median plane in the opposite 
direction. An analogous explanation of this 
aftereffect would require the existence of 
units sensitive to tilt in the median plane and 
with a preferred median-plane orientation. 
Such units have not been observed; but no 
one has looked for this particular type of 
unit, and its feasibility is indicated by the 
work of Barlow, Blakemore, and Pettigrew 
(1967) and Pettigrew, Nikara, and Bishop 
(1968), who showed that different binoc- 
ularly driven cortical units have different 
preferred receptive-field disparities. This 
means that with convergence fixed, different 
units are maximally excited by stimuli at dif- 
ferent distances. If a number of such units 
had receptive fields arranged in a vertical 
sequence on both retinae, and if the preferred 
disparity of the units increased progressively 
as a function of their vertical retinal coordi- 
nate, this aggregation of units would be 
maximally stimulated by a line having a par- 
ticular median-plane tilt. A higher order 
unit on which this set of units converged 
would have a preferred median-plane tilt 
and would serve therefore as the mechanism 
underlying the median-plane tilt aftereffect. 

The distinction between neutralization and 
aftereffect is important for the curvature and 
median-plane tilt aftereffects as well as for 
the frontal-plane tilt aftereffect. During in- 
spection, a curved line appears to become less 
curved (Gibson, 1933), and a line tilted 
from the vertical in the median plane appears 
to approach verticality (Bergman & Gibson, 
1959). In terms of the present analysis,
these two forms of neutralization suggest the 
existence of units whose response rate is 
proportional to the degree of departure from
linearity of a curved line and to the degree
of departure from verticality of a line tilted 
in the median plane. 


CONCLUSIONS 


This paper has proposed explanations of 
various types of aftereffect in terms of the 
adaptation of various types of feature-analyz- 
ing units. Whether such explanations are 
of value has been questioned by Weisstein 
(1969), who has claimed that perceptual 
aftereffects are useless as a paradigm for
studying the characteristics of visual
feature-analyzers. Her two main arguments are that
(a) for the motion aftereffect, at least, a 
heterogeneous collection of units seems to 
underlie the aftereffect, and the aftereffect 
paradigm cannot distinguish among the dif- 
ferent classes of unit; and (b) orientation- 
specific color aftereffects, which were 
assumed to be direct evidence for chromatic 
adaptation of edge detectors (McCollough, 
1965), can also be explained in terms of a 
simpler kind of neural unit, the "dipole" 
(Harris & Gibson, 1968) and so, again, the
aftereffect paradigm fails to give unambiguous
information about neural circuitry.

However, both of these arguments are
irrelevant to the present paper because no
attempt is being made to use aftereffect data
to demonstrate the existence of particular
neural circuits. Instead, where something
is known about the circuitry, the aim is to
show that the properties of aftereffects can
be deduced from the circuits. Where
nothing is known about the circuitry, the
aim is to suggest the characteristics of the
circuits underlying aftereffects. Direct
investigations of these underlying circuits
would then be required if this explanation of
the aftereffect is to be pursued. In neither
instance is it being assumed that aftereffect
data provide a direct route to knowledge of
the underlying neural circuits.

* Since this was written, Fiorentini and Maffei
(1970) have reported that the amplitude of the
evoked potential in humans viewing a pair of
stereoscopically presented gratings is correlated
with the degree of perceived inclination of the
fused image. It could be argued that their
measurements reflect the activity of units whose
response rate is proportional to the departure from
zero tilt of a stimulus tilted in the horizontal
plane. Except for the substitution of the horizontal
for the median plane, these are the units postulated
above. Such units should exist for both types
of tilt since neutralization occurs in both planes.
It would be of the greatest interest to record the
amplitude of these evoked potentials during the
neutralization which would occur during prolonged
inspection.

As far as the median-plane tilt aftereffect
and the curvature aftereffect are concerned,
all that has been done in this paper is to
show that plausible neural units can be
designed which [...] effects. What is required
next [...]. A variety of techniques is available
by means of which such evidence might be gathered.
Direct electrophysiological studies, or
psychophysical studies using such paradigms as
those employed by Gilinsky (1968) or
McCollough (1965), might be of value, but the
most promising approach would appear to
be the evoked-potential technique used by
Campbell and Maffei (1970) and Fiorentini
and Maffei (1970).

As far as the frontal-plane tilt aftereffect
is concerned, it appears that present
physiological knowledge is sufficient to provide an
explanation for this aftereffect, although the
neutralization effect and the possible
importance of gravitational orientation raise
questions which remain to be answered.

PROBLEM OF SERIAL ORDER IN BEHAVIOR IS
NOT RESOLVED BY CONTEXT-SENSITIVE
ASSOCIATIVE MEMORY MODELS¹

TERRY HALWES AND JAMES J. JENKINS

Associative-chain theories have trouble [...]
the elements of behavior are ordered [...]
avoid these difficulties by postulating [...]
which assumes that serial order is [...]
context-sensitive elementary [...]
(a) such theories are not [...]
other facts of speech [...]
support are incorrect, and (c) the [...]
and rejected by Wickelgren is in no
way related to [...] associative theories.

In an article aimed at countering Lashley’s
(1951) paper on the problem of serial
order in behavior, Wickelgren (1969)
proposed to save some form of associative theory
by taking units at a basic, “noncreative,”
elementary motor response level and allowing
them to combine in “context-sensitive
associations.” He illustrated his thesis with the
case of speech production, choosing the
phoneme as the individual unit and the
ordering of phonemes into words as the
relevant problem of serial order. He argued in
general that such models are sufficient to
account for the problem of serial order in all
"noncreative" behavior.

The writers agree with Wickelgren's
contention that there is no extant model that
can pretend to cope with even the most
obvious facts presently known about speech
production or any of the other kinds of
human skills that he discusses. Further, we
wholeheartedly agree that attempts are
des[...] fill this void. Finally, [...]
Wickelgren’s step toward greater [...]
of model he employs in
approaching the difficult problems involved
in dealing with temporal integration.

Having made these approving remarks, we
must criticize Wickelgren’s [...]al formulation
as being severely [...] out its inability to
cope [...] of speech production [...]
it was directed. Specifically, we argue that (a)
[...] misunderstood Lashley's [...]
Wickelgren's distinction [...]
and noncreative behavior [...]rious
misunderst[...] involved in generative
conceptions of behavior; and (c) the models
Wickelgren has [...] are scarcely more powerful
or adequate than the associative models already
on hand.

1 Preparation of this paper was supported in part
by grants to the University of Minnesota, Center
for Research in Human Learning, from the
National Science Foundation (GB-17590), the
National Institute of Child Health and Human
Development (HD-01136), and the Graduate School
of the University of Minnesota. We appreciate
the efforts of James Tweedy and Walter Weimer
in trying to keep us from error. They are, of
course, not to blame for any failure in their task.

* Requests for reprints should be sent to James J.
Jenkins, 400 Ford Hall, University of Minnesota,
Minneapolis, Minnesota [...]

We are not, of course, [...]
propose explicit alternativ[...]

COMMENT ON LASHLEY'S ANALYSIS


Rereading Lashley's (1951) paper sug- 
gests to us that Wickelgren’s first assump- 
tion is a consequence of mistaking a peda- 
gogical device for a theoretical position. 
Lashley’s major intent was to call attention 
to the failure of association theory in the 
hands of both the Structuralists and the Be- 
haviorists. He was convinced by his physio- 
logical and behavioral work that the nervous 
system was dynamic and organized in highly 
complex systems. In his article, he sug- 
gested behavioral examples that would serve 
as convincing indicators of these complexly 
organized systems. As Wickelgren points 
out, Lashley's examples were characterized 
in terms of "unitary acts" which were not 
themselves considered as containing any 
temporal relationships. He used this simpli- 
fication to make the point that the ordering 
of those units was a complex problem
involving at least determining tendencies and
the syntax of the act. We do not believe
that he seriously intended these unitary acts
as a “rock bottom” elementary response
level. At one point in the article, he talks
of the individual movements of the speech
articulators; at another, of the individual
production of phonemes; and at another, of
the activation of individual words.
Throughout the article, it is clear that he was
thinking of schemata and of hierarchies of
organization pervading all skilled acts “even
down to the temporal coordination of
muscular contractions in such a movement as
reaching and grasping [p. 123].” It seems
unlikely to us that he would have been
impressed by an associative analysis of any
arbitrary level (such as the production of
phonemes). We suspect that he would 
have pointed out that this level, too, is not 
composed of simple units, and that the 
whole problem could be restated in terms 
of further “unitary responses" below these 


complex phones. 

CREATIVE AND NONCREATIVE BEHAVIOR


Wickelgren (1969) has proposed that be- 
havior be separated into two broad classes: 


It seems quite useful to distinguish behavior
which is fundamentally creative, though lawful,
such as speech at the level of syntax, from
behavior which is fundamentally noncreative, such
as speech at the level of word pronunciation. ...
We say the same words over and over again.
We rarely say a sentence identical to one
said before. This seems to me, as it has to many
other people concerned with language (Chomsky,
1964; Chomsky & Miller, 1963), to be perhaps the
most fundamental fact about language at the
level of syntax. If one feels this to be such a
fundamental feature of syntax, one should cer-
tainly not expect to account for the serial order 
of phonemes in words in exactly the same manner 
as one accounts for the serial order of words (or 
morphemes) in sentences. Thus, it seems impor- 
tant to distinguish creative and noncreative serially 
ordered behavior in language and in all other 
behavior [p. 2]. 

Two aspects of this must be discussed. 
First, the logic is misleading. Surely no
one intends to account for phonology “in 
exactly the same manner as one accounts 
for the serial order of words.” We confess 
that we do not even know what that might 
mean. But this argument must not prevent 
our considering the possibility that phonol- 
ogy may be as complex as syntax. 

Second, and much more important, this
distinction seems to us to represent a
fundamental misunderstanding of the notion of
“creativity of behavior” as elaborated, for
example, by Chomsky and his colleagues
(Chomsky, 1964; Chomsky & Miller, 1963).
Wickelgren does not provide us with a
statement of what he means by the term
“creative,” but it seems fair from his examples to
infer that this concept is closely linked to
the notion of infinite or potentially infinite
variation. Although this argument has often
been presented to overwhelm simple models
of language production, and although one
often hears the problem of creativity
characterized as the problem of how the organism
makes “infinite use of finite means”
(Chomsky, 1966), this simple characterization,
unless qualified and elaborated, may be very
misleading. Impossibility proofs are easy
ways to defeat alternative models which
cannot produce infinitely many sentences or
sentences of infinite length. But
impossibility proofs do not, in themselves, constitute
the evidence supporting the claim that most
human (and, perhaps, animal) behavior is
fundamentally creative. Such proofs require
assumptions that no one wants to push
very far. No real sentence ever was or ever
will be of unlimited length, either temporally 
or in cardinality of words or phonemes; 
it is equally clear that there is not an
unlimited number of words in a given language
at a given time. The important consideration
is not that behavior is creative because it
comes in (potentially) an infinite number of
different versions but, rather, that any
reasonable theory of sentence production must
recognize that the limitations on the length
and number of sentences produced by
individual speakers are not directly related to
the organization of the cognitive machinery
that produces the language behaviors. (The
behavior of a computer that is turned off
[...]ations due to the periodic stoppage, but
such limitations [...] with limitations [...]
manner of [...] assumptions, most important
of these assumptions is the claim that English
sentences are generated by recursive
application of certain syntactic operations.
The assumed recursive nature of the generator
is essential to the argument that the set
of sentences is large without limit. Thus,
recursivity is a more fundamental
presupposition with respect to the claim of
creativity than is the infinitude of possible
sentences.
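
The difference between recursivity and sheer infinitude can be made concrete with a toy generator. This is a minimal sketch and not the authors' formalism: the single recursive rule, the vocabulary, and the `depth` parameter (standing in for whatever performance limit halts a real speaker) are all assumptions chosen for illustration.

```python
def noun_phrase(depth):
    """NP -> 'the cat' | 'the cat that chased' NP  (one recursive rule)."""
    if depth == 0:
        return ["the", "cat"]
    # The rule re-applies to its own output; nothing in the rule
    # itself limits how many times this can happen.
    return ["the", "cat", "that", "chased"] + noun_phrase(depth - 1)


def sentence(depth):
    """S -> NP 'slept', with `depth` embeddings (an external limit)."""
    return " ".join(noun_phrase(depth) + ["slept"])


print(sentence(0))  # the cat slept
print(sentence(2))  # the cat that chased the cat that chased the cat slept
```

Every run of this generator yields a sentence of finite length, yet the rule system contains no bound on length; the bound comes only from the `depth` handed to it, which is the sense in which limits on actual output need not reflect the organization of the generating machinery.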


We do not become convinced that English
sentences are creative behavior by being
given infinitely [...] being
shown an infinitely long sentence, but by
[...] English [...] be applied [...]
is also an [...] and longer [...].
It is the fact that language [...]y accounting
for this degree [...] rather than the
unlimitedness [...] English sentences, which is the
basis of the claim that language is creative
behavior.

TERRY HALWES AND JAMES J. JENKINS

Recursive mechanisms are, of course,
required to account for most of the kinds of
behavior Wickelgren wants to call noncreative
("animal 'speech' and mimicry, and all
elementary skilled motor movements such as
throwing a ball, running, walking, jumping,
turning one’s head, moving one’s hand from
one location to another, etc. [p. 2]"). No
one would try modeling the free-hand
drawing of a spiral on a piece of paper without
recursively applicable operators like
trigo[...]. Such operators are essen[...]
are in the language case, [...]
wants to give a description of the behavior
itself, let alone the [...] that produce it.
Behaviors [...]plexly structured to an extent
[...] us to consider them as creative
are exceedingly common; indeed we
do not know of any behaviors which are
clearly noncreative, excepting possibly
spinal reflexes.

Although the preceding sentence may be
an overstatement, it appears incontestable
that Wickelgren’s class of noncreative
behaviors must be smaller than he suggests.
If we consider for a moment the behavior of
drawing curves on a piece of paper, as
suggested above, we can see that the set of
such curves is not only large without bound,
but has (with the acceptance of a [...]
assumptions similar t[...]
prove that there [...]
grammatical Engli[...]
greater than [...]

Further, it seems to isolate only a tiny por-
tion of behavior. It may be true that asso- 
ciation theories are useful within that small 
domain, but it is clear that the classes of be- 
havior we are discussing are not included. 


MODELS FOR SPEECH PRODUCTION 

Wickelgren gave four models for speech 
production and examined them in relation to 
the size of the central nervous system and 
the fact that speakers correctly order pho- 
nemes in words. He also claimed to deal 
with the facts of coarticulation phenomena. 
We find the context-sensitive associative 
model that he favors to be unsatisfactory and 
the nonassociative model that he pits against 
it remarkably weak. We will consider these 
briefly. 


Context-Sensitive Associative Model

The problem in an associative model of 
speech is to cope with the stubborn fact that 
each phoneme cannot be used to call out the 
next one in typical associative chain fashion 
because all the phonemes have to be available 
in a huge variety of combinations. Wickel- 
gren's strategy is to avoid this problem by 
creating enough versions of the phonemes to 
make them all distinctive in their various 
contexts. This is no mean feat. It entails
increasing the number of speech production
entities from about 50 to about 1,000,000.
He regards this as acceptable since the
nervous system has more than enough neurons
to accomplish this. It is even hinted that
this move is meritorious because it seems 
probable “that the nervous system generally 
uses a very large number of elements as a 
basic feature of its approach to control sys- 
tems problems [p. 7].” 

In essence, a new speech production entity 
is created for each speech sound in each 
context to capture its dependency on the 
speech units which precede and follow it.3
If, however, coarticulational effects are found
as near as two phonemes away from the
speech unit of interest, the system becomes
absolutely impossible even under the weak
constraints used by Wickelgren; such a
circumstance would require as many unique
speech sound representatives as there are
neurons in the central nervous system
(100 × 100 × 100 × 100 × 100 =
10,000,000,000 separate identifiable units).
Unfortunately for the theory, evidence
suggests that such remote dependencies exist.
For example, Kozhevnikov and Chistovich
(1965, pp. 124 ff.) report that the initial /s/
in /stru/ already shows the influence of the
choice of the vowel which follows three
phonemes later. It seems likely to us that
the more we perform careful fine-grained
analysis of any series of behavior sequences,
the more rapidly we will push to
disastrous limits this strategy of creating new
units to do the associative job. This is as
true at the level of words and lower levels
as it is at the level of syntax.

3 One hundred preceding sounds × 100 units ×
100 succeeding sounds = 1,000,000 unique units.
There are about a hundred basic units because
Wickelgren is forced to treat vowels with different
stress essentially as different phonemes. We will
not stop to contest this strange tactic since our
point here is that the whole argument is vacuous
whatever the number of elementary units.
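
The arithmetic behind these figures generalizes simply: if each of n basic units must be distinguished by the k units on either side of it, the number of context-sensitive units is n^(2k+1). A minimal sketch follows; the function name and its parameters are hypothetical and serve only to restate the counts given above.

```python
def context_sensitive_units(n_basic, span):
    """Count the distinct units needed when each basic unit is tagged
    with `span` neighbours on each side: n_basic ** (2 * span + 1)."""
    return n_basic ** (2 * span + 1)


# One neighbour on each side, about 100 basic units.
print(context_sensitive_units(100, 1))  # 1000000
# Dependencies reaching two units away on each side.
print(context_sensitive_units(100, 2))  # 10000000000
```

Each additional step of context on both sides multiplies the inventory by n squared, so a dependency reaching three phonemes away, as in the /stru/ example, would call for 100^7 units on the same reckoning.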

The situation may be worse than the pre- 
ceding paragraph may suggest. Wickel- 
gren's numerical gymnastics are based on an 
assumption that he knows is tenuous, the 
assumption that one word or one elementary 
motor response equals one neuron. Lashley 
and his successors demonstrated that the 
cortex is not neatly cut up into tiny pieces 
of tissue with specific functions assigned.
Cortical lesions associated with aphasia do
not result in the loss of specific words; they
result in the loss of language functions gen-
erally (Schuell, Jenkins, & Jiménez-Pabon, 
1964). The concept of mass action may be 
only partially correct, but to suppose that 
it is entirely wrong is to ignore the body of 
our understanding of cortical functioning. 

By the above, we do not mean to imply 
merely that each word requires more than 
one neuron for its storage. If current RNA 
coding models of memory are correct, it is
perfectly conceivable that thousands of bits
of information might be stored in a single
neuron. Alternatively, if Hebb's suggestions
regarding cell assemblies and phase
sequences are correct, it is patterning of
activity rather than specific loci that is important.
The point here is that the notion of
associative storage in specific locations and the
[...] “units” is a model that we know
to be impossible. Thus, it cannot be of any
help to us either constructively or
destructively to consider the number of neurons
available in the central nervous system, since
such a tally cannot have any meaningful
relation to any plausible theory.

At this point, we had hoped to give a de- 
tailed account of the failings of Wickelgren's 
model in terms of well-known facts about 
linguistic structure and speech production,
but it rapidly became clear that, on the one 
hand, these failings are so pervasive and, on 
the other hand, the theory is so inexplicit 
that it would be impossible to discuss the 
limitations of each of the possible versions 
of the theory in a paper of this size. The 
alternative we have adopted is to attempt a 
general characterization of the empirical 
limitations of Wickelgren’s model with a few 
important examples.

The empirical inadequacies of Wickel- 
gren’s model are of two kinds, which may be 
characterized as (a) inability to account for 
the systematic relationships between events 
at the phonetic level and events at higher 
linguistic levels and (b) inability to account 
for the systematic interactions among events 
within the phonetic level. While these fail- 
ings are abstractly equivalent, we will deal 
with them separately, in turn.

The context-sensitive associative memory model is unable to account for
orderly syntactic relations among phonological events simply because
this model, being a first-order Markov generator, allows of no syntax,
nor, of course, of any influence of syntax on elementary motor
responses.* Thus, for example, there is no way to account for the
orderly variation of the English plural. The fact that the final
phoneme of the noun stem determines whether the plural will be realized
as /s/, /z/, or /iz/ cannot be represented in this theory. (We feel
that it should be clear to the reader why this is so, and why this is a
critical failure for the theory. Clearly Wickelgren’s theory does not
have laws against regularities, it simply ignores them and along with
them most of the known regularities of its domain; regularities such as
those underlying the ability of children to acquire and employ a
variety of “pig Latins”; the ability to imitate an “accent,” a dialect,
or a lisp; the ability to modify articulatory behavior in order to talk
while performing other acts with the mouth such as eating, chewing gum,
holding a pencil in the mouth, or smoking; the ability to whisper, to
shout, to speak rapidly or slowly; the ability of the ventriloquist to
speak a modified version of the speech code; and so on. We take it as
obvious that a theory does not succeed by being immune to evidence but
by being vulnerable to evidence and surviving.)
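The plural regularity appealed to above can be stated as a rule over
classes of final phonemes rather than as a list of noun pairs. The
following is only a minimal sketch of such a rule and is not part of the
original argument; the function name `plural_allomorph`, the ASCII
phoneme labels, and the two phoneme sets are illustrative assumptions.

```python
# Hypothetical ASCII labels for English phonemes (an assumption of this sketch):
# "sh", "zh", "ch", "j" stand for the sibilant fricatives and affricates.
SIBILANTS = {"s", "z", "sh", "zh", "ch", "j"}
VOICELESS = {"p", "t", "k", "f", "th"}  # voiceless non-sibilant consonants


def plural_allomorph(final_phoneme: str) -> str:
    """Return the regular plural ending selected by a stem's final phoneme."""
    if final_phoneme in SIBILANTS:
        return "iz"   # judge -> judges
    if final_phoneme in VOICELESS:
        return "s"    # cat -> cats
    return "z"        # voiced consonants and vowels: dog -> dogs


if __name__ == "__main__":
    for noun, final in [("cat", "t"), ("dog", "g"), ("judge", "j")]:
        print(noun, "->", "/" + plural_allomorph(final) + "/")
```

A model that stores each context-sensitive unit separately has no place
for the three-way classification that this rule depends on, which is the
point of the criticism.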

€ must take cognizance of the 
failure of the theory to capture phonological regularities means that
such regularities cannot be used to reduce the load on memory in human
performance. Wickelgren’s English cannot contain the abstract morpheme
“plural” as part of a direction to a phonological unit since there is
no way to predict what the plural of “cat” will be; we must look it up,
just as we now must look up the plural of “mouse” on a list of
exceptions.
To imagine using such a language in which there are no regularities and
everything is an “exception” is staggering. It is not sufficient to
remember the noun and apply the rule to get the plural; one must
remember both the singular and the plural of each noun. This is also
true for the past tense (judge-judged: /dʒʌdʒ/-/dʒʌdʒd/; slip-slipped:
/slɪp/-/slɪpt/) and for the present progres-
/rʌn/-/rʌnz/; hop-hops:
in which the regularities are
on voicing. Likewise, the
regularities of English stress assignment (as
elationships among various
noun e.g., telegraphic, telegraphy,
telegraph) would disappear, at least for psychologists.

This inability
the elementary fa

and syntax to
independent that it be-
Suppose that phonology

This independence of
levels is an intersection of two unfortunate
positions—the claim of American Structural 
Linguists that syntax and phonology are in- 
dependent, and the tendency of American 
Associationists toward theories in which 
various levels of a hierarchical structure are 
presumed to operate independently. Com- 
bined in Wickelgren's theory, they give rise 
to a model which systematically ignores 
essentially everything that is known about 
words except the fact that the phonological 
events come in a particular order. 

The model also systematically ignores 
everything known about the influences of 
phonological events on other phonological 
events, Wickelgren's comments about co- 
articulation notwithstanding. For although 
he claims that one evidence of the superior- 
ity of the context-sensitive associative mem- 
ory model is its ability to “account for” con- 
text effects, he proceeds to account for 
merely the fact that context effects occur, 
with no concern at all for the regularities of 
the way in which they occur. In Wickel- 
gren's model, the structure of each context- 
sensitive phoneme has no relationship to the 
structure of any other context-sensitive 
phoneme, because context-sensitive pho- 
nemes have no intrinsic structure. Wickel- 
gren’s model cannot account for the fact that 
voiceless stops /p/, /t/, /k/ lose their 
aspiration when pronounced after initial 
/s/, because there is no room in the model for abstract phonetic
categories like “voiceless,” “stop,” “aspiration,” etc. Coarticulation
is not described as a set of regularities like “when voiced stops
follow a vowel, the vowel is lengthened,” but rather as a list of
unrelated articulating patterns, one for each context-sensitive
phoneme.

Wickelgren hopes somehow to bring the phoneme back into the picture
since he supposes that the “context free phoneme” is to have some role
in learning how to speak, but he holds that such phoneme
representatives need not play a role in control of articulation; nor
are they employed in speech recognition. Thus, it seems to us that he
sacrifices any hope of representing important regularities. He does not
tell us in any clear way how phonemes are to be introduced in the
theory or in behavior or how they can be associatively learned, aside
from an appeal to the notion that the phoneme is an allophone class
based on similarity of articulation (presumably sensed in the nervous
system, but in ways which are not even mentioned).

In brief, it seems to us that theory is being 
developed in the wrong direction. In place 
of depth of theoretical construction, it sub- 
stitutes massive multiplication of allegedly 
basic entities. Not only does this strategy 
violate our intuitions about what is “ele- 
gant,” “direct,” and “plausible” (all Wickel- 
gren’s terms), it also makes it difficult to 
describe known systematic effects in speech 
production. This kind of theory would pre- 
sumably lead investigators away from the 
discovery of further systematic effects by re- 
moving the theoretical reasons for hunting 
such phenomena. It is this very general 
and potentially dangerous aspect of the the- 
ory to which we object most strongly. It 
should be clear to the reader who is familiar 
with Wickelgren’s paper that the above 
comments apply as well in substance to the 
two other associative theories that he considers.


Nonassociative Theory 

The nonassociative theory suggested by Wickelgren is unbelievably weak.
Essentially, he postulates a serial memory which is more or less
directly connected to a simply conceived motor control system. Serial
order is made possible by explicit motor commands which are stored in
just the order they are to appear in the motor output. The
“nonassociative memory” is scanned serially and the appropriate pieces
of behavior are emitted serially.

No “cognitive” psychologist would propose a model of this sort or would
entertain one such as this for more than a few moments. The fundamental
difficulty with this model, as with all of the models that Wickelgren
considers, is that it is a memory model.
(By “memory model” we mean any model 
"which the stru 
virtue of the way in which the 
otor command sequences are re- 
is no reason to believe that any memory model is capable of doing the
job.

We feel that underlying this point is a very broad issue that often
becomes obscured in the debates about details of models. That issue
concerns the general acceptance of association theory which, as Lashley
pointed out, provided the foundation for such disparate paradigms as
structuralism and behaviorism and almost completely dominates American
psychology today. Psychologists fail to consider productive models of
behavior because they have accepted the assumption of empiricism and
associationism that all, or nearly all, of the complex behaviors are
learned behaviors. We tend to believe that the environment has provided
us with exemplars and experiences from which we have derived response
repertoires keyed to appropriate stimuli, which we now carry about with
us in our memories. On this view, there are three main processes of
interest: learning (acquisition), retention (storage), and recall
(retrieval). Since we assume that speech is learned, the appropriate
questions become: In what form have we stored the speech elements? and
How do we gain access to them for retrieval? (In a more general sense,
an associative theory of a given type of behavior is essentially a
theory of the part of memory which is devoted to that type of
behavior.) Thus, Wickelgren can exhaust the set of obvious models for
speech production (and many other serially ordered behaviors) by
considering four memory models.

The alternative that we support is that responses are not “looked up”
in a vast rote memory and emitted serially but, rather, are generated
by a complex processor operating on abstract entities and emitted by a
very complicated series of transformations, transformations that by
their very nature cannot even be considered in an associative theory.5
We believe that Wickelgren's nonassociative memory model will not work
for the same reason that his other models will fail: because they are
memory models that attempt to capture an essentially creative behavior
with a necessarily uncreative (in the technical sense) theory.

Wickelgren is apparently ready to concede (for the moment at least)
that syntax is creative and nonassociative. His failure to extend the
same argument to speech production is attributable, perhaps, to a
misunderstanding of the nature of productive theories. He says it would
be “absurd to claim . . . that the full apparatus of a generative
grammar is necessary for serial order in noncreative behavior such as
the pronunciation of a single word [p. 2].”6 It would be absurd, but
not for the reason Wickelgren has in mind. It would be absurd because
fundamentally the formal nature of a generative grammar is in essence
not the formal nature of a psychological theory. No grammar could be a
psychological theory, let alone a correct one. A grammar is a theory of
the structure of behavior. A psychological theory is a theory about the
nature and functioning of the cognitive “machinery” which produces
behavior. But the generative grammar indicates the magnitude of
complexity that a correct model of the behavior will have to embrace.

How complicated is the production of a single word? At the phonemic
level, the word can be represented as a matter of some small amount of
information; but at the motor command level, it is represented by a
much larger amount. It is reasonable to suppose that the brain takes
advantage of the compact and efficient coding offered by some level
approximating that of the phoneme. And it also seems reasonable to us
to suppose that the brain does a good deal of intricate and productive
processing in converting this simple phonemic representation into the
vastly more complex motor commands. It is not at all absurd to suppose
that an adequate theoretical description of this processing will be at
least as intricate as a generative grammar. In this, we merely echo
Mattingly and Liberman (1969) who considered this question and
concluded:

From the formal point of view, then, there is good reason to regard the
acoustic signal as another linguistic level, and the conversion from
the phonetic message to the acoustic signal as a process comparable, in
an important sense, to the conversions at higher linguistic levels
[p. 107].

6 By “generative grammar” we assume that Wickelgren means a
transformational (or some relatively powerful) type, since it appears
to be relatively easy to show that the context-sensitive associative
memory has the same level of formal power as a right linear
(generative) grammar. (His prefix “context sensitive” is unfortunately
very misleading in this regard.)


For nonassociative productive models, the problem of serial order is
not a serious one. There are myriads of possible mechanisms with
sufficient power to produce sequentially ordered outputs. The problem
that confronts the cognitive psychologist is, rather, to select from
among the many possible mechanisms one which is a reasonable model of
the human cognitive apparatus. It would be pretentious for us to offer
any kind of specification here. What we urge is that interested
psychologists devote their attention to studying the production and
perception of speech with the objective of discovering the rules that
govern the phenomena and inventing the theoretical apparatus needed to
make substantive advances rather than attempting to build models of a
very restricted and generatively weak class simply because they are
sanctified by tradition.


CONCLUSION 


Lashley (1951) knew that the nervous system did not operate on the
basis of one-neuron-to-one-function and that simple associationism was
unsupported by what he knew of the brain’s activity. He illustrated his
position behaviorally by presenting sets of serially organized behavior
that elementary associative theories could not explain. Wickelgren
(1969) has attempted a rejoinder to Lashley by postulating a class of
noncreative behaviors, of which speech production, at the level of the
phoneme, is an instance. He proposes a context-sensitive associative
model to account for serial ordering in behaviors of this class.

The present paper objects to the distinction between creative and
noncreative behavior on the grounds that Wickelgren's distinction
seriously misinterprets the creativity notion. Beyond that point we
have objected to theories of speech production based on the associative
tradition, pointing out that Wickelgren's model has certain serious
disadvantages and that memory models in general (an inheritance from
the Associationists) will prove insufficient. It was urged that we let
the phenomena of behavior and neurology restrict our classes of models
rather than allowing them to be dominated by tradition.
State University of New York at Stony Brook


Hypothesis theory generates a few 


to reach criterion in discrimination-le: 
should match behavior on tasks which are intrinsic. 
wise comparable to the normal tasks. : 
quences, performance on both the unsolved and insolu| 


be below the chance level. 


A college student appears at the laboratory and is given a simple
learning task. He is instructed to respond to each of a series of
stimuli with any one of a limited set of responses. After some
(preselected) response the experimenter (E) says “right,” after the
others he says “wrong.” The well-known experimental result is that the
selected response increases in relative frequency.3 For more than half
a century this finding has been so ubiquitous that it has been twice
codified,

1 The experiments described here and the prepa- 
ration of this article were supported by Research 
Grant MH 11857 from the National Institute of 
Mental Health. I wish to express my appreciation 
to William Glassman who performed the first ex- 
periment and to O. Ernest Andrews who super- 
vised the last set of experiments. 

2 Requests for reprints should be sent to Marvin Levine, Department of
Psychology, State University of New York at Stony Brook, Stony Brook,
New York 11790.

3 More typically, E says “right” only to certain responses in the
presence of certain stimuli. For example, E may say “right” to S's
response “alpha” only if triangles are part of the stimulus complex,
otherwise he says “right” only to the response “beta.” While it is
always more accurate to speak of E's feedback as contingent on certain
responses in the presence of certain stimuli, the italicized phrase
will generally be omitted. It must, however, be understood as given
throughout.
theorems about adult humans
arning tasks. 


Second, 


nonlearning by 
as long as S is sampling from a 
cking the solution) he will show 


in this circum 


of 
se results to ran- 
and the Einstellung phe- 


as the Law of Effect and as the Reinforcement Principle.

The phenomenon, however, is not absolutely reliable. There are at least
two classes of results indicating that learning does not always take
place. The first comes from studies concerned with the role of
awareness in learning. Several authors (e.g., Dulaney, 1961;
Spielberger & DeNike, 1966) have identified subsets of subjects (Ss)
showing no change in response probability throughout the task. The
second comes from investigations of transfer of learning from a first
to a second discrimination problem (e.g., the reversal-shift
procedure). The authors typically report the elimination of several Ss
who, after an inordinately long number of trials, still had not learned
the first problem. It appears, then, that nonlearning certainly occurs.
The experiments below will demonstrate that it occurs even with
well-motivated Ss for whom all the relevant information is available
and unambiguous. The purpose of this article, however, is not merely to
substantiate this finding, but to present a theory which accounts for
this and related results.

Hypothesis (H) theory has been vigorously developed in recent years
(for reviews see Levine, 1969; Trabasso & Bower, 1968). The work
suggests that in discrimination- or concept-learning laboratory tasks,
adult human behavior is the result of H testing. Novel analyses
(backward learning curves; blank-trial probes) imply that S has a set
of Hs from which he repeatedly samples until he selects the correct H.
Some detailed specifications of the theory will be given below, along
with a review of the data on which the specifications are based.
However, it will be useful first to have a clear description of the
initial task to which the theory will be applied.

Several 12-trial multidimensional simultaneous-discrimination problems
were constructed. Each trial was initiated by the display of two
stimulus patterns (cf. Figure 1). One pattern consisted of a set of
values from each of eight two-valued dimensions; the other pattern
consisted of the complementary values of the dimensions. From trial to
trial the dimension values shifted from one pattern to the other so
that on some trial the T might be large, surrounded by one solid
circle, etc. The shifting of values from trial to trial followed these
rules: (a) on each pair of adjacent trials, values from four dimensions
remained paired together (i.e., were on the same side for both trials)
while the values from the other four dimensions were changed (shifted
sides from one trial to the next); (b) no two dimension values stayed
paired with each other for more than three consecutive trials (e.g.,
A's could not be large more than three times in a row, circles could
not be paired with the solid underline more than three times in a row,
etc.); (c) for any dimension value there was one and only one other
value paired for the three trials.

The experimental problems were basically of two types: 12-trial
8-dimensional (8-D) problems with legitimate solutions and 12-trial 7-D
insoluble problems. The latter were formed by selecting the solution
from 8-D stimuli and by then eliminating the relevant dimension.
Suppose, for example, that (before the experiment began) “single spot”
was selected to be the solution. The E noted on which side the single
spot occurred on each of the 12 trials; choice of this side would be
correct. He then made 12 new stimuli, identical to the old except that
the spots were omitted from both of the stimulus patterns. The S
received eight such insoluble problems distributed throughout 24
legitimate problems.

Fig. 1. A simultaneous-discrimination stimulus constructed from eight
dimensions (letter, A-T; letter size, large-small; letter color,
black-white; underline, solid-dashed; border shape, circle-square;
border number, one-two; border texture, solid-dashed; spots, one-two).

The Ss were first shown examples of the stimuli and were carefully
instructed about the eight dimensions. They were told that they were to
select one of the two patterns and that after each choice E would
indicate the correctness of that response. They were further told that
the solution (i.e., the basis for responding correctly) on any one
problem was one of the 16 attribute values (8 dimensions × 2 values per
dimension) and that they were to try to be correct as often as
possible. The S was then given several practice problems (both 7-D and
8-D, all soluble) followed by the 32 (24 soluble, 8 insoluble)
experimental problems.4

In the results, three types of problems can be characterized:
legitimate 8-D problems which are solved (defined as having correct
responses on Trials 10, 11, and 12), which will hereafter be ignored;
legitimate 8-D problems which are unsolved (defined as having an error
on one of Trials 10, 11, or 12); and insoluble 7-D problems, which, of
course, S cannot solve. It is the comparison of performance on the
latter two types of problems which is of interest. Before presenting
this comparison, however, the theory will first be reviewed.
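The three-way classification just described can be written out
compactly. This is a small illustrative sketch only; the function name
`classify_problem` and the representation of a protocol as a list of 12
booleans are assumptions made here, not part of the reported procedure.

```python
def classify_problem(soluble, outcomes):
    """Classify one 12-trial problem.

    soluble  -- True for a legitimate 8-D problem, False for an insoluble 7-D one
    outcomes -- list of 12 booleans, True where S's response was correct
    """
    if not soluble:
        return "insoluble"
    # Criterion stated in the text: correct responses on Trials 10, 11, and 12.
    return "solved" if all(outcomes[9:12]) else "unsolved"


if __name__ == "__main__":
    print(classify_problem(True, [False] * 9 + [True] * 3))
    print(classify_problem(True, [True] * 11 + [False]))
    print(classify_problem(False, [True] * 12))
```

Only the “unsolved” and “insoluble” protocols enter the comparison of
interest.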
4 For a more complete review of this experiment, see Glassman (1970).




THE THEORY OF INSOLUBLE
(AND UNSOLVED) PROBLEMS

The necessary assumptions are:

1. The S selects an H from some universe and responds on the basis of
that H. For example, S may predict that circles are the basis for
choosing. He would then choose the side containing the circle. This is
a primitive axiom, although support for the second part, that the H
directs the response, is found by inserting blocks of blank (i.e., no
feedback) trials into normal problems. Frankel, Levine, and Karpf
(1970) have shown that Ss virtually always respond to a single
attribute value for as many as 30 consecutive blank trials.

2. The composition assumption: The universe from which S samples
contains only those Hs corresponding to the stimulus levels. The S
predicts that “large” or “T,” etc., is correct. He does not employ Hs

concomitantly, entails different pretraining). The important feature of
the present assumption is that for insoluble 7-D problems, S's universe
consists of 14 Hs.

3. Presolution responding: Prior to solv- 
ing the problem (as manifested by 
of successive correct responses) 
neither the correct H nor the 
the same dimension, i 
is the solution, before 


Xperi- 
and Stein- 
S's H after each trial, showed that prior to solving, Hs from the
relevant dimension virtually never appeared. The important feature of
this assumption is that during unsolved 8-D problems, S’s univers

congruity between the last two assumptions: During both insoluble 7-D
and unsolved 8-D problems, the S is sampling from a universe of 14 Hs.
In both

the two conditions should be similar. This, then, is the first
prediction of the model, that the trial-by-trial behavior should be the
same during both an insoluble 7-D problem and an unsolved 8-D problem.

The next two assumptions will permit further refinement of this
prediction and some specification of the details of that trial-by-trial
behavior.

4. Affirmative feedback: If S's response is followed by a positive
outcome (e.g., E says “correct”), he keeps his H for the next trial.
Probing for the H after each trial, Levine (1969) showed that, with the
present procedure, this assumption almost always held (p ≈ .95).

5. Negative feedback: If S’s response is followed by a negative outcome
(e.g., E says “wrong”), he abandons his last H and resamples. Levine
(1969) showed that this assumption also virtually always held (px 9).
Several proposals have been made concerning S's mode of resampling
following a “wrong.” A few of these will be described here. All of
these alternatives can

a memory continuum de-

in effect, return this H to the set and would resample from the full
universe of 14 Hs. This assumption, first made by Restle (1962), has
been shown to be generally false (Levine, 1966; Restle & Emmerich,
1966). It is included here, however, as one end of the memory
continuum.

i 8 to the stimulus pattern
s chose and might recognize that
disqualified by the negative feedback. He resamples, in effect, from
the seven Hs consistent with the information on the trial just past.
Also, he does not utilize information from earlier trials. Gregg and
Simon (1967) dubbed this the “local consistency” rule. Trabasso and
Bower (1968) have employed this assumption.

(c) Reject 11: The S might sample from Hs consistent not only with the
just-past trial but with the preceding trial as well. With the stimulus
sequences of the present experiment, this always means that S ignores
11 Hs when resampling after a “wrong.” This two-trial consistency check
is part of an assumption proposed by Trabasso and Bower (1966).

(d) Reject all: The S might remember more than just two previous trials
and could, therefore, eliminate more incorrect Hs. At the extreme, his
memory would be perfect and he would always sample among logically
correct Hs. This corresponds to the performance of focusers as
described by Bruner, Goodnow, and Austin (1956, pp. 129-134). This
poses an interesting psychological problem in the present context. The
stimulus sequences are such that by the fourth trial of a problem all
14 Hs are disconfirmed. The focuser facing an insoluble problem has an
empty universe after Trial 4. Some suggestions concerning the behavior
of S in this circumstance will be made later.
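The sampling assumptions above lend themselves to a short simulation.
What follows is a minimal sketch under stated assumptions, not the
procedure used to derive the published curves: the names `universe`,
`side_of`, `consistent`, and `run_problem`, the coding of a trial as a
tuple of sides, and the use of a seeded random generator are all
introduced here for illustration. Only variants (a) and (b) are
implemented, and the actual stimulus sequences of the experiment are not
reproduced, so the sketch does not regenerate the predicted curves.

```python
import random


def universe(n_dims):
    """All Hs: one per attribute value, written as (dimension, value) pairs."""
    return [(d, v) for d in range(n_dims) for v in (0, 1)]


def side_of(h, layout):
    """Side (0 or 1) on which the attribute value named by h appears.

    layout[d] is the side showing value 0 of dimension d; value 1 of that
    dimension is on the other side, since the two patterns are complementary.
    """
    d, v = h
    return layout[d] if v == 0 else 1 - layout[d]


def consistent(hs, layout, correct_side):
    """Hs that point to the correct side on a single trial."""
    return [h for h in hs if side_of(h, layout) == correct_side]


def run_problem(n_dims, layouts, correct_sides, rule, seed=0):
    """Simulate one problem; return a list of booleans (True = "right")."""
    rng = random.Random(seed)
    hs = universe(n_dims)
    h = rng.choice(hs)                      # initial sample from the universe
    outcomes = []
    for layout, correct in zip(layouts, correct_sides):
        right = side_of(h, layout) == correct
        outcomes.append(right)
        if right:
            continue                        # Assumption 4: keep H after "right"
        if rule == "reject_zero":           # variant (a): full universe again
            pool = hs
        elif rule == "local_consistency":   # variant (b): just-past trial only
            pool = consistent(hs, layout, correct)
        else:
            raise ValueError(f"unknown rule: {rule}")
        h = rng.choice(pool)                # Assumption 5: resample after "wrong"
    return outcomes
```

With seven dimensions the universe has 14 Hs, and exactly 7 of them are
consistent with any single trial, which is the count given for the local
consistency rule. In an insoluble problem the entries of `correct_sides`
would be fixed in advance from the omitted dimension, as in the
construction of the 7-D problems.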

This is all the theory necessary to derive additional properties of S’s
behavior during insoluble and unsolved problems. The prediction was
already stated that during both types of problems, the protocols should
match each other. Further predictions come from applying the theory to
the sequence of stimuli employed. Within a problem, the stimuli, it
will be recalled, were selected not completely randomly, but to fit
stringent counterbalancing requirements. Because this counterbalancing
held for every problem, it is possible to derive a predicted
performance curve when the response rules described in the theory are
applied to these stimulus sequences. In general, the predicted
performance curve depends on the memory assumption held for a
resampling S (i.e., upon variants a-d of Assumption 5). The family of
memory assumptions generates a corresponding family of predicted
curves. Figure 2 shows the curves predicted using each of the four
variants. The most striking feature of the predictions is that for all
the variants of the memory assumption, the theory predicts that during
both insoluble and unsolved problems, performance will be below chance.
The theory, then, predicts that not only will the curves from both
types of problems be the same, but that for both, fewer than half of
the responses will be correct. Furthermore, correct responding will
decrease during the first four trials and will then show an upswing.
Again, these predictions hold regardless of S’s memory, that is, they
hold for all the curves in Figure 2. The details of performance
following Trial 4 are currently a matter for some speculation and will
be discussed later.

Fig. 2. The family of theoretical curves, each of which corresponds to
the particular memory assumption added to the theory.

The data comparing 12-trial insoluble and unsolved problems are shown
in Figure 3. It is seen there that the predictions are verified
precisely. The curves show below-chance responding and the initial
decrease. Most importantly, the curves are closely matched: At no trial
do the pairs of points differ at the .01 level; at only one trial do
they differ at the .05 level. This similarity holds despite the fact
that the .05 confidence interval (indicated by the shaded band in
Figure 3) is fairly narrow. The conclusion is compelling, therefore,
that the relevant dimension is not being sampled by an S when he is
faced with a soluble problem which he is not solving.

5 A representative derivation of this type of theoretical curve may be
found in Levine, Yoder, Kleinberg, and Rosenberg (1968).
» first three trials. By the nature of the Dag
Ep ulus constraints, the probability that this : 
will lead to a correct choice on l'rial 4 j 
zero. "The perfect Processor selects this " 
" after Trial 3, is told "wrong" at Trial 4, anc 
FA then has an empty sei. Tf S checks ren 
through the set at this point, one should see 
x a rise in the Proportion correct at l'rial " 
followed by another minimum a few trials 

30 


TRA i curves in 
dd LIE b later. The Cyclic character of the ug chi 
Figure 3 Suggests that this resurvey of the 

Fic. 3. The trial-by-trial Proportion correct dur- N 


periment by Erickson (1968), 

Tue EwPry Ser four-dimensional stimuli, Erickson pre- 

The S who does not solve a soluble prob- Sented blocks of three blank trials ped 
lem was described above with a general < trial as a technique for ps y? 
conception: He samples from a finite set SH. H disconfirmed the ig Ae 
of Hs (14, in the Particular case consid- * response to the firs 
i as wrong, Both Erickson 
and Levine (1966, 1969) (also cf. 
i » above) showed that Ss do not 
i © the set, where 

it may be resampled, The H, then, is out 

tion is intertwined with a question raised i ickson’s experi- 
ment was that this Æ, for all remaining 

ory for disqualified Hs is good enough, his trials, was made the Correct JJ. Thus, an 
set, which contains only incorrect Hs, will might select the large stimulus for the 
first four trials. After the fourth trial (i.e., 


the first feedback trial) the Æ says “wrong,” 
© wrong? Con- hoice of the large sti i 


rect response for all remainin 

for his car keys. He thinks "Perhaps T ditions of the experiment faci 

left them in one of my jacket pockets.” He tion Processing: not only w 

i number imensions, bu 

£ stayed on when feedback wa 

stitute his H set), Suppose a rapid check lere Was a long 
of all the pockets reveals no ae sound 

keys (the set goes to Zero). : 

bili s [e he thinks “Did T sk 


Ondition, in which 

pockets ? Perhaps I should have checked feedback throughout the 

each pocket more carefully." That is one whole problem the mean number of feedback 

Possibility is that he goes through the set i S 41. Tithe experi- 
i E . Mental condition (with the first-trig] rever- 

experiment just reported (see Figure 3) is 

in the cyclic Pattern during the insoluble 


g trials.6 Con- 
litated informa- 
as there a small 
it the stimulus 
S presented, and 
intertrial interval (cf. 


In fact, 


A the Problem, They 
0 required on the average only 23 feedback 
Problems. A perfect processor starts with eo, 
the full set of Hs at the first trial and re- be a loose Paralle] to Erickson’s procedure. 
Uces the set until, by the third trial, he has © in fact, Used a 


cedure. Th descr; ce ICCOSSive discrimination pro- 
` ? descripti ^ "i ms 
Tything but the one /7 corre. ptio; Correctly conveys the rele 
i absent) correct H for the 


er, Without laboring over the 
es, 
trials more than the control condition, approximately the result one
might expect if Ss typically reviewed the H set a second time. One
reaction, then, when a set goes to zero is to go through it again.

For another reaction let us return to our friend bereft of his car
keys. Suppose that he again goes through his jacket pockets for the
keys, this time more systematically (i.e., in such a way as to retain
in memory that every H was tried and rejected). The set is again empty.
What other alternative is open to him? He can leave this set of Hs and
look for a new one. He can do this by scanning his memory (“What was I
doing the last time I saw those keys?”) or the present situation
(“Could they be among that clutter of things on my desk?”). That new
set may be fairly large (“Maybe they're in one of my pants pockets”) or
be very small, consisting of virtually only the correct H (“Of course!
I was wearing my raincoat yesterday; they must be in my raincoat
pocket”). A parallel within the experimental situation would occur if
S, convinced that he is overlooking something, thinks back to E's
instructions, in an effort to retrieve some possibly forgotten
dimensions.

Another reaction, then, when a set goes to zero is to
abandon this set and to look for a new one. It is proposed
that this is an important process underlying the shift to
solution behavior. The description, while it is probably
in keeping with experience, is frankly speculative.
Nevertheless, it is sufficiently important to the learning
process that it warrants the status of an assumption,
along with the five assumptions presented above.

6. Changing of sets (or subsets): The reduction of a set
to zero Hs serves as a signal to select a new set. As
indicated above, this search for a new set may follow one
or more reviews of the old set.
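
The reduction of a set to zero Hs, followed by a change of set, can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the formal model: each H is treated as a rule that predicts the correct side, S is assumed to be a perfect processor who discards every H contradicted by feedback, candidate sets are tried in a fixed order, and reviews of the old set are omitted. The names `always`, `choose`, and `run_problem` are hypothetical.

```python
from typing import Callable, Dict, List, Tuple

Stimulus = Dict[str, str]               # maps "left"/"right" to "large"/"small"
Hypothesis = Callable[[Stimulus], str]  # predicts which side is correct


def always(side: str) -> Hypothesis:
    """Position H: the same side is always correct."""
    def h(stimulus: Stimulus) -> str:
        return side
    h.__name__ = f"always_{side}"
    return h


def choose(size: str) -> Hypothesis:
    """Stimulus H: the side showing the given size is correct."""
    def h(stimulus: Stimulus) -> str:
        return "left" if stimulus["left"] == size else "right"
    h.__name__ = f"choose_{size}"
    return h


def run_problem(h_sets: List[List[Hypothesis]],
                trials: List[Tuple[Stimulus, str]]) -> Tuple[int, List[str]]:
    """Return the index of the set in use after the last trial
    and the names of the Hs that survive in it."""
    set_index = 0
    current = list(h_sets[set_index])
    for stimulus, correct_side in trials:
        # Feedback eliminates every H that predicted the wrong side.
        current = [h for h in current if h(stimulus) == correct_side]
        # Assumption 6: a set reduced to zero Hs signals a change of set.
        if not current and set_index + 1 < len(h_sets):
            set_index += 1
            current = list(h_sets[set_index])
    return set_index, [h.__name__ for h in current]


large_left = {"left": "large", "right": "small"}
large_right = {"left": "small", "right": "large"}
trials = [(large_left, "left"), (large_right, "right"), (large_left, "left")]
position_set = [always("left"), always("right")]
size_set = [choose("large"), choose("small")]

final_set, survivors = run_problem([position_set, size_set], trials)
print(final_set, survivors)
```

In this toy run the position set empties on the second trial, S moves to the size set, and the third trial leaves only the "choose large" H. If no further set is available, the function simply returns an empty list of survivors.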


THE INFINITE SET
A mechanism was just provided permitting an S who starts
a problem with the wrong H set to eventually solve the
problem. That set will go to zero and S, perhaps after one
or more reviews of the set, will look for a new one.
Suppose the initial incorrect set can never go to zero?
Suppose, that is, that S is sampling from a set which is
infinitely large yet which does not contain the correct H?
According to the theory above, S will never abandon that
incorrect set. Only Assumption 6 allows for a change of H
set, but it requires that the incorrect set become empty.
An S, therefore, sampling from an incorrect but infinite
set, will never solve the problem no matter how easy it
is, that is, no matter how obvious the S-R and feedback
contingencies are.

This implication was first explored by Ress (1965; for a
summary see Ress & Levine, 1966). To realize the
simplicity of the task employed, consider first the
control group: Thirty Ss each received a two-dimensional
simultaneous discrimination. They went through a deck
containing 115 cards, each card showing one large (2.5
centimeter) and one small (1.3 centimeter) circle. In
typical fashion, for half of the cards, the large circle
was (randomly) located on the right (or left) side. The S
was instructed that he should touch the center of one of
the two circles on each card, that E would say "right" or
"wrong" after each response, and that he (S) was to try to
be right as often as possible. The E said "right" whenever
S touched the large circle, "wrong" whenever the small. In
short, it was formally identical to the discrimination
problem one gives to monkeys in the Wisconsin General Test
Apparatus or to rats in the Lashley jumping stand. It is
not surprising that these adult humans solved the problem
in a mean of three trials.

The experimental Ss received the same task. This was
preceded, however, by a series of experiences intended to
start the Ss sampling from an infinite and incorrect H
set. Sixty Ss each received a series of preliminary
problems using decks identical (except for shuffling) to
the one for the main problem. The solution to these
preliminary problems was always a position sequence. The
sequences 2L-2R (two left-side responses followed by two
right-side responses), 3L-2R, 1L-1R, 5L-1R, 2L-4R, and
3L-3R were presented.7 The E announced the correct
solution after each of these problems. It was assumed that
this series of preliminary problems would fulfill the
composition assumption that S was sampling from an
infinite set containing Hs dealing only with position
sequences.
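
To make the notation concrete, the sketch below expands a rule such as 3L-2R into the correct side on each trial. It assumes, as the notation suggests but the text does not state outright, that the cycle simply repeats through the deck. The function name `position_sequence` is hypothetical.

```python
import re
from itertools import cycle, islice
from typing import List


def position_sequence(rule: str, n_trials: int) -> List[str]:
    """Expand a rule such as '3L-2R' into the correct side for each trial.

    Assumption: the cycle named by the rule repeats without interruption.
    """
    one_cycle: List[str] = []
    for count, side in re.findall(r"(\d+)([LR])", rule):
        one_cycle.extend([side] * int(count))
    if not one_cycle:
        raise ValueError(f"unrecognized rule: {rule!r}")
    return list(islice(cycle(one_cycle), n_trials))


print("".join(position_sequence("2L-2R", 10)))  # LLRRLLRRLL
print("".join(position_sequence("3L-2R", 10)))  # LLLRRLLLRR
```

Every rule of this form yields a distinct H, and the number of such rules has no upper limit, which is the sense in which the set of position-sequence Hs can be regarded as infinite.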
After the preliminary problems, E announced, as he had
prior to each of the preliminaries, "Here is another
problem" and presented S the reshuffled deck. This time,
of course, the correct H was "touch large," an H not in
the set of position sequences. The Ss were run until they
either gave 15 consecutive correct responses or made an

The results were that


this problem. Twenty-eight of the 
not solve it in 115 trials, 
The Ss, incidentally, 
vated to solve these prob 
back ("right" and 
effective reinforcers, 


trials on the sixth, 


At first glance it appears that the theory
is strongly confirmed. The operations em-


an average of 62 
tion any self-respecting monkey | 
few trials, 
from 


problem searching for Sequence solutions, 
The results, however, do contain a puzzle for the theory.
According to the theory, Ss should not solve at all. If
they are sampling from an infinite (and incorrect) set,
from a set which does not become empty, then they should
never select a new set. Yet more than half of the Ss
(32/60) solved the problem, that is, somehow selected a
new set containing the correct H.

7 Two other variables were manipulated: The Ss had either
three or six preliminary problems; the sizes of the
circles differed (2.5 vs. 1.3 centimeters and 2.5 vs. 2.0
centimeters). These variables showed no significant
effects. The data are combined

The puzzle can be resolved in one of two ways: (a) One may
look for an additional assumption, postulating some other
process by which an H outside the set can be sampled; or
(b) one may question whether the composition assumption
was fulfilled, whether the preliminary problems really did
orient Ss to sample from an infinite set of

A resolution to this puzzle, favoring the second
alternative, was suggested in an analysis of sequence
structures by Restle (1967). He noted that bi-
$ could code 
called mandatory rules. More complex sequences, involving
several changes in (binary) state within a cycle, required
what he called optional rules. His analysis sug-

ne trained the Ss with only a subset of sequences, those
having short cycles with a single shift within the cycle.
This implies that at the final problem the Ss might have
held not an infinite set of all possible sequences, but a

and simple sequences.


s Some 
Ss might then try a different incorrect H 


(e.g. 


has Something to do with the 


Was that Ress and 
Obtain idea] results because 


experiment e 
plex sequences. This, accordingly, was
done. Two groups of 20 Ss each received
six preliminary sequence problems followed
The simple group 


; the complex 
arying complexity, 


Except for this variation between the two groups, the
task was identical to that employed by Ress and Levine.
[...] 20 Ss in the simple group [...] the final problem,
thereby replicating the earlier results; in the complex
group, [...]. The difference is significant (χ² = 8.44,
[...]). This experiment, then, suggests that complex
sequences produce more complete fulfillment of the
composition assumption. Also, this dramatic demonstration
of Ss failing to learn the most obvious contingencies
further validates the H theory described above as a
description of the learning (and nonlearning) process.
Before considering the results of the complex group in
greater detail, an additional experimental variation will
be described. In the experiments above, S touched one of
two circles. One of these circles (e.g., the larger) was
the stimulus entering into the contingencies. One might
propose an extremely simple reason why S failed to learn:
he did not look at these circles. The S would, by this
line of reasoning, touch the right or left side of the
card without seeing the circles. To put it another way,
Spence's (1945) criterion, that an observing response must
occur before the data can be considered relevant to any
learning theory, may not have been met.
A similar experiment, therefore, was undertaken. The
primary change from the preceding experiments was the
replacement of the two circles on each card by two
letters, A and B. Also, every S received the six
preliminary sequences employed with the complex group,
above. Two groups of 16 Ss each were run. The touching
group, as before, was told to touch one of the two
letters. The speaking group was told to indicate their
choice by saying one of the two letters. Note two features
about the speaking group during the final problem: First,
there is no way that they can avoid seeing the stimuli;
second, the contingencies are even more simplified.
Instead of discrimination learning, this is now the
reinforcement paradigm in its simplest form: Whenever S
says "A" E says "right"; whenever S says "B" E says
"wrong."

There was essentially no difference between the two
groups. Four Ss in the touching group and three Ss in the
speaking group solved the problem. This result, along with
the observation that Ss in the touching group virtually
always directed their gaze toward the letters, eliminates
any concern that the nonlearning was an artifact of
nonstimulation.

Three groups, then (the complex, touching, and speaking),
were run in which the composition assumption appeared to
be fulfilled. Of a total of 52 Ss, 81% (42 out of 52) did
not reach the criterion of 15 successive correct responses
within 115 trials. It is possible, of course, that these
42 Ss, though they failed this fairly extreme criterion,
may nevertheless have shown some response strengthening.
There is, however, no indication of anything but chance
occurrence of the correct response during the 100 trials
of the problem. On Trials 91-100, for example, 53% of the
responses were correct. Despite a large N (42 Ss x 10
responses per S), this is not significantly different from
50%. It appears clear that these Ss had not switched to
the set containing the correct H.
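
The comparison of 53% with chance can be checked with a short calculation. The text does not say which test was used; the sketch below assumes a normal approximation to the binomial and treats the 420 responses as independent, which is itself an assumption because each S contributed 10 of them.

```python
from math import erfc, sqrt

n = 42 * 10          # 42 Ss x 10 responses per S, as reported
p_observed = 0.53    # proportion correct on Trials 91-100, as reported
p_chance = 0.50

# Standard error of a proportion under the chance hypothesis.
standard_error = sqrt(p_chance * (1 - p_chance) / n)
z = (p_observed - p_chance) / standard_error
p_two_tailed = erfc(abs(z) / sqrt(2))  # two-tailed normal probability

print(f"z = {z:.2f}, two-tailed p = {p_two_tailed:.2f}")
```

Under these assumptions z comes to about 1.23, short of the 1.96 needed for significance at the .05 level, in agreement with the statement in the text.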

The remaining 10 Ss did solve the problem. Why these Ss
might have been exceptional is suggested by comments made
by two of them. One said that the whole experiment had
reminded her of Luchins' water-jug experiment, which had
recently been reviewed in her course; the other said that
he had been looking for a trick: "psychologists always
trick you." These comments imply that the preliminary
problems did not always succeed in restricting S's H set
to position-sequence Hs.


DISCUSSION 


Two types of experiments have been presented demonstrating
nonlearning. The first experiment showed that an S who had
not learned behaved as though the relevant stimuli were
literally absent. The second set of experiments, by
reducing stimulus complexity to a minimum and by
guaranteeing that S observed the relevant stimuli,
demonstrated nonlearning in a more dramatic form.
Nonlearning was shown also to be not a sporadic occurrence
by an occasional S, but rather an outcome of definite
problem-solving processes, susceptible to theoretical
analysis and to manipulation. According to the analysis,
nonlearning can be produced, despite a well-motivated S
and consistent S-R feedback contingencies, by making S
sample from an H set which does not contain the correct H.

It should be stressed that the theory employed was not ad
hoc. A "theory of nonlearning" was not improvised to
account for the data. Rather the data conformed to
predictions derived from a general theory of
hypothesis-testing behavior. The theory had already been
employed to account for behavior during blank trials, for
the discontinuity in the percentage of correct responses
during simple learning, for cue redundancy and cue
reversal, etc. The theory was here enlarged to include the
empty and infinite sets, but this
followed plausible lines.

This expanded theory and the experi- 
mental results have relevance to three other 
research topics: the effects of random feed- 
back on subsequent discrimination learning, 
the awareness controversy, and the Einstellung
effect.

Random feedback: Levine (1962) demonstrated that the
learning of very simple discriminations was clearly
retarded if E said [...] randomly [...] few trials. This
result was also shown to hold for concept learning by
Mandler, Cowan, and Gold (1964) and for complex (6-D)
discrimination learning by Holstein and Premack (1965).
[...] that some Ss rejected the [...] (including the
solution) during the random feedback phase and [...]
another subset (e.g., [...]) into the discrimination phase.
Holstein and Premack made a similar suggestion. Behind
these suggestions was the tacit assumption that if Ss were
testing Hs from an incorrect set, then they would not
learn the correct contingencies. No attempt was made to
demonstrate either that random feedback caused S to sample
from an incorrect H set or that such a sampling would
hinder the subsequent simple learning. The second type of
experiment presented above, however, relates directly to
the latter assumption. It demonstrated that by setting S
to sample Hs from an incorrect set, the learning of a
simple contingency was indeed delayed. The operation of
random feedback is well viewed, then, as a gross technique
for manipulating the sets from which S samples. The
resulting retardation in simple learning follows from H
theory as outlined above.

The awareness controversy: In recent years several
hundreds of Es have challenged or defended the
reinforcement principle under the banners of the awareness
controversy. Two extreme partisan positions in this
controversy may be discerned. One (Dulany, 1968;
Spielberger & DeNike, 1966) is that the reinforcement
principle is irrelevant, that only if S becomes "aware" of
the contingencies will performance increase. The other
(Dixon & Oakes, 1965; Verplanck, 1962) holds that the
response strengthening occurs automatically, that
awareness is an irrelevant or, at best, an independent
process. Postman and Sassenrath (1961) took [...] as have
recently several other theorists (Bandura, 1969; Kanfer,
1968; Krasner, 1967).


response-strengthening 
5, the former weakly but persistently, 
the latter strongly but rapidly, that is, in a 


discontinuous Way. The results presented 
above are relevant n 


Views but to this last oTa 


vocally that the reinfo 


principle can 
be completely nullified 


à Establishing effec- 


not a sufficient condi- 
Producing learning. 


creasing the Probability or th in; 

The principle, so long a bastion of learning theory, may
be defended in two ways. Silver, Saltz, and Modigliani
(1970) suggest that learning via H testing and learning
automatically via reinforcers may both occur but are
mutually exclusive processes. According to this view the
procedures of the present experiments evoke the H-testing
mechanism and thereby switch off automatic strengthening.
These experiments, of course, cannot negate this
possibility. Nor can any laboratory experiments which S
might interpret as a problem-solving task. It may be,
however, that the Silver et al. formulation will provide
the ultimate resolution of the controversy.

The second defense of the principle comes through
redefining the response. This is most easily seen in the
second set of experiments. One may argue that because of
the preliminary problems, saying "A" was no longer the
functional response. Rather, the functional response, that
is, that which was affected by the reinforcement, was S's
covert formulation of a position sequence. These covert
formulations were so strengthened in the preliminary
problems that they did not extinguish over the 100 trials
of the test problem. Far from rebutting this defense, one
need only note that it is a major conversion to H theory.
Indeed, H theory may be regarded as the theory of these
covert formulations. In the early stages of the
development of H theory, Levine (1959, 1963) suggested
that its most important feature was the redefinition of
the response: "The H, rather than the specific choice
response on a particular trial, is regarded as the
dependent variable, i.e., as the unit of behavior affected
by the reinforcements [Levine, 1963, p. 270]."
Subsequently, H theory has had considerable refinement
(e.g., the all-or-none character of H strengthening and
weakening, the changes in strength of several Hs
simultaneously, the encoding mode, visual and verbal, of
information, etc.), but the redefinition of the response
remains the foundation on which the theory has been built.

Einstellung: The reader, like one S mentioned above, has
undoubtedly noticed the similarity of the second set of
experiments to the Einstellung experiment (cf. Luchins,
1942). In this experiment also, S has preliminary
experiences which render him functionally blind to a
subsequent simple solution. For example, Luchins presented
Ss with a series of paper-and-pencil maze problems. For
the first nine problems, S was given a picture of a maze
consisting of both a direct straight path leading from the
start toward the goal box and other long circuitous paths.
During these problems, the straight path always was
blocked, that is, visibly ended as a cul-de-sac just
before the goal box. One of the circuitous paths leading
off from the right of the start box always eventually led
to the goal. On the tenth problem, the direct path was
connected to the goal. Nevertheless, the Ss typically
chose the round-about alley leading off from the right. In
H terms, the analysis is clear: the preliminary problems
led S to sample Hs dealing with alleys off to the right.
Until this set is exhausted, the S is insensitive to
direct central alleys. This theory, incidentally, has an
obvious implication not yet tested within the older
Einstellung tradition. The theory implies that the larger
the incorrect H set, the longer it takes that set to
become empty, and the more durably is S functionally blind
to the simple solution. This, in turn, implies that for
the critical (tenth) maze, the larger the number of
incorrect right-hand circuitous paths, the longer will S
fail to see the correct direct path.
For decades the Einstellung effect has been treated in
isolation from learning theory, primarily because that
theory was based on conditioning principles. This
deficiency vanishes when the effect is viewed from the
perspective of H theory. The Einstellung-like results in
the second set of experiments were predicted from the
basic principles of H testing. By the straightforward
development of H theory, one suddenly realized that the
theory encompassed one form, at least, of the Einstellung
effect: The S appears functionally blind as long as he is
sampling from an incorrect H set. This easy, natural
treatment of a hitherto isolated phenomenon further
attests, along with the confirmed predictions, to the
validity of the H model described above.


AN ANALYSIS OF TWO QUANTITATIVE
THEORIES OF COGNITIVE BALANCE 1


A. RODNEY WELLENS AND DONALD L. THISTLETHWAITE 2


Vanderbilt University


Previous attempts to formulate balance theories capable of
accommodating interelement relations of varying magnitudes are
briefly reviewed. Two new algebraic formulations of the
quantitative balance theories of Feather and Wiest are proposed.
It is shown that the discrepancy principle used in the former
theory contains an algebraic formulation which is equivalent to
one that may be given for the upper boundaries of balance implied
in the latter theory. Empirical results probing these theories
provided some support for the algebraic formulation common to both
theories, but disconfirmed the polarization principle which has
been used as an adjunct to Wiest's geometric model.


An early attempt to incorporate the concept of "cognitive
consistency" into workable psychological theory was
Heider's (1946) theory of cognitive balance. Cartwright
and Harary (1956), after proposing an extension of the
Heider model, observed that a principal problem yet to be
resolved was the development of a systematic method of
treating interelement relationships of varying strength.
Osgood and Tannenbaum's (1955) principle of congruity
accommodated two relations that may vary in magnitude, but
as Wiest (1965, p. 1) and Zajonc (1968, p. 354) have
recently observed, the third relation (the assertion
linking source and concept) assumed only a positive or a
negative sign and was not sensitive to finer variation.
Numerous writers have since called attention to the need
for a quantitative formulation of balance theory. For
example, Insko (1967, p. 174) noted that balance theory
should be extended to allow for small graduations in both
sentiment and unit relations; Shaw and Costanzo (1970,
p. 193) observed that the magnitude of a relation would be
expected to be an important determinant of balance.
Morrissette (1958) proposed a formula for expressing the
degree of balance in s-graphs containing lines (relations)
varying in strength. However, this formulation is not
applicable to s-digraphs containing a single semicycle,
the paradigm most commonly examined in the literature.
More recently, Wiest (1965) and Feather (1966)
independently proposed models capable of treating all
relationships in such triadic structures quantitatively
while at the same time defining a state of balance. It
appears, however, that these latter models also have
limitations: Wiest's geometric model is cumbersome when
implications other than the correlational relationships he
discussed are to be deduced, and Feather's discrepancy
principle leaves balance undefined when one relation has a
neutral value. The present paper offers heretofore
unavailable algebraic formulations of the quantitative
balance theories of Wiest and of Feather, discusses
conditions under which the two formulations yield similar
and dissimilar predictions, and briefly reports the
results of an experiment which probes the adequacy of each
formulation.


WIEST'S MODEL


Wiest (1965) has presented a quantitative extension of
Heider's balance theory, applicable to three-element
configurations involving a person (P), another person (O),
and an attitude object (X). The three interelement
relations, P's attitude toward X (P → X), P's perception
of O's attitude toward X (O → X), and P's liking for O
(P → O), are treated as three independent bipolar continua
which represent quantitative scales of the degree of
attraction between pairs of elements. Figure 1 shows a
cube whose three axes, X, Y, and Z, may be taken to
represent these interelement continua. According to Wiest,
a balanced triad is defined as any whose coordinates fall
within or on the surface of the tetrahedron formed by
connecting the four corners of the cube that correspond to
the four balanced triads recognized by Heider (i.e.,
corners A, C, E, and G).

FIG. 1. Wiest's three-dimensional space as defined by the
interelement relations P → X, O → X, and P → O,
represented here by the axes of a cube X, Y, and Z,
respectively.

Wiest derived confirmable predictions from the tetrahedron
model by "slicing" the cube along a single dimension at
points which correspond to various magnitudes of a
relation given in P-O-X configurations. These planes pass
through the tetrahedron yielding "boundaries of balance"
associated with triads possessing the given relational
value. Such a procedure is illustrated in Section A of
Figure 2 which shows the cube in Figure 1 sliced in five
places (-2, -1, 0, +1, and +2) along the Y dimension.
These boundaries yield predictions as to the magnitude of
covariation expected between two of the relations as a
function of the third. As can be seen, X and Z covary
directly as a function of the level of Y: the stronger the
Y relation, the greater the correlation between X and Z.
The same pattern of boundaries shown in this figure would
appear if the cube had been similarly cut along either of
the other two dimensions.


Reformulation 


It may be noted that the tetrahedron ACEG in Figure 1 has
a top surface which includes the triangles CEG and AEG,
while the bottom surface includes triangles ACE and ACG.
When planes are passed through the cube at five
equidistant points perpendicular to the Y axis (i.e.,
corresponding to the values -2, -1, 0, +1, and +2 along
the Y axis in Figure 1) the intersection of each of these
planes with the top surface of the tetrahedron results in
the lines EC, ILK, MPO, RUT, and AG in Section A of
Figure 2. These lines represent the upper boundaries of
balance associated with triads possessing a given Y value
of -2, -1, 0, +1, and +2, respectively. Similarly, the
intersection of the five planes with the bottom surface of
the tetrahedron is shown in Section A of Figure 2 as the
lines EC, ITK, MNO, RST, and AG which represent the
corresponding lower boundaries of balance associated with
triads possessing the same given values.

The boundaries of balance discussed above define the
theoretical upper and lower limits of Z for given values
of X and Y. Because these boundaries are composed entirely
of lines with slopes of plus or minus one, they may be
expressed in terms of quite simple, linear equations. It
can be shown (for example, by substitution of given X and
Y values) that each of the predicted limits of Z in
Section A of Figure 2 satisfies one of the following
equations:


Z_U = k - |X - Y|     [1]
Z_L = |X + Y| - k     [2]


where Z_U represents the upper boundary limit for Z; Z_L
represents the lower boundary limit for Z; |X - Y| and
|X + Y| correspond to the absolute values of the real
difference and sum, respectively, of the two given
relations; and the integer k is a constant representing
the highest positive scale
value possible on each of the three interelement
relations.


Generalized Tetrahedron Formulation


The upper and lower limits of the range of values expected
for Z, given values for relations X and Y, have been
defined by the values Z_U and Z_L. These values may be
combined to yield a generalized formula for predicting a
single value for Z:


Z = aZ_U + bZ_L     [3]


where Z represents the expected value of the missing
relation, and a and b define weighting factors assigned to
the range limits.
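
The boundary formulas and the generalized formula can be written out as a short Python sketch. It assumes that Formulas 1 and 2 have the forms Z_U = k - |X - Y| and Z_L = |X + Y| - k, which agree with the corners of the tetrahedron, and it uses the scale from -2 to +2 as a default. The function names `balance_bounds` and `predicted_z` are hypothetical.

```python
from typing import Tuple


def balance_bounds(x: float, y: float, k: float = 2) -> Tuple[float, float]:
    """Return (Z_L, Z_U), the lower and upper boundaries of balance."""
    z_upper = k - abs(x - y)   # Formula 1 (assumed form)
    z_lower = abs(x + y) - k   # Formula 2 (assumed form)
    return z_lower, z_upper


def predicted_z(x: float, y: float, a: float, b: float, k: float = 2) -> float:
    """Formula 3: weighted combination of the two range limits."""
    z_lower, z_upper = balance_bounds(x, y, k)
    return a * z_upper + b * z_lower


# Boundaries for three slices of the cube along the Y dimension.
for y in (-2, 0, 2):
    row = [balance_bounds(x, y) for x in (-2, 0, 2)]
    print(f"Y = {y:+d}: {row}")
```

At Y = -2 and Y = +2 the two limits coincide for every X, so the slice reduces to a single line, while at Y = 0 and X = 0 the permitted range for Z is widest, running from -2 to +2.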

It should be noted that the values of a and b may vary
according to assumptions made about the expected
distribution of values along the range Z_L to Z_U. On the
basis of Osgood and Tannenbaum's (1955) polarization
principle, Wiest hypothesized a tendency for
configurations to cluster toward the surfaces of the
tetrahedron. If it is further assumed that these
polarization effects tend to displace structures equally
toward the surfaces of the tetrahedron, then it would seem
appropriate to assign equal weights to the boundaries of
balance associated with these surfaces. Predictions made
according to this assumption may be construed as
equivalent to assigning values of .5 to both a and b in
Formula 3 and will hereafter be referred to as those based
on the "equal weights" tetrahedron model. The solid lines
in Section A of Figure 2 show mean predictions made
according to this model for any missing relation in a
triadic (P-O-X) structure given various values of two of
the relations (the scale used in making these predictions
ranged arbitrarily from -2 to +2).
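
The equal-weights case can be worked through algebraically. The derivation below is a sketch that assumes the boundary formulas have the forms Z_U = k - |X - Y| and Z_L = |X + Y| - k; under that assumption the constant k cancels and the prediction takes a compact form.

```latex
\begin{aligned}
Z &= \tfrac{1}{2} Z_U + \tfrac{1}{2} Z_L \\
  &= \tfrac{1}{2}\bigl[(k - |X - Y|) + (|X + Y| - k)\bigr] \\
  &= \tfrac{1}{2}\bigl(|X + Y| - |X - Y|\bigr) \\
  &= \operatorname{sgn}(XY)\,\min\bigl(|X|, |Y|\bigr)
\end{aligned}
```

For example, X = +2 and Y = -1 give Z = (1 - 3)/2 = -1. On this reading the equal-weights prediction takes the sign of the product of the two given relations and the magnitude of the weaker one, and it is zero whenever either given relation is neutral.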

One implication of the polarization assumption is that the
obtained distributions of values of Z should be bimodal
for all structures in which Z_L and Z_U do not coincide.
It may be noted, however, that a variety of alternative
distributions of Z values within the theoretical Z_L to
Z_U ranges might be considered: for example, one might
expect a normal distribution of values along any range, or
a rectangular distribution of equally probable values, or
perhaps even a skewed distribution with the bulk of the
cases clustering about either the upper or lower
boundaries of balance. In any case any specific assumption
regarding the predicted distribution of points inside the
tetrahedron can be regarded as a dispensable adjunct to
the generalized geometric model.


FEATHER'S MODEL


Feather's (1964) structural balance model of communication
effects is based on Cartwright and Harary's (1956)
reformulation of Heider's balance theory. Attitudinal
relationships considered in Feather's model deal with a
source (S) presenting a communication (C) about an issue
(I) to a receiver (R). Only three of these elements will
be of concern here (namely, S, I, and R), with relations
being limited to the receiver's attitude toward an issue
(R → I), the receiver's perception of the source's
attitude toward the issue (S → I), and the receiver's
attitude toward the source (R → S). Recently, Feather
(1966) has shown that the sign and relative magnitude of
any one of these relations may be predicted when the sign
and magnitude of the other two relationships are known or
given. According to Feather, any given relation may be
either positive or negative, and in addition, may be
either weak or strong, resulting in a 4-point scale
ranging from strong negative to strong positive. One of
the principles of combination proposed by Feather may be
called a discrepancy principle; it states that when two
given attitudinal relations are of the same sign, small
discrepancies in their magnitudes will be associated with
(or will induce) a strong positive third relation, but,
when two given attitudinal relations are of opposite
signs, large discrepancies will be associated with a
strong negative third relation.3 A corollary which seems
implicit in Feather's formulation is the following: Given
two attitudinal relations of the same sign, the smaller
the discrepancy in their magnitudes, the stronger will be
the positive, third relation; given two attitudinal
relations of opposite sign, the larger the discrepancy in
their magnitudes, the stronger will be the negative, third
relation.

3 Although Feather (1966) stated th
posed principles of combination
nature of guesses based on one's
edge of interpersonal relations,"
edged that his discrepancy
gested by Newcomb's (1959
of the conditions for strain tow


Extension and Reformulation 


The scale which Feather proposes may be 
expanded to include any number of magni- 
tudes. Such a scale may be conceived as a 
range of magnitudes extended from —k to 
+k, where k may be any integer.  Discrep- 
ancy may then be defined as the distance 
between any two values along this con-
tinuum. If X, Y, and Z are taken to repre-
sent the relations R → I, S → I, and R → S,
then Feather's discrepancy principle may be
rewritten as two functional relationships:


Zs = k − |X − Y|    [4]
Zo = [...]    [5]


Here Zs or Zo represents the missing (or
to-be-induced) relational value; k represents
the highest possible positive value on the
scale employed; and |X − Y| corresponds
to the absolute value of the real difference 
between the two known relational values in 
the given structure. Formula 4 is used 
when the given values for X and Y are of 
the same sign. This formula yields a maxi- 
mally positive value for Zs when X and Y 
are minimally discrepant; Zs approaches 
zero as |X — Y| increases. The lines EV, 
ILJ, MPO, SUT, and WG in Section B of 
Figure 2 show predicted Zs values based on 
this formula. Formula 5 is used when X 
and Y are of opposite signs and it yields 
the largest negative value for Zo when X 
and Y are maximally discrepant; Zo ap- 
proaches zero as |X — Y| decreases. The 
lines V'C, J'K, M'P'O', RS, and AW' in
Figure 2 show the predicted Zo values based
on this formula. It may be noted that when 
it is assumed that an attitude is perfectly 
juxtaposed between positively and negatively 
affective positions (i.e., when one holds, or
is perceived to hold, an attitude that is lo-
cated precisely at the “middle” position on
the X or Y scale), both Formulas 4 and 5
may be considered applicable. Thus, for 
certain combinations of X and Y values, 
Formulas 4 and 5 yield competing predic-
tions concerning the missing Z relation (cf.
V vs. V’, J vs. J’, etc. in Section B of Fig- 
ure 2). One alternative in using these 
formulas when X or Y has such middle 
values is to use the average of the competing 
Zs and Zo values as the prediction for the
missing relation. The solid lines in Sec- 
tion B of Figure 2 show predictions made 
using this convention; these predictions will 
hereafter be referred to as those based on the 
"composite" discrepancy principle. Again, 
the scale used in making these predictions 
ranged from —2 to +2. 
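The same-sign half of the discrepancy principle can be sketched in a few lines. This is only an illustration: it assumes Formula 4 reads Zs = k − |X − Y|, which is the reading suggested by the description above (maximal when X and Y are minimally discrepant, approaching zero as |X − Y| grows). The function name `z_same_sign` is hypothetical, and Formula 5 is deliberately left out.

```python
def z_same_sign(x: int, y: int, k: int = 2) -> int:
    """Formula 4 as read here: Zs = k - |X - Y| for X and Y not of opposite sign."""
    if not (-k <= x <= k and -k <= y <= k):
        raise ValueError("x and y must lie on the scale -k..+k")
    if x * y < 0:
        raise ValueError("Formula 4 is stated only for relations of the same sign")
    return k - abs(x - y)


# Tabulate Zs over the non-negative half of the -2..+2 scale.
# Zero is included because the text treats the midpoint as compatible
# with either formula.
K = 2
same_sign_table = {
    (x, y): z_same_sign(x, y, K)
    for x in range(0, K + 1)
    for y in range(0, K + 1)
}
for y in range(0, K + 1):
    print(f"Y = {y:+d}:", [same_sign_table[(x, y)] for x in range(0, K + 1)])
```

Each row peaks where X equals Y, which is the "nondiscrepant" peak that the comparison with the tetrahedron model turns on.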


COMPARISONS BETWEEN THE
TWO FORMULATIONS

The formulations presented here represent 
two different methods of combining informa- 
tion to arrive at independent predictions 
concerning missing attitudinal relations in 
analogous triadic situations. One point of 
similarity that exists between the Wiest 
and Feather models concerns a single form- 
ula common to both algebraic formulations. 
Formula 4 used in conjunction with combin- 
ing values of the same sign in the extended 
Feather model is identical to Formula 1 
used for finding the upper boundary limits 
in the reformulated Wiest model. It should 
be emphasized, however, that interpreta- 
tions of the values obtained from these two 
equations vary significantly. Formula 4 in 
the Feather model yields a value for Zs, 
which is taken as the predicted point for a 
missing relation computed on the basis of 
relations having the same sign; on the other 
hand, values for Zu in the Wiest model de-
fine the upper limits for predictions made in-
volving relations of any sign. Figure 2
shows that predictions for Zs from Formula 
4 (EV, ILJ, MPO, SUT, and WG) repre- 
sent only a subset of the predictions for Zu 
from Formula 1 (EC, ILK, MPO, RUT, 
and AG). 

Examination of the solid lines presented 
in Sections A and B of Figure 2 shows the 
differential treatment of the X × Y inter-
action predicted by the equal-weights tetra-
hedron model and by the composite dis-
crepancy model. When one of the relations
given has a value which is low in magnitude
(e.g., Y = +1 or −1), predictions made
according to the former model approach an 
asymptote at plus or minus the lowest value 
given. Under the same conditions, predic- 
tions made according to the latter formula- 
tion tend to peak when given values are 
nondiscrepant, regardless of the strength of 
the relations involved. 

Another interesting difference between the 
predictions of these two models concerns 
predictions made when one given relation 
has a magnitude of zero (e.g., Y = 0).
Under these conditions, the equal-weights 
tetrahedron model predicts a magnitude of 
zero for the missing relation regardless of 
the strength of the remaining relation. The 
composite discrepancy model, on the other 
hand, predicts an inverted V relationship be- 
tween the missing relation (Z) and the 
second relation (X). This is to say that 
when one relation in a P-O-X configuration 
is given as neutral, the degree of positivity or 
negativity of the missing relation will depend 
solely on the absolute strength of the third 
relation. As can be seen in Figure 2, when 
Y is given as neutral, Z increases in positiv- 
ity as X decreases in absolute magnitude. 

These two models become less discrete in 
their predictions when given relations take 
on extreme values. Thus, when one given 
value, say Y, is maximally positive, a strong 
positive correlation is expected between 
levels of the second given relation and the 
induced missing relation. Likewise, both 
models predict a strong negative correlation 
between levels of one given relation and the 
missing relation when the other given rela- 
tion remains maximally negative. Although 
the equal-weights tetrahedron model and the 
composite discrepancy principle vary as to 
the degree of linearity associated with the 
expected relation between X and Z, it may 
be noted that the two theoretical curves are
extremely similar when Y = +2.


ILLUSTRATIVE APPLICATION OF THE
ALGEBRAIC FORMULATIONS 

There are, of course, numerous ways to
investigate the implications of balance form-
ulations. For example, Wiest (1965)
attempted to measure “real” attitudes of
children toward their classmates and found 
that the more a child liked another child, 
the more he tended to agree with the other 
in respect to the degree of his liking for a 
focal child. On the other hand, Aronson 
and Cope (1968) used a "completion" para- 
digm involving the direct induction of two 
"real" attitudes, with the dependent variable 
being the way the subject completed the 
third relation. Briefly, a given attitude to- 
ward an experimenter was first induced by 
having the experimenter act in a harsh or 
pleasant manner; next, an attempt was made
to manipulate the subject's perception of the
supervisor's attitude toward the experi-
menter by having the supervisor behave in a
harsh or pleasant manner toward the experi-
menter within earshot of the subject; finally
the subject's acquired attitude toward the 
supervisor was assessed. Similarly, Fuller 
(1970) used the completion paradigm by em-
ploying the “bogus questionnaire” tech-
nique of Byrne (1969) to manipulate inter-
personal similarity, thereby inducing affec-
tive relations toward two strangers (O and
Q), and then asking subjects to predict
O's probable liking for Q. More commonly,
investigators have used “role-playing” in-
structions which ask the subject to imagine
specified types of relationships. In a para-
digm which may be called a “felt tension”
variant, the subject is given three relations
in a balanced or imbalanced triad and then 
rates the pleasantness of the hypothetical 
situation. In contrast, a completion para-
digm using role-playing instructions typically
gives the subject only two of the three inter-
element relations in a given triad and then
asks him to estimate the third relation.
Feather (1966), for example, has used the
latter procedure in testing his formulation
of the discrepancy principle. 

The following experiment was designed
to compare the ability of each of the given
algebraic formulations to predict values asso-
ciated with one particular interelement rela-
tion. Subjects were requested to imagine
given P → X and O → X relationships and were
then asked to estimate their probable liking
for an imagined other person (P → O).⁴
Obviously, this experiment was not intended 
to widely sample or probe the relative appli- 
cability of these formulations, but serves 
mainly to illustrate one of several means of 
comparison. 


Method 


An attitude questionnaire was constructed with 
the stated purpose being "to investigate the proc- 
esses by which people form first impressions of 
others on the basis of limited information." Each 
item of the 25-item questionnaire specified the sub- 
ject's hypothetical stand on a "resolution about 
recent student demonstrations occurring on the 
West Coast” (P → X), together with another
person's feeling toward the same resolution (O →
X). Thus, for example, in varying the P → X
relation, subjects were asked in successive items
to imagine that they were "strongly opposed to," 
“mildly opposed to," "just as much in favor of as 
opposed to," "mildly in favor of," or "strongly in 
favor of" the hypothetical issue. These descrip- 
tive phrases may be thought of as representing 
the quantities —2, —1, 0, +1, and +2, respectively, 
on the X dimension of both the Wiest and Feather 
models. The O → X relation was similarly varied
and may be conceived of as corresponding to
identical values along the Y dimension. Sub-
jects were instructed to mark, on a 7-point scale
ranging from "dislike" to "like," how they thought
they would feel toward the other person (P → O).
Judgments were to be based solely on the in- 
formation provided within a specific item. 

The 25 items in this questionnaire represented 
the 25 cells of a Latin square formed by consider- 
ing five levels each of the two independent vari-
ables, X and Y. The order of the items was de-
termined randomly, with the restriction that no
cells tangent to each other in the Latin square 
could be adjacently positioned as items on a given 
page of the questionnaire. This procedure was 
designed to prevent any sequential incrementing 
of the independent variables that would lead to 
Ss responding according to an externally sug- 
gested pattern. Order was further controlled by 
creating a between-subject variable which varied
the order in which pages were assembled; three 
such orders were prepared. 

Subjects and procedure. A total of 150 students
enrolled in an introductory psychology course at 
Vanderbilt University were asked to complete 
the questionnaire described above. Subjects were 
told to read the instructions printed on their 
questionnaire and to begin marking items as soon 
as they understood what they were to do. Any 
questions were answered individually. The session 
lasted approximately 40 minutes. 


4 The relations O → X, P → X, and P → O will
be considered equivalent to the relations S → I,
R → I, and R → S.
Results


This experiment represented a 5 × 5 × 3
factorial design, with repeated observations 
on the first two factors; these two within- 
subject factors represented the five levels of 
X and Y to which each subject was exposed 
in all possible combinations, while the third, 
between-subject, factor represented the three 
orders of item presentation. The dependent 
variable consisted of the subject's 7-point 
rating of his probable liking for the other 
person (Z). These ratings were trans-
formed to a 5-point scale ranging from −2
to +2.

An analysis of variance of these ratings 
indicated that the overall interaction pre- 
dicted by both the equal-weights tetrahedron 
model and the composite discrepancy prin-
ciple was highly significant (p < .001, ω² =
.46). In addition, main effects were found
to be associated with both X and Y (p <
.001, ω² = .02; p < .001, ω² = .02). Al-
though the formulation based on an equal-
weights version of Wiest’s model predicted
no main effects, such effects were predicted 
according to the composite discrepancy 
model. As can be seen, if the Y classifica-
tion is ignored in Section B of Figure 2, the
latter model predicts that Z will have its
largest value when X = +1 and smallest 
value when X = —2. A similar statement 
holds when the X classification is ignored 
and the main effects of Y on Z are con- 
sidered. Generally the obtained effects con-
formed to these expectations. The third, be-
tween-subject, factor, representing the order
of presentation of items, showed no signif-
icant main or interaction effects with the
other two factors.

Examination of Section C of Figure 2
shows that the asymptotic relationship pre-
dicted according to the equal-weights tetra-
hedron formulation was not confirmed. In- 
stead, when one of the relations given (e.g., 
Y) had a value of low magnitude, Z values 
peaked at nondiscrepant points as predicted 
according to the composite discrepancy 
model. Individual comparisons (Newman-
Keuls) performed between these “peak”
means and the other means associated with 
the particular level of Y involved showed 
both peaks to be significant (p < .01).
Composite discrepancy predictions concern- 
ing given neutral values were also con- 
firmed: when Y was given as zero, ratings 
of Z for X = 0 were significantly greater
than ratings of Z for other values of X by 
a Newman-Keuls test (p < .01). 

The general correlational predictions in- 
volving extreme values of Y were confirmed 
for both the equal-weights tetrahedron model 
and the composite discrepancy principle. 
When it was given that Y = +2, the correla- 
tion between given X values and obtained 
Z ratings was +.98; and when Y = —2, the 
correlation between these variables was —.96. 

Distribution analyses performed on scores 
contributing to each of the 25 obtained means 
were examined for evidence of bimodality. 
In eight out of the nine cells in which the 
tetrahedron model yields Zl to Zu ranges
greater than zero, the distributions of ob- 
tained Z ratings were unimodal (and each 
of the remaining 16 cells yielded a similar
unimodal distribution of ratings). This 
finding is in direct conflict with the assump- 
tion of an overall polarization tendency ope- 
rating within the tetrahedron, pushing struc- 
tures toward its surfaces. 
While the analysis of variance and indi-
vidual comparison criteria indicate some-
what greater support for the composite dis- 
crepancy principle than for the equal-weights 
tetrahedron model, other criteria provided 
mixed results. Table 1 summarizes the ave- 
rage squared deviations of the obtained Z 
ratings and the ratings predicted by each of 
the algebraic equations: Formulas 1 and 2 
represent the upper and lower boundaries 
of balance, respectively, of Wiest's model;
Formula 3' is an equal-weights version of 
the generalized tetrahedron model; Form- 
ulas 4 and 5 represent formulations of 
Feather's discrepancy principle for given 
relations of similar and opposite signs, re- 
spectively; finally, Formula 6 represents the 
composite discrepancy principle (i.e., pre-
dictions were based on either Formula 4 or
5, as appropriate, and on an average of Zs
and Zo values when both equations may be 
construed as making competing predictions). 
The last column of Table 1 shows correla- 
tions between obtained ratings and the rat- 
ings predicted by each formula across all cells 
in which a given formula was applicable. In
general, the average deviations and correla- 
tions in the upper half of Table 1 indicate 
that the upper boundaries of balance of the
tetrahedron model (Formula 1) yielded pre- 
dictions more accurate than the lower bound- 
aries (Formula 2), but no more accurate 
than an equal-weights model (Formula 3’). 
Also, the lower half of Table 1 shows that 
the discrepancy principle for given relations 
of similar signs (Formula 4) gave more ac- 
curate predictions, according to the least- 
squares criterion, than did the discrepancy 
principle for relations of opposite signs 
(Formula 5) or the composite discrepancy 
principle (Formula 6). The correlations 
for the latter three formulas showed only 
negligible differences, however. Judged by 
the criterion of squared residuals, the equal- 
weights tetrahedron model yielded a some- 
what closer fit to the data than the composite 
discrepancy principle. 
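The two fit criteria used in Table 1, the average squared deviation and the correlation between obtained and predicted ratings, can be sketched as follows. This is a minimal illustration rather than the authors' computation: the function names are hypothetical, the correlation is assumed to be an ordinary Pearson coefficient, and the example numbers are invented and are not the values reported in Table 1.

```python
from math import sqrt


def mean_squared_deviation(obtained, predicted):
    """Average of squared differences between obtained and predicted ratings."""
    if len(obtained) != len(predicted) or not obtained:
        raise ValueError("need two non-empty sequences of equal length")
    return sum((o - p) ** 2 for o, p in zip(obtained, predicted)) / len(obtained)


def pearson_r(xs, ys):
    """Pearson product-moment correlation between two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length (at least 2)")
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return sxy / sqrt(sxx * syy)


# Hypothetical cell means for one level of Y (NOT the data of Table 1).
obtained = [-1.2, -0.3, 0.6, 1.1, 0.4]
predicted = [-1.0, 0.0, 1.0, 2.0, 1.0]
print("average squared deviation:", round(mean_squared_deviation(obtained, predicted), 3))
print("correlation:", round(pearson_r(obtained, predicted), 3))
```

The two criteria can disagree, as they do in the results above: a formula may track the shape of the obtained profile (high correlation) while sitting farther from the means in absolute terms (larger squared deviation).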


DISCUSSION


It is apparent that the choice of criteria
largely determines any inferred suitability of 
one model over the other. Although the 
composite discrepancy formulation better 
reflects obtained relational profiles in the 
cases tested (Y = +1 and Y = 0), the
equal-weights tetrahedron model provides 
predictions that are closer in terms of ab- 
solute deviations from obtained means. Re- 
sults of the experiment therefore do not per- 
mit us to conclude that one of the quanti- 
tative balance formulations is clearly superior 
to the other. It seems more appropriate to 
consider which part of each formulation is 
better supported by the data, and to note how
varied data collection procedures might affect
the outcome of such comparisons.

5 The large number of subjects included meant
that the power of each test (Lewis, 1960, p. 374)
performed to evaluate the significance of discrep-
ancies between obtained and predicted ratings was
unusually great. Thus, it was found that the
obtained data deviated significantly (p < .01) from
the theoretical curves based on each of the six
equations for each of the Y values shown in
Table 1. Generally, the magnitudes of the F
ratios for these significance tests varied directly
with the average squared deviations shown in
this table. For example, the poorest fit was ob-
tained for Equation 2 in predicting Z values when
Y = 0: the average squared deviation was 2.34
(Table 1) and F = 76895, df = 5/596. The best
fit was obtained for Equation 6 in predicting Z
values when Y = 0: the average squared deviation
was .02 (Table 1) and F = 5.40, df = 5/596.

First, it is clear that there is little support 
for any assumption of a polarization tend- 
ency pushing points in the interior of the 
tetrahedron toward the surfaces of the tetra- 
hedron equally. The present results are con- 
sistent with those reported by Fuller (1970) 
in yielding no supportive evidence of such
polarization tendencies. Little would be 
lost by discarding this assumption. 

As noted, the upper boundaries of balance 
derived from the tetrahedron model seem to 
yield a decidedly better basis for predicting 
subjects’ judgments than did the lower boun- 
daries associated with this geometric figure. 
While the concept of a lower limit of balance 
did not, by itself, appear to provide a basis 
for accurate predictions of ratings, nonethe- 
less it does seem useful in that it provides 
the basis for "compromise" estimates as sug- 
gested by Formula 3. For example, if the 
coefficients a and b in this formula are 
assumed to have weights of .75 and .25, re- 
spectively, a somewhat better fit of the data 
is obtained: the overall average squared de-
viation is .22 and the overall correlation of 
predicted and obtained ratings is .95. Fur- 
ther, it seems likely that the optimal a and 
b parameter values in Formula 3 will vary 
according to the type of relationship the sub- 
ject is asked to estimate. Wellens and 
Thistlethwaite (in press) report some evi- 
dence suggesting that the deviations of ob- 
tained ratings from predictions based on the 
lower surface of the tetrahedron are smaller 
when the subject is asked to estimate P →
X or O → X, rather than P → O. The lat-
ter findings may be interpreted as indicating
that subjects prefer not to imagine that they 
dislike another. 

Similarly, we can conclude that the dis-
crepancy principle of Feather provides more
accurate predictions when the two given re-
lations have similar signs than when they
have opposite signs. The latter condition is,
of course, one in which qualitative balance
theory would predict that the subject will
complete the missing relationship negatively
(by imagining that he dislikes the other
person). If it had been assumed at the out-
set that subjects tend to avoid attributing
negative liking relations toward others, then 
there would have been a basis for explaining 
the relatively inaccurate predictions of the 
discrepancy principle when the given rela- 
tions are of opposite signs. 

In short, if it is true that subjects gen- 
erally avoid attributing negative liking rela- 
tions to others, then the conditions sampled 
in this experiment may not have been opti- 
mal for assessing the applicability of Form- 
ulas 2 and 5. Obviously, numerous and va- 
ried applications of these algebraic formula- 
tions will have to be evaluated before the 
relative utilities of the several formulas can 
be confidently assessed. 
[...] are theories that describe the relationships among several
[...] Conjoint measurement provides methods for analyzing
[...] information only. This analysis is applied to a class of
[...] three variables [A + P + U, (A + P)U, AP + U,
[...] been widely employed in different areas of psychology.
It leads to the formulation of observable ordinal properties that can be used to
test and diagnose which of the rules, if any, is appropriate for a given set of data.


Much of scientific endeavor can be de- 
scribed as an attempt to account for changes 
in some dependent variable (e.g., response 
strength, subjective value, kinetic energy) 
as a consequence of variations in one or more
independent variables. To attain this goal, 
the dependent as well as the independent 
variables have to be measured, and the 
function (or rule) that relates them has to 
be specified. If the variables that enter the 
proposed functional relation can be mea- 
sured independently (as is typically the case 
in physics), the validity of the composition 
principle can be tested directly. For ex- 
ample, to verify the law that the kinetic 
energy, E, of a body of mass m, moving at
velocity v is given by E = ½mv², one has to
measure E, m, and v independently, and 
then demonstrate, by computation, that 
the equality holds within the limits of ac- 
curacy of the measurements. 
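The direct test described here, in which all three quantities are measured independently and the equality is then checked within measurement accuracy, can be sketched as below. The function names, the tolerance, and the numbers are illustrative assumptions only.

```python
from math import isclose


def kinetic_energy(m: float, v: float) -> float:
    """Energy predicted by the law E = (1/2) m v^2."""
    return 0.5 * m * v ** 2


def law_holds(e_measured: float, m: float, v: float, rel_tol: float = 0.02) -> bool:
    """True if the independently measured energy agrees with the law within rel_tol."""
    return isclose(e_measured, kinetic_energy(m, v), rel_tol=rel_tol)


# Hypothetical independent measurements of E, m, and v.
print(law_holds(e_measured=9.1, m=2.0, v=3.0))   # within 2% of the predicted 9.0
print(law_holds(e_measured=12.0, m=2.0, v=3.0))  # well outside the tolerance
```

The check is only possible because E, m, and v each have their own measurement procedure; the psychological cases discussed next lack exactly this.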

Many psychological theories have also 
been formulated as composition principles 
that describe a functional relation among 
several variables. Hull (1952), for instance, 
proposed that the reaction potential, or the 
response strength, R, of an organism is a
multiplicative function of its drive D, habit
strength H, and incentive K. Thus, Hull
postulated the relation R = DHK. Spence 
(1956), on the other hand, argued that re- 
sponse strength equals the product of habit 
strength with the sum of drive and incen- 
tive, or in symbols, R = H(D + K). We 
call this the distributive rule. Additional 
examples of psychological composition rules 
are discussed below. 

The major difficulty in the study of com- 
position rules in psychology is that, in gen- 
eral, there is no adequate independent pro- 
cedure for the measurement of the relevant 
variables. These psychological variables 
(e.g., drive, incentive) are typically char- 
acterized in terms of some appropriate 
physical measures (e.g., hours of depriva- 
tion, magnitude of reward). This charac- 
terization may be very useful for various 
experimental purposes, but it provides 
neither a satisfactory definition nor an ade- 
quate measurement procedure, because the 
psychological attribute and the physically 
controllable variable may not be related in 
the proper manner. It appears rather arbi- 
trary to assume anything more than a 
monotonic relation between response 
strength and running speed, for example. 
Furthermore, the psychological attribute 
of response strength may be characterized 
in terms of different physical measures such 
as running speed or proportion of correct 
responses, and there is no reason to suppose 
that these measures are related to each 
other by a linear transformation. Hence, 
the best one can hope for is to find a mono- 
tonic relation between the controllable phys- 
ical scale and the relevant psychological
variable. But if the physical scales provide
at most an ordinal measure of the psycho- 
logical attributes, how can any psychologi- 
cal composition rule be established? 

An answer to this question is given by 
the conjoint-measurement approach, which 
attempts to solve the measurement and the 
composition problems simultaneously, by 
constructing measurement scales for the 
relevant variables so that the proposed 
composition principle is satisfied. The 
problem investigated in this approach, 
therefore, is not whether a specified func- 
tional relation holds among several (in-
dependently measured) variables, but
rather whether there exist scales of mea- 
surement (for both the dependent and the 
independent variables) that satisfy the 
proposed composition rule. The key fea- 
ture of the conjoint-measurement approach 
is that only the ordinal aspects of the data 
are required to be compatible with the pro-
posed composition principle. Thus, in
conjoint measurement, or for that matter in 
any fundamental measurement, one starts 
with an ordering of the dependent variable 
and investigates what properties this order 
should satisfy so that it can be represented 
numerically according to the proposed com- 
position principle. 

For concreteness, consider a factorial 
experiment in which the effects of drive, 
habit strength, and incentive are manipu- 
lated by varying, respectively, the length 
of the period of food deprivation, the 
amount of previous training in the maze, 
and the concentration of food reward. Sup- 
pose that response strength is ordered by 
the time it takes the organism to traverse 
the maze, and let (d,h,k) > (d’,h’,k’) denote 
the observation that organisms traverse the 
maze faster (hence show greater response 
strength) in treatment combination (d,h,k)
than in treatment combination (d’,h’,k’), of 
drive, habit strength, and incentive, respec- 
tively. From the viewpoint of conjoint 
measurement, however, the actual speeds 
at which the maze is traversed are inessen- 
tial—only their order matters. The con- 
joint-measurement analysis of the Hullian 
model consists, then, of the study of the 
ordinal conditions that are necessary and/or 
sufficient for the existence of scales D, H,
and K for the measurement of drive, habit 
strength, and incentive, respectively, satis- 
fying 


(d,h,k) > (d',h',k') if and only if
D(d)H(h)K(k) > D(d')H(h')K(k').


Thus, the composition rule proposed by 
Hull is satisfied in an ordinal sense, if there 
exist scales D, H, and K whose product pre- 
serves the observed order of response 
strength. Similarly, the distributive com- 
position rule, proposed by Spence, is satis- 
fied in an ordinal sense, if there exist scales 
that satisfy the relation 


(d,h,k) > (d',h',k') if and only if
[D(d) + K(k)]H(h) >
[D(d') + K(k')]H(h').

Given this interpretation of the problem, the
question becomes what are the qualitative
(i.e., ordinal) conditions that the data must
satisfy in order to be compatible with the
composition rules proposed by Hull or
Spence. Indeed, one may wonder whether it
is possible to distinguish between these rules
on the basis of ordinal information alone.
It turns out, however, that the two com-
position rules are qualitatively different, and
that the multiplicative rule, for example,
implies testable ordinal properties that are
not implied by the distributive rule. To il-
lustrate, suppose (d,h,k) > (d',h',k) and
(d',h',k') > (d,h,k'). Such data violate the
Hullian model, but not the Spencian model.
Under the Hullian model, the first inequality
implies D(d)H(h)K(k) > D(d')H(h')K(k).
Assuming that all the values are posi-
tive, we can multiply this inequality by
K(k')/K(k) to obtain D(d)H(h)K(k')
> D(d')H(h')K(k') which, under the Hull-
ian model, implies (d,h,k') > (d',h',k'),
contrary to our hypothesis. On the other
hand, if we let D(d) = 1, D(d') = K(k')
= 2, K(k) = H(h') = 4, and H(h) = 5,
we see that the Spencian model is com-
patible with the data, because (1 + 4)5
> (2 + 4)4 and (2 + 2)4 > (1 + 2)5.
The multiplicative and the distributive
models, therefore, can be distinguished when all
three factors are varied simultaneously.
If one or more factors are held constant, 
however, and the scales are all positive, then 
the two models can be shown to be ordinally 
equivalent, that is, they are indistinguish- 
able on the basis of ordinal information. 
Unfortunately, in most of the experiments 
designed to compare the models, for ex- 
ample, Seward, Shea, and Davenport (1960) 
and Pavlik and Reynolds (1963), only two 
of the factors were varied. Hence, any con- 
clusion based on such tests is necessarily 
limited to the particular measurement 
scales employed. 
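The illustration above, two observations that the distributive rule can accommodate and the multiplicative rule cannot, can be checked numerically. The sketch assumes the scale values D(d) = 1, D(d') = 2, H(h) = 5, H(h') = 4, K(k) = 4, K(k') = 2, where H(h) = 5 is the value for which both Spencian inequalities hold. The grid search over small positive integers is only a sanity check on the algebraic argument, not a proof, and all names are hypothetical.

```python
from itertools import product


def hull(D, H, K):
    """Multiplicative rule: R = D * H * K."""
    return D * H * K


def spence(D, H, K):
    """Distributive rule: R = (D + K) * H."""
    return (D + K) * H


def fits(rule, Dd, Dd2, Hh, Hh2, Kk, Kk2):
    """Do these scale values reproduce both observations under the given rule?

    Dd, Hh, Kk stand for D(d), H(h), K(k); Dd2, Hh2, Kk2 for the primed levels.
    """
    first = rule(Dd, Hh, Kk) > rule(Dd2, Hh2, Kk)       # (d,h,k)    > (d',h',k)
    second = rule(Dd2, Hh2, Kk2) > rule(Dd, Hh, Kk2)    # (d',h',k') > (d,h,k')
    return first and second


# The distributive rule with the values discussed in the text.
spence_ok = fits(spence, Dd=1, Dd2=2, Hh=5, Hh2=4, Kk=4, Kk2=2)

# Search a small grid of positive scale values for any multiplicative fit.
grid = range(1, 7)
hull_hits = [v for v in product(grid, repeat=6) if fits(hull, *v)]

print("Spencian values fit:", spence_ok)
print("Hullian fits found on grid:", len(hull_hits))
```

The empty search result mirrors the algebra: with positive scales the two observations would require D(d)H(h) to be both greater and smaller than D(d')H(h').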

Viewed numerically, all composition rules 
are equally refutable. This is no longer 
true, however, when they are evaluated 
from an ordinal viewpoint. For example, 
assuming that all scale values are positive, 
it can be shown that more orderings are 
compatible with the distributive rule than 
with the multiplicative rule. The Hullian 
model, therefore, is easier to refute ordinally 
than the Spencian one. Such information 
must be taken into account in order to 
evaluate the models properly. 

The search for ordinal properties that 
lead to the construction of numerical scales 
and to the formulation of composition rules 
attracted much research both in the 
physical and the behavioral sciences. 
Around the beginning of the century, 
Hölder (1901) presented a set of assump-
tions, or axioms, formulated in terms of an 
ordering of objects with respect to some 
attribute, and showed that these qualita- 
tive assumptions yield the familiar additive 
measurement scale for attributes like mass, 
length, or time. Perhaps the earliest and 
the best known example of measurement 
theory in the behavioral sciences is the 
theory of expected utility that was first 
formalized by von Neumann and Morgen-
stern (1951), and later generalized by
Savage (1954) and by Luce and Krantz 
(1971). This theory consists of a set of 
qualitative assumptions about an indivi- 
dual's preference order between uncertain 
options, or lotteries, that lead to the con-
struction of utility and subjective-probabil-
ity functions that satisfy the expected-
utility principle.

Recent research on fundamental measure-
ment originated with investigations of the 
additive rule (e.g., Adams & Fagot, 1959; 
Davidson, Suppes, & Siegel, 1957; Debreu, 
1960; Luce & Tukey, 1964), and later ex-
tended to more complicated rules (e.g.,
Krantz, 1968; Roskies, 1965; Tversky, 
1967a). This work, motivated primarily 
by theoretical problems in social science, 
has led to the development of new methods 
for the ordinal analysis of a wide variety of 
composition rules, and for the construction 
of numerical scales. For a detailed account 
of these developments see Krantz, Luce, 
Suppes, and Tversky (1971). In the pres- 
ent paper, we analyze from an ordinal view-
point a particular class of simple composi-
tion rules in three variables. Our purpose 
is to exhibit some testable ordinal proper- 
ties that can be used to diagnose which, if 
any, of the various composition rules is ap- 
propriate. Consequently, this paper is pri- 
marily concerned with diagnosis and testing 
rather than with axiomatization and scal- 
ing. It should be noted, however, that if 
the data are sufficiently regular, or finely 
graded (i.e., equally spaced, or dense), then 
the proposed diagnostic properties are not 
only necessary but also sufficient for the 
validity of the rules in question and for the 
construction of the appropriate numerical 
scales. These results were first obtained by 
Krantz (1968), and later developed in full 
by Krantz et al. (1971, Ch. 6 and 7). 

This paper is devoted to the analysis of a
class of composition rules in three variables, 
called simple polynomials. In general, this 
class is characterized as follows: (a) any 
single variable is a simple polynomial; (b)
if two simple polynomials have no variables 
in common, then their sum and their prod- 
uct are also simple polynomials. More 
specifically, there are exactly four³ simple
polynomials in three variables; namely, 

A + P + U      additive rule
(A + P)U       distributive rule⁴
AP + U         dual-distributive rule⁵
APU            multiplicative rule.

3 We ignore the distinctions obtained by permuting
variables, since the diagnosis and testing problem
for (A + P)U is the same as that for (A + U)P, etc.

4 This rule is called distributive because the
multiplicative factor U is distributive over the additive
factors A and P. The reason for the selection of
these letters will become clear later.

5 This rule is called dual distributive because it is
obtained from the distributive rule by interchanging
the roles of addition and multiplication.


154 D. H. KRANTZ AND A. TVERSKY


An example of a composition rule in three
variables that is not a simple polynomial is
AP + PU + AU. The simple polynomials
in three variables are of special interest.
First, they encompass a large class of
psychological theories. Second, they are
particularly tractable from a mathematical
viewpoint, although their ordinal analysis
is by no means trivial. Third, once the
ordinal analysis of this class is performed,
it can be extended to simple polynomials in
any number of variables (see Krantz, 1968).
To motivate the present investigation, we
first list several examples of theories, from
various areas of psychology, that are based
on the above composition rules. Most of
the theories, however, have been traditionally
interpreted within a numerical, rather
than an ordinal, framework. The present
results, therefore, provide more general
(i.e., ordinal) methods for testing and
comparing such theories.


Examples


1. We have already seen that the performance
models of Hull (1952) and Spence (1956) consist,
respectively, of the multiplicative and the distributive
composition rules. These models can be further
extended by subtracting off a term attributable to an
inhibition factor. This generalization leads to the
polynomials $HDK - I$ and $(D + K)H - I$. If, say,
K is held constant, then both models become ordinally
equivalent to the dual-distributive rule $HD - I$.
Clearly, subtraction is equivalent to the addition
of a variable with a negative sign.

2. The additive composition rule has been applied
by scores of authors, in many different areas of
psychology, in either an ordinal or a numerical form.
We cite just a few examples: Adams and Fagot
(1959) and Tversky (1967b) in decision making;
Hammond (1955) and Hoffman (1960) in clinical
evaluation; Anderson (1970), Anderson and Fishbein
(1965), and Tversky and Krantz (1969) in
impression formation.

3. In his theory of achievement motivation,
Atkinson (1957) postulated that the attractiveness of a
risky task is given by $I_S P_S M_S - I_F P_F M_F$, where S
and F denote success and failure, respectively, I the
incentives of these states, P the probabilities of their
occurrence, and M the strengths of the motives to
achieve success and to avoid failure. In its general
form, this is a simple polynomial in six variables.
For situations where incentive is inversely related to
probability, Atkinson has simplified the model by
assuming that $I_S = 1 - P_S = P_F$, and $I_F = 1 - P_F$.
Letting $Q = I_S P_S = I_F P_F$ yields the distributive
rule $(M_S - M_F)Q$.

4. Several ordinal composition rules in three vari-
ables are obtained by applying expected-utility
theory to specific situations. In particular, consider
the lottery where one receives a certain amount of
Commodity 1 and a certain amount of Commodity 2
if some chance event occurs, and nothing otherwise.
Assuming that the two commodities are independent,
the expected utility of such a lottery is given by the
distributive rule $(U_1 + U_2)P$, where P is a (sub-
jective) probability function and $U_1$, $U_2$ are the
utilities of the two commodities.

5. Consider a game in which a fair coin is tossed 
several times, and where a person wins a certain 
prize each time the coin turns up heads, and pays a 
certain fine every time the coin turns up tails. 
Coombs and Huang (1970) have proposed that the 
risk of such a game is given by the distributive rule 
(R + V)N, where R is a function of the range of the
outcomes, V is a function of the expected value, and
N is a function of the number of repeated plays.

6. Another example of a distributive composition
rule is derivable from the opponent-colors theory
proposed by Hurvich and Jameson (1955). According
to their theory, the saturation, S, of a color
is expressible by

$$\frac{S}{1 - S} = \bigl(|r - g| + |y - b|\bigr)\,\frac{1}{|w - bk|},$$

where $|w - bk|$ is the absolute value of the achromatic
(white-black) response, and $|r - g|$, $|y - b|$
are the absolute values of the chromatic red-green
and yellow-blue responses, respectively.

7. In the models discussed thus far, negative
values need not appear in a multiplicative factor.
There are some instances, however, where a factor
has a natural zero point, so that the values on the
opposite sides of this point produce opposite effects
on the ordering of another factor. In this case the
use of negative scale values is suggested. For example,
suppose that combinations of adjectives (e.g.,
helpful, tall, offensive) and adverbs of quantity (e.g.,
very, slightly) are evaluated with respect to moral
connotation (e.g., good to bad). In such tasks, we
find (for examples see Cliff, 1959; Gollob, 1968) that
"very helpful" is better than "slightly helpful,"
"very offensive" is worse than "slightly offensive,"
and that "very tall" and "slightly tall" are both
neutral, and hence equal, with respect to moral
connotation. In this case, a multiplicative composition
rule is called for, where the values of the adverbs are
all positive, and the values of the adjectives helpful,
tall, and offensive are, respectively, positive, zero,
and negative. Another example of a composition
rule that involves positive and negative values can
be found in Rosenberg (1956).

8. Another rule that accommodates negative values
is obtained by quantifying the theory of structural
balance, proposed by Heider (1946), which has been
widely applied to the study of cognitive organization
and interpersonal relations (for examples see Brown,
1965, Ch. 11).

Consider a group of three individuals, with various 
degrees of liking among its members, and assume, for 
simplicity, that liking (and disliking) is a symmetric 
relation. Following Heider's ideas, we propose that 
the degree of balance of the group is expressible by 
the multiplicative rule $L_{12}L_{23}L_{13}$, where $L_{ij}$ is the
positive (or negative) degree of liking (or disliking)
between Individuals i and j. Thus, a group is balanced
if either all its members like each other ("the
friend of my friend is my friend") or if one of its
members dislikes the other two, who in turn like
each other ("the friend of my enemy is my enemy").
Likewise, a group is unbalanced if either all its members
dislike each other ("the enemy of my enemy is
my enemy") or if one of its members likes the other
two, who in turn dislike each other ("the enemy of
my friend is my friend").
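
The balance rule just described can be illustrated with a minimal Python sketch. The function names `balance` and `is_balanced` are hypothetical, and the use of +1 and -1 as liking values is an assumption made only for illustration; the text requires only that liking be positive and disliking negative.

```python
def balance(l12: float, l23: float, l13: float) -> float:
    """Degree of balance of a triad: the product of the three
    symmetric pairwise liking values (positive = liking,
    negative = disliking)."""
    return l12 * l23 * l13


def is_balanced(l12: float, l23: float, l13: float) -> bool:
    """A triad is balanced when the product of its liking values is positive."""
    return balance(l12, l23, l13) > 0


# Illustrative values (assumed): +1 for liking, -1 for disliking.
all_friends = is_balanced(+1, +1, +1)          # everyone likes everyone
one_outsider = is_balanced(-1, +1, -1)         # Individual 1 dislikes 2 and 3, who like each other
all_enemies = is_balanced(-1, -1, -1)          # everyone dislikes everyone
one_friend_of_both = is_balanced(+1, -1, +1)   # Individual 1 likes 2 and 3, who dislike each other
```

Under these assumed values, the first two triads come out balanced and the last two unbalanced, matching the four cases listed in the text.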


DIAGNOSIS AND TESTING 


In this section we formulate several test- 
able ordinal conditions and show how they 
can be used to diagnose and test simple 
polynomials in three variables. To simplify 
the exposition, we first discuss the positive 
case where all scale values are positive, and 
then investigate the more general signed 
case where zero and negative values are required.
We use the boldface letters $\mathbf{A}$, $\mathbf{P}$,
and $\mathbf{U}$ to denote variables, or factors, and
the capital italicized letters $A$, $P$, and $U$ to
denote the scales that measure their respective
effects. Note that $\mathbf{A}$, $\mathbf{P}$, and $\mathbf{U}$ are sets
of treatment conditions, whereas $A$, $P$, and
$U$ are numerical functions defined on these
sets. For notational convenience, we use
letters from the beginning (e.g., a, b, c), the
middle (e.g., p, q, r), and the end (e.g., u,
v, w) of the alphabet to denote, respectively,
elements of $\mathbf{A}$, $\mathbf{P}$, and $\mathbf{U}$. Thus, $A(a)$ is the
scale value of the element a in $\mathbf{A}$, etc.


Independence 


One class of simple testable properties, 
called independence conditions, plays a 
central role in the ordinal analysis of com- 
position rules. The essence of ordinal in- 
dependence is that the ordering of the de- 
pendent variable can be used to order some 
of the independent variables (i.e., factors) 
in a manner that does not depend on the 
remaining variables. 

Suppose, for example, that $(a,p,u) \geq (b,p,u)$.
Since the two treatment conditions
differ only with respect to their first component,
one is tempted to conclude that the
effect of a is at least as large as the effect of
b. This conclusion, however, is valid only
if the former effect is at least as large as the
latter whenever the other components are fixed;
that is, if for any q and v, $(a,q,v) \geq (b,q,v)$.
Otherwise, the order of the elements of A is
not well defined because it depends on the
elements of P and U with which they are
combined. Formally, A is independent (of
P and U) whenever

    $(a,p,u) \geq (b,p,u)$ if and only if $(a,q,v) \geq (b,q,v)$.    [1]

(In stating formal properties of this type,
it is understood that the quantification is
universal. That is, A is independent whenever
Equation 1 holds for all a, b in A; p, q
in P; and u, v in U.) Independence for P
(or for U) is defined in the same fashion.
The concept of ordinal independence can 
be applied not only to a single factor but to 
a pair of factors as well. Suppose, for example,
that $(a,p,u) \geq (b,q,u)$. Since these
two treatment combinations have the same
third component, one may be tempted to
conclude that the effect of the (a,p) combination
is at least as large as that of the
(b,q) combination. For this conclusion to
be valid, however, the same order of the
joint effects of A and P must hold for any
fixed third component, that is, $(a,p,v) \geq (b,q,v)$.
Stated formally, A and P are
jointly independent (of U) whenever

    $(a,p,u) \geq (b,q,u)$ if and only if $(a,p,v) \geq (b,q,v)$.    [2]

Joint independence for P and U and for A
and U is defined in similar fashion. (Note
that joint independence for all pairs of factors
implies independence for all single factors.⁶
The converse does not hold in general.)


6 To demonstrate, assume joint independence, and
suppose that $(a,p,u) \geq (b,p,u)$. Since A and P are
jointly independent, this inequality is equivalent to
$(a,p,v) \geq (b,p,v)$ which, in turn, is equivalent to
$(a,q,v) \geq (b,q,v)$ since A and U are jointly independent.
Hence, $(a,p,u) \geq (b,p,u)$ is equivalent to
$(a,q,v) \geq (b,q,v)$, and independence is thus derived
from joint independence.


FIG. 1. A graphic illustration of a
2 × 2 × 2 factorial structure.


A graphic illustration may elucidate the 
meaning of independence. Consider the 
2 × 2 × 2 factorial structure presented in
Figure 1, whose vertices represent treat-
ment combinations that are labeled by the
appropriate letters and numbered from 1 to
8.


Using the rectangular solid displayed in 
Figure 1, independence and joint independ- 
ence can be interpreted geometrically. In- 
dependence for A, for instance, corresponds 
to the property that endpoints of edges 
parallel to the A axis are ordered in the 
same way. For example, $(2) \geq (1)$ if and
only if $(8) \geq (7)$. More generally, single-
factor independence corresponds to the
condition that endpoints of parallel edges
are ordered in the same fashion, for example,
$(4) \geq (2)$ if and only if $(7) \geq (5)$.

Joint independence for A and P corresponds
to a somewhat different property,
namely, that pairs of vertices with the same
U component, which lie on parallel lines,
are ordered in the same way, for example,
$(4) \geq (1)$ if and only if $(8) \geq (5)$. More
generally, joint independence corresponds
to the condition that endpoints of parallel
diagonals are ordered in the same way.

Note that both single-factor and joint 
independence conditions are formulated in 
terms of a 2 X 2 X 2 subset of a factorial 
structure. In order to test them, therefore,
one has to consider all 2 X 2 X 2 subsets of 
any larger three-factor experiment, and to 
check whether the required properties are 
satisfied within each such subset. 

In practice, such tests of independence
are quite simple. Given data from an
l × m × n three-factor experiment, one
looks at the ordering of any “row”
in which the A component varies, with the
P and U components fixed. All such orderings
need to be identical,
so that the A components can be permuted
such that the order along any A “row” is
monotone increasing. Similar tests of in-
dependence hold for P and for U. To test
joint independence of A and P, one must
check whether the ordering is the same in
each A × P matrix, with the U component
fixed, etc.
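
The tests just described can be sketched in Python as follows. This is a minimal illustration, not a procedure taken from the paper: the function name `is_independent`, the nested-list layout `data[i][j][k]` (levels of A, P, and U), and the scale values in the example are all assumptions. The same function covers single-factor independence (one axis) and joint independence (two axes).

```python
from itertools import combinations, product


def sign(x):
    """Return -1, 0, or 1 according to the sign of x."""
    return (x > 0) - (x < 0)


def is_independent(data, axes):
    """Check ordinal independence of the factors listed in `axes`
    (0 = A, 1 = P, 2 = U) from the remaining factor(s).

    `data[i][j][k]` is the observed value for level i of A, j of P, k of U.
    The factors in `axes` are (jointly) independent when every pair of their
    level combinations is ordered the same way in every fixed context.
    """
    dims = (len(data), len(data[0]), len(data[0][0]))
    rest = [ax for ax in range(3) if ax not in axes]

    def cell(levels, context):
        idx = [0, 0, 0]
        for ax, lv in zip(axes, levels):
            idx[ax] = lv
        for ax, lv in zip(rest, context):
            idx[ax] = lv
        return data[idx[0]][idx[1]][idx[2]]

    candidates = list(product(*(range(dims[ax]) for ax in axes)))
    contexts = list(product(*(range(dims[ax]) for ax in rest)))
    for x, y in combinations(candidates, 2):
        # The comparison of x with y must come out the same in every context.
        if len({sign(cell(x, c) - cell(y, c)) for c in contexts}) > 1:
            return False
    return True


# Illustrative (assumed) positive scale values for a 2 x 2 x 2 design.
A, P, U = [1, 5], [1, 10], [1, 2]
distributive = [[[(a + p) * u for u in U] for p in P] for a in A]
additive = [[[a + p + u for u in U] for p in P] for a in A]

single_ok = all(is_independent(distributive, (ax,)) for ax in range(3))
a_and_p_ok = is_independent(distributive, (0, 1))   # A and P jointly independent of U
a_and_u_ok = is_independent(distributive, (0, 2))   # fails for these scale values
```

With these assumed values, data generated by (A + P)U satisfy single-factor independence and joint independence of A and P, but A and U are not jointly independent; data generated by A + P + U satisfy joint independence for every pair.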

The first step in the diagnosis of composi- 
tion rules is the test of single-factor inde- 
pendence. If it fails, for any factor, then no 
composition rule with positive scales is com- 
patible with the data. In other words, if 
single-factor independence does not hold, 
then either no composition rule is available, 
or the composition rule must include some 
zero or negative scale values. 

If single-factor independence is satisfied, 
then neither negative nor zero values are 
required. That is, if any composition rule 
is appropriate, positive scales suffice. Since 
this is the case in many applications, and
since the introduction of zero and negative 
values results in certain complications, we 
first discuss the positive case (i.e., where 
simple independence holds), and treat the 
signed case in a later section. (Note that 
in the positive case, the additive and the 
multiplicative rules are (ordinally) equival- 
ent, because the former can be obtained 
from the latter by a logarithmic transforma-
tion. In the signed case, where the restric-
tion to positive values is not imposed, the
additive model is equivalent to that special 
case of the multiplicative model in which all 
scale values are positive.)


Cancellation


Besides independence, there is another
class of ordinal properties, called cancellation
conditions, that are used to diagnose
and test the various composition rules.
The first cancellation condition presented is
a necessary condition for all composition
rules in the positive case. It is a two-dimensional
property, that is, it applies to pairs
of treatment combinations that have a
common component in one factor. Formally,
A and P satisfy double cancellation⁷ if

    $(a,q,u) \geq (b,r,u)$ and $(b,p,u) \geq (c,q,u)$ imply $(a,p,u) \geq (c,r,u)$.    [3]

Double cancellation is the key property 
necessary for additive composition in two 
variables (e.g., see Krantz et al., 1971; Luce 
& Tukey, 1964). A graphical illustra- 
tion of double cancellation is presented in 
Figure 2, where treatment combinations are 
represented by points, and the inequalities 
of the hypotheses and the conclusion are 
denoted, respectively, by thin and thick 
arrows. 

Unlike the independence conditions, 
which can be tested with only two levels per 
factor, double cancellation requires at least 
a 3 × 3 design. (As Figure 2 shows, six
cells of a 3 × 3 matrix are involved.) In a
larger, m × n design, there are
$\binom{m}{3} \times \binom{n}{3}$
possible tests of double cancellation. Each
of these can be classified in one of three
ways: acceptance (the antecedents and the
conclusion of Equation 3 hold), rejection
(the antecedents hold, but the conclusion
does not), or no test (the antecedents do not
hold). The tests are not logically independ-
ent, since the conclusion of one test may be
the antecedent of another, etc.
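
The three-way classification can be sketched in Python as follows. This is a minimal illustration under stated assumptions: the function name `double_cancellation_tally` is hypothetical, the input is an A × P matrix at one fixed level of the third factor, and the tally counts every instance of Equation 3 over ordered triples of distinct levels, so its total differs from the count of 3 × 3 submatrices given in the text. The example matrices are invented for illustration.

```python
from itertools import permutations


def double_cancellation_tally(matrix):
    """Classify every instance of Equation 3 in an m x n matrix.

    `matrix[i][j]` is the observed value for level i of A and level j of P
    at one fixed level of the third factor. Each instance uses three
    distinct A levels (a, b, c) and three distinct P levels (p, q, r).
    """
    m, n = len(matrix), len(matrix[0])
    tally = {"acceptance": 0, "rejection": 0, "no test": 0}
    for a, b, c in permutations(range(m), 3):
        for p, q, r in permutations(range(n), 3):
            antecedents = (
                matrix[a][q] >= matrix[b][r] and matrix[b][p] >= matrix[c][q]
            )
            if not antecedents:
                tally["no test"] += 1
            elif matrix[a][p] >= matrix[c][r]:
                tally["acceptance"] += 1
            else:
                tally["rejection"] += 1
    return tally


# An additive matrix (assumed values) can never produce a rejection.
additive = [[a + p for p in (0, 1, 3)] for a in (0, 2, 7)]
additive_tally = double_cancellation_tally(additive)

# A matrix constructed to violate double cancellation at least once:
# 5 >= 4 and 5 >= 4 hold, but 1 >= 3 does not.
violating = [[1, 5, 0], [5, 0, 4], [0, 4, 3]]
violating_tally = double_cancellation_tally(violating)
```

Because the instances are not logically independent, the tally is best read as a descriptive summary rather than as a count of independent tests.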

If independence holds but double can- 
cellation is violated, then all simple poly- 
nomials are rejected. This follows from the 
fact that in all simple polynomials, any 
two-factor subset of treatment combina- 
tions, obtained by selecting a fixed level on 
the third factor, satisfies additivity, and 
hence satisfies double cancellation. In the 
positive case, therefore, both independence 
and double cancellation are necessary con- 
ditions for any simple polynomial. Once 
these properties, which are common to all
the rules, are satisfied, we can turn to the 
investigation of those properties that dis- 
tinguish among the three composition rules. 

7 The term cancellation is due to the fact that the
conclusion can be obtained from the hypotheses by
"cancelling" the common elements b and q on both
sides of the inequalities.

8 See also Adams and Fagot (1959).


FIG. 2. A graphic illustration of double cancella-
tion. (The inequalities of the hypotheses and the 
conclusion are represented, respectively, by thin 
and thick arrows.) 


We start by checking the number of pairs 
of factors that are jointly independent. 
If the additive (or, equivalently, the multi- 
plicative) rule is satisfied, then all pairs of 
factors satisfy joint independence. In the 
other two rules [i.e., (A + P)U and AP
+ U], A and P are jointly independent, but
the other two pairs of factors are not.
Hence, if no pair of factors is jointly inde-
pendent, then no simple polynomial is ap-
propriate; if one pair of factors is jointly
independent, then both the distributive and
the dual-distributive rules may be appro-
priate; and if all three pairs of factors are
jointly independent, then the additive rule
is appropriate.⁹

9 The numerical example in D, H, K near the
beginning of the paper shows that D and H need not be
jointly independent for the distributive rule
(D + K)H to hold. A similar example can easily be
constructed for the dual-distributive rule. On the
other hand, these violations of joint independence
depend on the fact that the ranges of the scales are
sufficiently large. Thus, in some circumstances, the
distributive or dual-distributive rule might hold, yet
due to a narrow range of scale values, two or three pairs
of factors might satisfy joint independence. If two
pairs satisfy joint independence, then the additive
rule is eliminated, and one can proceed as though only
one pair satisfied joint independence, anticipating,
however, a constricted range. If all three pairs
satisfy joint independence, the distributive and dual-
distributive rules are not necessarily eliminated, but
the additive rule is by far the most likely to work.
Moreover, if the data are sufficiently finely graded
and all three pairs satisfy joint independence, then it
can be shown that the additive rule is bound to work
(Krantz et al., 1971).


FIG. 3. A graphic illustration of distributive can-
cellation. (The inequalities of the hypotheses and
the conclusion are represented, respectively, by thin
and thick arrows.)


To complete the analysis of the positive 
case, the distributive and the dual-distribu- 
tive rules have to be differentiated. For 
this purpose, we introduce two additional 
properties—distributive cancellation and 
dual-distributive cancellation. Each of 
these properties is satisfied by the respective 
composition rule and need not be satisfied 
by the other. Hence, they can be used to 
test which of the two rules, if either, is more 
likely to hold. (Both cancellation condi- 
tions, however, are satisfied by the addi- 
tive composition rule.) 


Distributive cancellation is said to be satis-
fied whenever

    $(a,p,u) \geq (c,r,v)$, $(b,q,u) \geq (d,s,v)$, and $(d,r,v) \geq (b,p,u)$ imply $(a,q,u) \geq (c,s,v)$.    [4]


A graphic illustration of this property is pre- 
sented in Figure 3, where treatment com- 
binations are represented by points that are 
both labeled and numbered. The inequali- 
ties of the hypotheses and the conclusion 
are denoted, respectively, by thin and thick 
arrows. 

Interpreted in terms of Figure 3, dis- 
tributive cancellation asserts that if two 
endpoints of a diagonal in the unprimed 
plane [e.g., (1) and (2)] are respectively 
larger than the two endpoints of a diagonal 
in the primed plane [e.g., (1’) and (2’)], 
then the two endpoints of the second diagonal
in the primed plane [e.g., $(3')$ and $(4')$]
cannot be both respectively larger than the
two endpoints of the second diagonal in the
unprimed plane. In short, $(1) \geq (1')$,
$(2) \geq (2')$, and $(3') \geq (3)$ imply
$(4) \geq (4')$.

It is instructive to see how this property 
follows from the distributive rule. Let 
$A(a)$, $P(p)$, and $U(u)$ be the scale values of
a, p, and u, respectively. Suppose the dis-
tributive rule holds; hence, from the hy-
potheses of distributive cancellation,

$(a,p,u) \geq (c,r,v)$ implies
$$[A(a) + P(p)]\,U(u) \geq [A(c) + P(r)]\,U(v),$$
$(b,q,u) \geq (d,s,v)$ implies
$$[A(b) + P(q)]\,U(u) \geq [A(d) + P(s)]\,U(v),$$
and $(d,r,v) \geq (b,p,u)$ implies
$$[A(d) + P(r)]\,U(v) \geq [A(b) + P(p)]\,U(u).$$
Adding the first two numerical inequalities
yields
$$[A(a) + P(p) + A(b) + P(q)]\,U(u) \geq [A(c) + P(r) + A(d) + P(s)]\,U(v).$$
Adding the third numerical inequality
(that follows from the third hypothesis)
to this latest inequality, and cancelling the
terms common to both sides, yields
$$[A(a) + P(q)]\,U(u) \geq [A(c) + P(s)]\,U(v),$$
which, under the distributive rule, implies
$(a,q,u) \geq (c,s,v)$. This establishes the nec-
essity of distributive cancellation.
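
The necessity argument can be checked numerically by brute force. The sketch below is an illustration only: the function name `count_violations`, the two rule functions, and the particular scale values are assumptions. It treats each level as its own scale value, lets the elements in Equation 4 range over all levels (repetitions included, since the quantification is universal), and counts the cases in which the three hypotheses hold but the conclusion fails.

```python
from itertools import product


def count_violations(rule, A, P, U):
    """Count violations of distributive cancellation (Equation 4)
    when the observed order is generated by `rule(a, p, u)`.

    A, P, and U are lists of scale values.
    """
    violations = 0
    for a, b, c, d in product(A, repeat=4):
        for p, q, r, s in product(P, repeat=4):
            for u, v in product(U, repeat=2):
                hypotheses = (
                    rule(a, p, u) >= rule(c, r, v)
                    and rule(b, q, u) >= rule(d, s, v)
                    and rule(d, r, v) >= rule(b, p, u)
                )
                if hypotheses and not rule(a, q, u) >= rule(c, s, v):
                    violations += 1
    return violations


def distributive(a, p, u):
    return (a + p) * u


def dual_distributive(a, p, u):
    return a * p + u


# Illustrative (assumed) positive integer scale values.
A_vals, P_vals, U_vals = [1, 3, 5], [1, 5, 10], [1, 6]
n_distributive = count_violations(distributive, A_vals, P_vals, U_vals)
n_dual = count_violations(dual_distributive, A_vals, P_vals, U_vals)
```

For these values the distributive rule produces no violations, as the derivation requires, whereas the dual-distributive rule produces at least one. One such case is a = c = 1, b = 3, d = 5, p = 10, q = r = 5, s = 1, u = 1, v = 6: the three hypotheses read 11 ≥ 11, 16 ≥ 11, and 31 ≥ 31, but the conclusion would require 6 ≥ 7.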

Finally, to test the dual-distributive rule, 
a more complicated ordinal property, that 
includes five hypotheses, is required. Dual- 
distributive cancellation is satisfied whenever 
(sw) > (arw), (d,pu) > (b,t,x), 
(d,r,x) > (es), (eq,y) > (ey), 


and 


(aby) > day) | 


imply 
lpw) > (b,q,w). 


The dual-distributive cancellation is similar,
in nature, to the other cancellation conditions,
except that it is more difficult to test
and interpret. The interested reader is
invited to verify that this property follows
from the dual-distributive composition rule.

A flowchart summarizing the procedure
for the diagnosis of three-factor simple
polynomials in the positive case is presented
in Figure 4.
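
The diagnostic procedure for the positive case can also be written as a short decision function. This is a hedged sketch rather than the authors' own formulation: the function name, its argument names, and the returned strings are hypothetical, and the treatment of exactly two jointly independent pairs as equivalent to one pair is an assumption based on Footnote 9. The inputs are the outcomes of the ordinal tests described in this section.

```python
def diagnose_positive_case(
    independence: bool,
    double_cancellation: bool,
    n_jointly_independent_pairs: int,
    distributive_cancellation: bool,
    dual_distributive_cancellation: bool,
) -> str:
    """Return the diagnosis suggested by the test outcomes.

    independence: single-factor independence holds for all factors.
    double_cancellation: holds for all pairs of factors.
    n_jointly_independent_pairs: number of pairs (0 to 3) satisfying
        joint independence.
    """
    if not independence:
        return "check the signed case"
    if not double_cancellation:
        return "reject all rules"
    if n_jointly_independent_pairs == 0:
        return "reject all rules"
    if n_jointly_independent_pairs == 3:
        return "additive rule"
    # One pair (or, following Footnote 9, two pairs) is jointly independent.
    if distributive_cancellation:
        return "distributive rule"
    if dual_distributive_cancellation:
        return "dual-distributive rule"
    return "reject all rules"


example = diagnose_positive_case(True, True, 1, False, True)
```

The function checks distributive cancellation before dual-distributive cancellation, so data satisfying both are labeled distributive; that ordering is a simplification, and in such a case neither rule is actually excluded.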


FIG. 4. Flowchart for the diagnosis of three-factor simple polynomials in the positive case.

Start
1. Does independence (1) hold for all factors? If no, check the signed case. If yes, go to Step 2.
2. Does double cancellation (3) hold for all pairs of factors? If no, reject all rules. If yes, go to Step 3.
3. How many pairs of factors satisfy joint independence (2)? If none, reject all rules. If three, additive rule. If one, go to Step 4.
4. Does distributive cancellation (4) hold? If yes, distributive rule. If no, go to Step 5.
5. Does dual-distributive cancellation (5) hold? If yes, dual-distributive rule. If no, reject all rules.


Sign Dependence


The diagnostic properties discussed in the 
previous section are all valid when the 
respective composition rules are restricted 
to positive-valued scales. If, however, the 
multiplicative factors admit negative or 
zero scale values, then these properties are 
readily violated. To demonstrate how independence
(Equation 1) fails to hold, suppose
treatment combination (p,u) has a
"positive" effect, while treatment combination
(q,v) has a "negative" effect, and assume
$(a,p,u) \geq (b,p,u)$. According to the
multiplicative composition rule, therefore,
$(b,q,v) \geq (a,q,v)$, because a change in the
sign of a multiplicative factor reverses the
direction of the inequality. This violation
of independence is due to the fact that treatment
combinations (p,u) and (q,v) produce
opposite effects or have different "signs."
However, if we restrict the test to treatment
combinations that have the same "sign,"
then independence is satisfied under all
three composition rules. To accommodate
negative and zero scale values, therefore,
independence is generalized by taking
"signs" into account. This can be accomplished
if the set of all treatment combinations
of the form (p,u) can be divided into
three sets whose effects are, respectively,
"positive," "neutral," and "negative."
Specifically, A satisfies sign dependence if
P × U can be partitioned into three sets,
denoted $|P \times U|^{+}$, $|P \times U|^{0}$, and
$|P \times U|^{-}$, such that A is independent of
each of these sets (in the sense of Equation
1) and such that the positive and the negative
sets induce opposite orders, and the
neutral set induces an equality over A.
More formally, let the pairs $|p,u|^{+}$, $|p,u|^{0}$,
and $|p,u|^{-}$ denote, respectively, generic
elements of $|P \times U|^{+}$, $|P \times U|^{0}$, and
$|P \times U|^{-}$. The factor A is called sign
dependent (on P × U) whenever¹⁰

1. $(a,|p,u|^{+}) \geq (b,|p,u|^{+})$ if and only if $(a,|q,v|^{+}) \geq (b,|q,v|^{+})$.
2. $(a,|p,u|^{-}) \geq (b,|p,u|^{-})$ if and only if $(a,|q,v|^{-}) \geq (b,|q,v|^{-})$.
3. $(a,|p,u|^{+}) \geq (b,|p,u|^{+})$ if and only if $(b,|q,v|^{-}) \geq (a,|q,v|^{-})$.
4. $(a,|p,u|^{0}) = (b,|p,u|^{0})$.    [6]

10 Formally, 3 implies 1 and 2, but this logical
nicety is not incorporated in Equation 6, since it may
be confusing at first glance.


Thus, A is sign dependent if the ordering
induced over A by (p,u) depends only on the
"sign" of (p,u), that is, only on the subset
to which (p,u) belongs, with different sub-
sets behaving according to their respective
"signs." Clearly, independence (Equation
1) is the special case of sign dependence
(Equation 6) where the sets of neutral and
negative treatment combinations (i.e.,
$|P \times U|^{0}$ and $|P \times U|^{-}$) are both empty. In
this case, Part 1 above reduces to Equation
1 while the other parts of Equation 6 are
satisfied vacuously.

In terms of an l × m × n factorial de-
sign, sign dependence for A means that the
A components can be permuted so that the
entries in the l cells of any one A "row" are
either monotone increasing or monotone de-
creasing or constant. The fixed P × U
elements have opposite "signs" when the 
corresponding A “rows” are oppositely 
ordered, or neutral "sign" when the A 
"row" is constant. If any two nonconstant 
A "rows" are ordered differently, but not 
oppositely, both independence and sign de- 
pendence fail for A. 
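
The row-by-row check described above can be sketched in Python. This is a minimal illustration under stated assumptions: the function name `classify_contexts`, the nested-list layout `data[i][j][k]`, and the scale values in the example are invented, and the label "+" is given arbitrarily to the first nonconstant pattern encountered, which is consistent with the remark later in this section that the signs of one partition can be assigned arbitrarily.

```python
from itertools import combinations, product


def sign(x):
    """Return -1, 0, or 1 according to the sign of x."""
    return (x > 0) - (x < 0)


def classify_contexts(data):
    """Test sign dependence of A on P x U.

    `data[i][j][k]` is the observed value for level i of A, j of P, k of U.
    Returns a dict mapping each (j, k) to "+", "-", or "0" when A is sign
    dependent, and None when some A "row" is ordered differently from the
    others without being constant or exactly reversed.
    """
    l, m, n = len(data), len(data[0]), len(data[0][0])
    reference = None
    labels = {}
    for j, k in product(range(m), range(n)):
        pattern = tuple(
            sign(data[x][j][k] - data[y][j][k])
            for x, y in combinations(range(l), 2)
        )
        if not any(pattern):
            labels[(j, k)] = "0"          # constant row: neutral context
        elif reference is None or pattern == reference:
            reference = pattern
            labels[(j, k)] = "+"          # same order as the reference row
        elif pattern == tuple(-s for s in reference):
            labels[(j, k)] = "-"          # exactly reversed order
        else:
            return None                   # neither alike nor opposite
    return labels


# Illustrative (assumed) scale values under the multiplicative rule APU.
A, P, U = [1, 2, 3], [-1, 0, 2], [1, -1]
data = [[[a * p * u for u in U] for p in P] for a in A]
labels = classify_contexts(data)
```

For these assumed values, the contexts with P = 0 are labeled neutral, and the remaining contexts split into two groups with opposite orders over A.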

Joint sign dependence for a pair of factors 
is defined in an analogous manner, except 
that only a single factor, the third one, is 
partitioned into signed subsets. Formally, 
A and P are said to be jointly sign dependent 
(on U) if U can be partitioned into three sets,
denoted $U^{+}$, $U^{0}$, and $U^{-}$, with respective
generic elements $u^{+}$, $u^{0}$, and $u^{-}$, such that

1. $(a,p,u^{+}) \geq (b,q,u^{+})$ if and only if $(a,p,v^{+}) \geq (b,q,v^{+})$.
2. $(a,p,u^{-}) \geq (b,q,u^{-})$ if and only if $(a,p,v^{-}) \geq (b,q,v^{-})$.
3. $(a,p,u^{+}) \geq (b,q,u^{+})$ if and only if $(b,q,v^{-}) \geq (a,p,v^{-})$.
4. $(a,p,u^{0}) = (b,q,u^{0})$.    [7]

Thus, A and P are jointly sign dependent on
U if they are jointly independent of $U^{+}$, $U^{0}$,
and $U^{-}$, and if the order induced by the
positive set is the opposite of that induced
by the negative set, and the order induced
by the neutral set is an equality. Clearly,
joint independence is the special case of
joint sign dependence where the sets of neu-
tral and negative treatment conditions
(i.e., $U^{0}$ and $U^{-}$) are empty. In this case,
Part 1 of Equation 7 reduces to Equation 2,
while the other parts of Equation 7 are
satisfied vacuously.

Joint sign dependence for A and P means
that any two nonconstant A × P matrices
(for different fixed components in U) are
ordered either alike or entirely oppositely.
If the latter, then the corresponding com-
ponents in U have opposite sign. A con-
stant A × P matrix is diagnostic of u in
$U^{0}$.
Although it is obvious how to designate
$U^{0}$, the + and − signs can be specified
arbitrarily for any one partition. Other
partitions, however, must have signs desig-
nated in a consistent manner. For ex-
ample, in the case of the multiplicative
composition rule, sign dependence for single
factors yields partitions of A × P, A × U,
and P × U, while joint sign dependence
yields partitions of A, P, and U. We must
have the properties

    $|A \times P|^{+} = (A^{+} \times P^{+}) \cup (A^{-} \times P^{-})$,

etc. Also, any condition (a,p,u) with (a,p)
in $|A \times P|^{+}$ and u in $U^{+}$ must yield a value
of the dependent variable which is greater
than the constant value for conditions in
$|A \times P|^{0} \times U$ or $A \times P \times U^{0}$, etc., and
the latter constant (the natural zero on the
rescaled dependent variable) must be
greater in turn than any condition in
$|A \times P|^{+} \times U^{-}$, etc.

Altogether, the partitioning of factors 
into positive, negative, and neutral subsets, 
if it exists, should be fairly obvious, because 
the complete reversals of ordering should be 
a striking phenomenon, unlikely to be 
missed when it occurs. In some instances 
(see Example 7), the partitioning can be 
carried out in advance of collecting quanti- 
tative data. 


Generalized Cancellation

We start the diagnostic procedure with 
the test of sign dependence: if it fails, then 


all simple polynomials are rejected; if it 
holds, then we test additional properties. 
Following the testing procedure developed 
for the positive case, we turn now to the 
test of double cancellation. Like independ- 
ence, double cancellation need not hold in 
the signed case. It is satisfied, however, 
within any homogeneous subset of the fac- 
torial structure, that is, a subset in which no 
multiplicative factor changes sign. Clearly, 
in the positive case the entire structure is 
homogeneous. To test the various cancella- 
tion conditions in the signed case, the struc- 
ture is divided into homogeneous subsets 
according to the partition of its factors. 
Since any homogeneous subset containing a
neutral multiplicative factor has a degener-
ate or a trivial order, only the positive and
the negative sets are considered. The
multiplicative rule induces eight such homo-
geneous subsets (e.g., $A^{+} \times P^{+} \times U^{+}$,
$A^{+} \times P^{+} \times U^{-}$, $A^{+} \times P^{-} \times U^{-}$, etc.); the
distributive rule induces four (i.e.,
$|A \times P|^{+} \times U^{+}$, $|A \times P|^{+} \times U^{-}$,
$|A \times P|^{-} \times U^{+}$, $|A \times P|^{-} \times U^{-}$); and the
dual-distributive rule induces four (i.e.,
$A^{+} \times P^{+} \times U$, $A^{+} \times P^{-} \times U$,
$A^{-} \times P^{+} \times U$, $A^{-} \times P^{-} \times U$).
Once these subsets are identified, the ap-
propriate cancellation conditions can be
applied to each of the subsets. For a par-
ticular composition rule to be valid, all the
conditions it implies in the positive case
should be satisfied within each homogene-
ous subset. Hence, double cancellation
must be satisfied by all such subsets if any
of the simple polynomials is valid.

The next step is to test the number of 
pairs of factors that satisfy joint sign de-
pendence. If no pair of factors satisfies it,
then all rules are rejected; if one or possibly
two pairs of factors satisfy it, then both the
distributive and the dual-distributive rules
may be valid; if all three pairs of factors
satisfy it, then the multiplicative rule is
appropriate.

When exactly one pair of factors satisfies 
joint sign dependence, a very simple method 
for discriminating between the distributive 
and the dual-distributive case exists. (This 
is usually the case under these rules; see
Footnote 9.) With no loss of generality, we


can label these factors A and P. In the 


signed case, if the distributive rule holds, 
then either U is sign dependent, but not in- 
dependent (of A and P), or A and P are 
jointly sign dependent, but not jointly inde- 
pendent (with respect to U), or both. On 
the other hand, if the dual-distributive rule 
holds, then both U is independent (of A and 
P), and A and P are jointly independent (of 
U). Thus, a comparison of the pattern of 
independence and sign-dependence rela- 
tions is sufficient for choosing between the 
distributive and the dual-distributive rules 
in this case. This comparison, however, 
does not guarantee that any of them is satis- 
fied. To test each of the rules, the appropri- 
ate cancellation condition should be tested 
in each of the homogeneous subsets. Fin- 
ally, one has to verify that the necessary 
order relations between the homogeneous 
subsets are satisfied, that is, positive com- 
binations should exceed neutral combina- 
tions, which, in turn, should exceed negative 
ones. 

A flowchart summarizing the major steps 
in the diagnosis of simple polynomials in 
the signed case is presented in Figure 5. 
Note that the testing procedure for the 
signed case, outlined in Figure 5, closely 
parallels the testing procedure for the posi- 
tive case, outlined in Figure 4. The main 
difference between the two procedures is in 
the replacement of independence by sign 
dependence, and the restriction of cancella- 
tion to homogeneous subsets. 
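The sequence of decisions described in this section can be written as a small function. This is only a sketch of one reading of the text: the function name, the argument names, and the order in which the checks are applied are assumptions, and each argument stands for the outcome of an ordinal test that must be carried out separately on the data.

```python
def diagnose_signed_case(sign_dependence_ok,
                         n_joint_sign_dependent_pairs,
                         double_cancellation_ok,
                         distributive_cancellation_ok,
                         dual_distributive_cancellation_ok):
    """Return the simple polynomial suggested by the outcomes of the tests.

    All cancellation arguments refer to tests carried out within the
    homogeneous subsets, as the text requires for the signed case.
    """
    if not sign_dependence_ok:
        return "reject all rules"
    if n_joint_sign_dependent_pairs == 0:
        return "reject all rules"
    if not double_cancellation_ok:
        # Double cancellation is necessary for every simple polynomial.
        return "reject all rules"
    if n_joint_sign_dependent_pairs == 3:
        return "multiplicative"
    # One or two pairs: both distributive rules remain candidates.
    # The distributive rule is checked first, so it wins if both tests pass.
    if distributive_cancellation_ok:
        return "distributive"
    if dual_distributive_cancellation_ok:
        return "dual-distributive"
    return "reject all rules"


print(diagnose_signed_case(True, 1, True, False, True))
```

The function only encodes the branching; it does not replace the final check that positive combinations exceed neutral ones, which in turn exceed negative ones.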


An Empirical Example 


An interesting application of the present 
methods, to the problem of diagnosing rules 
of composition for factors affecting per- 
ceived risk, can be found in the paper of 
Coombs and Huang (1970). Subjects were 
asked to order, with respect to risk (not 
preference!), games that consist of multiple 
independent plays of a simple monetary 
gamble. In such a gamble, one receives x 
if a fair coin comes up heads and loses y if 
the coin comes up tails. Each game of this 
type can be characterized as a triple $(r, v, n)$, where $r = x + y$ is the range of the outcomes, $v = \tfrac{1}{2}(x - y)$ is the expected value of each play, and $n$ is the number of independent repeated plays of the game $(r, v)$.
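As a small worked illustration of this characterization, the triple can be computed directly from the stakes. The function name `game_triple` and the example amounts are assumptions introduced only for this sketch.

```python
def game_triple(x, y, n):
    """Characterize n plays of a fair-coin gamble: win x on heads, lose y on tails."""
    r = x + y        # range of the outcomes
    v = (x - y) / 2  # expected value of a single play
    return r, v, n


# Win 10 or lose 4, played five times.
print(game_triple(10, 4, 5))
```

Two gambles with the same expected value can therefore differ in range, which is why v alone does not fix the perceived risk.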

An inspection of the effects of zero values 
suggests the distributive composition rule. 
First, note that for n = 0 the game becomes 
trivial, and all trivial games are equally 
risky. Hence N must be a multiplicative 
factor with N(0) = 0. Similarly, if both 
r and v equal zero, the game again becomes 
trivial; yet, if v alone equals zero, the 
game may have different degrees of risk, 
depending on r. Consequently, R x V 
(but not V) is also a multiplier, and the 
distributive rule, therefore, is likely to 
be the only appropriate composition rule, 
of all the three-variable simple polynomials. 

In the actual study, three levels of R, 
three levels of V, and two levels of N were 
employed in a factorial design. To diagnose 
the correct composition rule, independence 
(Equation 1) was first investigated. Actu- 
ally, Coombs and Huang (1970) tested a 
more general form of Equation 1, but it is 
evident from their data that independence, 
too, is almost perfectly satisfied (i.e., with 
no more than a single violation) by a majority of the subjects (i.e., 16 out of 28).
Double cancellation was tested only for sub- 
sets with a fixed value of N. Since only two 
levels (1 and 5) of N were used, double can- 
cellation could be tested only for R x V 
planes. Practically no violations of this 
property were observed. 

Following the steps outlined in Figure 4, 
the authors tested joint independence (Equ- 
ation 2). Allowing no more than a single 
violation, it was found that in the risk or- 
derings of the majority of the subjects, R and 
V were jointly independent (of N); V and N were jointly independent (of R); but R and N failed to be jointly independent (of V). The number of subjects (out of 28) that satisfied joint independence (with no more than a single violation) were, respectively, 22, 17, and 7. Finally, distributive cancellation was perfectly satisfied by 21 subjects,
and the overall average number of viola- 
tions was less than one. Thus, the analysis 
clearly supports the distributive rule that 
was initially favored by Coombs and Huang (1970).


FIG. 5. Flowchart for the diagnosis of three-factor simple polynomials in the signed case.


Critical Diagnostic Tests


The ordinal properties introduced thus far were designed not merely to diagnose the appropriate composition rule, but also to provide rather thorough tests of its validity. If we assume that one of the rules is satisfied, then simpler and more efficient critical tests are available for discovering the correct rule. In the first place, the signed properties of the variables usually determine the appropriate rule. For example,
suppose that one of the simple polynomials 
is valid and that the ranges of the measure- 
ment scales of all three factors include zero. 
An element $a_0$ of A is said to be an effective zero for P if $(a_0, u)$ induces equality over P for every u in U, that is, if $(a_0,p,u) = (a_0,q,u)$ for all p, q in P and u in U. In the light of this definition, it is readily seen that in this case, the number of factors containing an effective zero determines the appropriate composition rule. If there are three, the rule is multiplicative; if there are two, the rule is dual distributive; if there is one, the rule is distributive; and if there are none, the rule is additive. Further discussion of the diagnostic problem in the signed case can be found in Krantz et al. (1971, Ch. 7).
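The counting rule just stated can be illustrated on a small artificial structure. In the sketch below, the names `contains_effective_zero`, `rule_from_effective_zeros`, and `cell`, as well as the numerical levels, are assumptions made for illustration; the values in `table` stand in for an observed ordering, and only equalities between them are used. A factor is counted when one of its levels is an effective zero for at least one other factor, which is one reading of "containing an effective zero".

```python
from itertools import product

RULE_BY_ZERO_COUNT = {3: "multiplicative", 2: "dual-distributive",
                      1: "distributive", 0: "additive"}


def cell(i, x, j, y, k, z):
    """Build an (a, p, u) triple with x, y, z placed at positions i, j, k."""
    out = [None, None, None]
    out[i], out[j], out[k] = x, y, z
    return tuple(out)


def contains_effective_zero(table, levels, i):
    """True if some level of factor i is an effective zero for another factor."""
    for j in range(3):
        if j == i:
            continue
        k = 3 - i - j  # index of the remaining factor
        for x in levels[i]:
            # x is an effective zero for factor j if, at every level z of
            # factor k, all levels y of factor j are equivalent.
            if all(len({table[cell(i, x, j, y, k, z)] for y in levels[j]}) == 1
                   for z in levels[k]):
                return True
    return False


def rule_from_effective_zeros(table, levels):
    """Apply the counting rule, assuming some simple polynomial is valid."""
    count = sum(contains_effective_zero(table, levels, i) for i in range(3))
    return RULE_BY_ZERO_COUNT[count]


levels = ([-1, 0, 2], [-2, 0, 1], [-1, 0, 3])  # hypothetical scale values
table = {c: (c[0] + c[1]) * c[2] for c in product(*levels)}
print(rule_from_effective_zeros(table, levels))
```

The result is meaningful only under the stated premise that one of the simple polynomials holds and that every scale includes zero.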
The development of efficient diagnostic 
procedures is not limited to the signed case. 
To demonstrate, we show how to design a 
critical test to choose among simple polyno- 
mials in the positive case. For convenience, 
we label the odd factor, if one exists, U. 
The candidate for the odd factor may be 
identified a priori, or it can be determined 
through the test of joint independence. 
Select treatment conditions so that the effects of a, p, and u are, respectively, larger than the effects of b, q, and v, and such that $(a,p,v) = (b,p,u)$. Now, compare $(a,q,v)$ with $(b,q,u)$. If the additive (or, equivalently, the multiplicative) rule holds, then

\[ A(a) + P(p) + U(v) = A(b) + P(p) + U(u); \]

adding $P(q) - P(p)$ to both sides yields

\[ A(a) + P(q) + U(v) = A(b) + P(q) + U(u), \]

which, in turn, implies $(a,q,v) = (b,q,u)$. (Note that this conclusion follows at once from joint independence.) On the other hand, if the distributive rule holds, then the equality $(a,p,v) = (b,p,u)$ implies

\[ [A(a) + P(p)]\,U(v) = [A(b) + P(p)]\,U(u), \]

or

\[ \frac{A(a) + P(p)}{A(b) + P(p)} = \frac{U(u)}{U(v)}. \]

Since the levels have been selected so that $A(a) > A(b)$, and $P(p) > P(q)$, and all values are assumed to be positive, it follows that

\[ \frac{A(a) + P(q)}{A(b) + P(q)} > \frac{U(u)}{U(v)}, \]

hence

\[ [A(a) + P(q)]\,U(v) > [A(b) + P(q)]\,U(u), \]

which implies $(a,q,v) > (b,q,u)$. Finally, if the dual-distributive rule holds, then $(a,p,v) = (b,p,u)$ implies

\[ A(a)P(p) + U(v) = A(b)P(p) + U(u), \]

or

\[ P(p)[A(a) - A(b)] = U(u) - U(v); \]

and since, by hypothesis, $A(a) > A(b)$, $P(p) > P(q)$, $U(u) > U(v)$, and the values are all positive, it follows that

\[ P(q)[A(a) - A(b)] < U(u) - U(v), \]

hence

\[ A(a)P(q) + U(v) < A(b)P(q) + U(u), \]

which implies $(a,q,v) < (b,q,u)$. Under the above hypotheses, therefore, the additive, the distributive, and the dual-distributive rules can be readily diagnosed depending only on whether $(a,q,v)$ is, respectively, equal to, greater than, or less than $(b,q,u)$.
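The critical test can be tried out numerically. The sketch below is an illustration under assumptions: the function name `critical_test` is hypothetical, the arguments are positive scale values chosen by hand so that the premise holds, and in an actual study the comparison would be read off the observed order rather than computed.

```python
def critical_test(value, a, b, p, q, u, v):
    """Compare (a, q, v) with (b, q, u), given that (a, p, v) = (b, p, u).

    `value` maps three positive scale values to an overall magnitude and
    plays the role of the (unknown) composition rule.
    """
    if value(a, p, v) != value(b, p, u):
        raise ValueError("premise (a,p,v) = (b,p,u) is not satisfied")
    left, right = value(a, q, v), value(b, q, u)
    if left == right:
        return "additive (or multiplicative)"
    return "distributive" if left > right else "dual-distributive"


# Scale values with a > b, p > q, u > v, chosen so that the premise holds.
print(critical_test(lambda A, P, U: A + P + U, a=3, b=1, p=2, q=1, u=3, v=1))
print(critical_test(lambda A, P, U: (A + P) * U, a=3, b=1, p=2, q=1, u=5, v=3))
print(critical_test(lambda A, P, U: A * P + U, a=3, b=1, p=2, q=1, u=5, v=1))
```

A single well-chosen pair of comparisons thus separates the three cases, which is the sense in which the test is critical.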


DISCUSSION 


The procedures developed in the previous 
section do not exhaust the available methods 
for the diagnosis and testing of composition 
rules. Moreover, it can be shown that it is 
impossible, in general, to provide a (finite) 
list of necessary properties that are also 
sufficient, for any composition rule (see 
Scott & Suppes, 1958; Tversky, 1967a). 
Nevertheless, from an empirical viewpoint, 
the proposed independence and cancellation 
conditions are practically sufficient for the 
validity of the various rules, because the instances where these conditions are satisfied while the rule is invalid are quite rare.
Furthermore, as was already mentioned, 
the proposed conditions are actually suffi- 
cient to establish the validity of the rules in 
question, if the factorial structure under 
study is sufficiently rich (i.e., dense), or 
regular (i.e., equally spaced). Contrary to 
some commonly held views that ordinal in- 
formation is too “weak” for measurement, 
modern research has shown, in many forms 
and contexts, that when sufficiently many 
ordinal observations are available, not only 
the theory, but even the numerical scales 
are essentially determined. For rigorous 
proofs of sufficiency, including discussions 
of constructive scaling procedures, the 
reader is referred to Krantz et al. (1971). 
In the present discussion, we only comment 
briefly on scaling and on the problem of 
fallible data, and then discuss the merits of 
the axiomatic versus the numerical ap- 
proach to the composition problem. 


Scaling 


Although the present paper is confined to 
the applications of conjoint-measurement 
theory to diagnosis and testing, it is in- 
structive to examine how the theory is 
employed to construct scales of measure- 
ment. Suppose the additive model holds. 
Since the "origin" of all three scales is arbitrary, we can select any $a_0$ in A, $p_0$ in P, and $u_0$ in U and set $A(a_0) = P(p_0) = U(u_0) = 0$. Furthermore, since one unit is also arbitrary, we can select any $a_1$ in A, with $(a_1,p_0,u_0) > (a_0,p_0,u_0)$, and set $A(a_1) = 1$. Next, we look for some $p_1$ in P and some $u_1$ in U that satisfy the equation $(a_1,p_0,u_0) = (a_0,p_1,u_0) = (a_0,p_0,u_1)$. Under the additive rule, therefore, $A(a_1) = P(p_1) = U(u_1) = 1$. Once these values are determined, we can continue the search for, say, $a_2$ in A that satisfies the equation $(a_2,p_0,u_0) = (a_0,p_1,u_1)$, whence $A(a_2) = 2 = P(p_1) + U(u_1)$, since, in this notation, the scale value of any treatment combination is the sum of its indices. If the factorial structure under study is rich enough so that elements satisfying the desired equations can always be found, then the above process yields measurement scales by marking off equal units along each of the factors. By selecting an
appropriately small unit, the scale value of 
any treatment combination can be obtained. 
(This method is analogous to the familiar 
process of physical measurement, where 
length, for example is measured in terms of 
the number of smaller units (e.g., centi- 
meters) that are needed to "match" the 
length of any given object.) If, however, 
the data are not sufficiently rich to permit 
the above construction, then the scale 
values can only be determined approxi- 
mately in terms of upper and lower bounds. 
Essentially the same constructive method is 
applicable to the other composition rules, 
though the existence of both additive and 
multiplicative factors complicates the pro- 
cedure somewhat. 
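The first steps of this construction can be sketched in code. Everything specific here is an assumption for illustration: the helper names `solve` and `first_scale_steps`, the integer levels, and the hidden function `hidden`, which stands in for the subject's ordering and is consulted only through equivalence comparisons.

```python
def solve(candidates, make_cell, target, value):
    """Return the first candidate x whose combination is equivalent to target."""
    for x in candidates:
        if value(*make_cell(x)) == value(*target):
            return x
    raise LookupError("structure is not rich enough to solve this equation")


def first_scale_steps(value, A, P, U, a0, p0, u0, a1):
    """Mark off the first units on A, P, and U under the additive rule."""
    # p1 and u1 match the unit step a0 -> a1 on the other two factors.
    p1 = solve(P, lambda p: (a0, p, u0), (a1, p0, u0), value)
    u1 = solve(U, lambda u: (a0, p0, u), (a1, p0, u0), value)
    # a2 matches the combined effect of p1 and u1, so A(a2) = 1 + 1 = 2.
    a2 = solve(A, lambda a: (a, p0, u0), (a0, p1, u1), value)
    return {a0: 0, a1: 1, a2: 2}, {p0: 0, p1: 1}, {u0: 0, u1: 1}


def hidden(a, p, u):
    """Hypothetical additive structure, unknown to the scaling procedure."""
    return 2 * a + p * p + 4 * u


scales = first_scale_steps(hidden, range(6), range(6), range(4),
                           a0=0, p0=0, u0=0, a1=2)
print(scales)
```

When no candidate solves an equation, the sketch raises an error; this corresponds to the case in which the data permit only upper and lower bounds on the scale values.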

Obviously, the proposed measurement 
can be carried through only when the com- 
position rule is known. For example, the 
very ordinal properties (joint independence) that are used to test the additive rule
are required to ensure that the above con- 
struction is consistent. For example, the 
element as of A can be defined as the solu- 
tion of the equation (as,po4o) = (ay pitt), 
or it can be defined as the solution to the 
equation (aspe) = (apii), which fol- 
lows from the first equation by joint inde- 
pendence. If, however, joint independence 
does not hold, there is no reason to expect 
that the same element of A, that is, $a_2$, will satisfy both equations. The necessary diagnostic properties, therefore, play a major
role in the construction of the measurement 
scales. Furthermore, they can be tested in 
the actual scaling process by checking that 
the values obtained by solving the various
equations are, indeed, consistent. 


Fallible Data 


A major problem encountered in testing 
ordinal properties is the presence of error. 
If the data are infallible, then the necessary 
properties can be routinely checked and the 
test yields an unequivocal conclusion. 
Typically, however, the data are regarded 
as fallible in the sense that they reflect ran- 
dom fluctuations or sampling errors, in 
which case the testing problem is considerably more complicated. Independence, for example, may be violated by the ordering estimated from the data, merely as a consequence of sampling error. Any adequate evaluation of the validity of some composition rule, therefore, should take the error problem into account.

Two general approaches to this problem are available. First, we may collect sufficiently many observations to ensure the
stability of the observed order and thereby 
reduce the probability of an erroneous con- 
clusion to some acceptable low value. Al- 
though it is always desirable to obtain a 
more stable order, the above approach may 
be quite costly. Alternatively, we may in- 
terpret the ordinal properties as statistical 
hypotheses, and test them accordingly. For 
example, independence may be reformulated probabilistically as $\Pr[(a,p,u) > (b,p,u)] \ge \tfrac{1}{2}$ if and only if $\Pr[(a,q,v) > (b,q,v)] \ge \tfrac{1}{2}$, where Pr can be defined in terms of the
proportion of times (or subjects) for which 
the inequality holds. For pairs of treat- 
ment combinations that are statistically in- 
dependent, the above hypothesis can be 
tested using standard statistical tests such 
as a likelihood ratio test. If the treatment
combinations cannot be treated as statistic- 
ally independent, then more complicated 
statistical methods have to be employed. 
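The probabilistic reformulation of independence can be made concrete with observed proportions. The sketch below is descriptive only and is not a statistical test; the function names and the coded observations (1 when the first combination is ranked higher, 0 otherwise) are assumptions introduced for illustration.

```python
def proportion_preferring(choices):
    """Proportion of observations in which the first combination ranked higher."""
    return sum(choices) / len(choices)


def independence_holds(choices_at_pu, choices_at_qv):
    """Check Pr[(a,p,u) > (b,p,u)] >= 1/2 iff Pr[(a,q,v) > (b,q,v)] >= 1/2.

    Each argument is a list of 0/1 outcomes for a versus b, collected at
    the fixed levels (p, u) and (q, v), respectively.
    """
    return ((proportion_preferring(choices_at_pu) >= 0.5)
            == (proportion_preferring(choices_at_qv) >= 0.5))


print(independence_holds([1, 1, 0, 1], [1, 0, 1, 1]))
print(independence_holds([1, 1, 1, 0], [0, 0, 1, 0]))
```

Whether an apparent violation exceeds what sampling error would produce is a separate question, to be settled by a test such as the likelihood ratio test mentioned above.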


Axiomatic versus Numerical Approach 


The physical measurement of fundamen- 
tal attributes such as mass, length, or time 
is based on certain qualitative laws (i.e., 
axioms) that lead to the construction of nu- 
merical scales on the basis of qualitative, or 
ordinal, observations. In contrast, most of 
the work (e.g., Luce, 1959; Stevens, 1957; 
Thurstone, 1927) on the measurement of 
psychological attributes such as loudness, 
intelligence, or subjective value has been 
based on some given numerical scales that 
are either produced by subjects (e.g., rating 
scales, magnitude estimates), or that describe, in physical units, some aspects of their behavior (e.g., choice probabilities, test scores). Similarly, most of the work on the composition problem in psychology (e.g., Anderson, 1970; Goldberg, 1968; [illegible], 1964; Thurstone & Jones, 1957) has been based on the assumption that the numerical values of the dependent variable
can be regarded as interval-scale measures 
of the relevant psychological attribute. 

Like the classical (extensive) theory of 
physical measurement, the theory of con- 
joint measurement is based on the analysis 
of qualitative laws. Unlike the physical 
theory, however, that is concerned with the 
laws that govern single attributes, the the- 
ory of conjoint measurement is concerned 
with the qualitative laws that govern the 
composition of several attributes. Inherent 
in the conjoint-measurement approach is 
the view that the importance of measure- 
ment (as distinct from other usages of nu- 
merical indices) lies in the qualitative laws 
on which it is based, and that the usefulness 
of numerical scales derives from the empiri- 
cal lawfulness that they reflect. Accord- 
ingly, the measurement is regarded as a con- 
sequence, or a by-product, of the validity of 
some qualitative laws, and major emphasis 
is placed on the study of these laws. This 
view has not been shared by all. Some 
(e.g., Adams, 1966) proposed that the im- 
portance of numerical scales in science far 
transcends the axioms that characterize 
them, while others (e.g., Anderson, 1970) 
argued that it is preferable to study composition rules in terms of numerical values rather than in terms of qualitative properties.

In discussing the merits of the numerical and the axiomatic approaches to the study of composition rules, we submit that they should be regarded as complementary rather than competing: The two methods of analysis are likely to highlight different aspects
of the data, and both analyses may prove 
useful in various contexts. With respect to 
the numerical approach, it should be noted 
that in some situations the numerical values 
of the dependent variable obey the proposed 
composition rule, to a satisfactory degree of 
approximation (e.g., Anderson, 1970; Gulliksen, 1956; Hoffman, 1960; Rimoldi, 1956; Spence & Guilford, 1933). In such instances, the dependent variable can be regarded as an adequate measure of the relevant psychological attribute, and the measurement problem is greatly simplified, because standard statistical procedures (e.g., linear regression, analysis of variance) can be used not only to test the validity of the proposed composition rule, but also to obtain
estimates of the contributions of the various 
treatment conditions. It should be added, 
however, that in order to demonstrate the 
validity of some proposed composition rule, 
the statistical test must have adequate 
power. Otherwise, it may fail to reject a 
composition rule because of noisy data. 
Unfortunately, this important factor is all 
too often neglected. 

If the numerical values of the dependent 
variable do not decompose according to the 
proposed composition rule, it may be either 
because the rule is invalid, or because the 
numerical scale is inadequate. At this 
point, an ordinal analysis is required to 
determine which of the possibilities obtains. 
The analysis may consist of testing the 
ordinal properties (i.e., axioms) that are 
necessary for the composition rule under 
study, along the lines described earlier in 
this paper. 

An alternative to testing ordinal proper- 
ties is to search for an appropriate mono- 
tonic transformation of the dependent vari- 
able that best satisfies the proposed rule, 
and then evaluate the correspondence be- 
tween the proposed rule and the rescaled 
data. The desired monotonic transforma- 
tion may be selected on the basis of an in- 
formal inspection of the data, or it can be 
constructed in a systematic fashion, such as 
by using an iterative computer program to 
solve a system of simultaneous inequalities 
dictated by the data. The availability of 
high-speed computer programs facilitates 
the applicability of this method, yet the 
problems of finding the most appropriate 
transformation and evaluating its goodness 
of fit do not have simple satisfactory solu- 
tions. These problems have been investi- 
gated particularly in the case of the (simple) 
additive rule, where some useful scaling 
methods and statistical testing procedures 
have been developed, for example, Box and 
Cox (1964) and Kruskal (1965). Like other
scaling techniques, these methods always 
provide numerical estimates of the scale 
values, regardless of whether the rule in 
question is valid or not. Hence, if one is 
primarily concerned with scaling, rather
than with diagnosis, the numerical methods
are recommendable. 

On the other hand, if one is primarily 
interested in psychological theory, rather 
than in scaling, the axiomatic approach has 
much to offer. It usually provides not only
an explicit and precise formulation of the 
theory, but often a deeper and a more pene- 
trating analysis of the structure under 
study. The assumption that the orders of 
the effects of drive, habit strength, and in- 
centive are all jointly independent, for ex- 
ample, is psychologically more meaningful, 
and perhaps more plausible, than the as- 
sumption that response strength is a multi- 
plicative function of drive, habit strength, 
and incentive. Similarly, the qualitative 
assumptions of expected-utility theory (e.g.,
transitivity, the extended sure-thing princi- 
ple) provide a much more satisfactory 
formulation of decision making under un- 
certainty than the assertion that people 
behave as if they assign utilities and sub- 
jective probabilities to outcomes and events, 
respectively, multiply the utility of each 
outcome by its subjective probability of 
occurrence, and then choose the alternative 
for which the sum of these products is maxi- 
mal. Although the axiomatic and the nu- 
merical formulations of the multiplicative 
or the expected-utility rules are equivalent, 
the former suggests a more plausible (nonnumerical) psychological mechanism. In
this respect, it explicates the proposed nu- 
merical rule in terms of more fundamental, 
and often more transparent, principles.

The axiomatic analysis of the composition 
problem isolates the essential ordinal properties which underlie the various composition rules. These properties, therefore, can
be used to test the validity of the various 
rules in a measurement-free manner. Fur- 
thermore, they can be used to design critical 
tests for choosing among alternative rules, 
as demonstrated in the previous sections.
The major practical advantage of the direct
test of axioms over the general evaluation of 
numerical correspondence is that, in gen- 
eral, it provides a more powerful diagnostic 
test between alternative composition rules. 
The multiplicative and the distributive 
rules, for example, are so close to each other, in the positive case, that with fallible data
it is extremely difficult to distinguish be- 
tween them on the basis of any overall mea- 
sure of goodness of fit. A comparison of the 
two rules in terms of joint independence, on 
the other hand, is likely to provide a much 
sharper and more diagnostic test, because it 
focuses on a critical distinguishing property. 
Furthermore, the test of critical properties 
is typically a more efficient method than an 
overall test of goodness of fit, in order to 
discover which aspects of a theory, if any, 
are at fault. An evaluation of goodness of 
fit of a numerical model can indicate that 
the theory is violated, but unlike the test of 
specific axioms, it is not very likely to locate 
the source of the discrepancy. The failures 
of joint independence in the color space 
(Krantz, in preparation), or transitivity in 
choice behavior (Tversky, 1969), or of the 
extended sure-thing principle (MacCrim- 
mon, 1968) in decision making under un- 
certainty are all examples of significant 
empirical findings that were revealed by the 
direct test of critical axioms, and that were 
practically impossible to detect from global 
analyses of goodness of fit of the corre- 
sponding numerical models. 

In summary, the conjoint-measurement 
approach, exemplified in this paper, offers 
a new methodology for the study of com- 
position rules, with potential for deeper 
theoretical analysis and sharper empirical 
tests.


LETTERS TO THE EDITOR


PREPAREDNESS AND STRUCTURE 


Seligman (Seligman, M. E. P. On the gen- 
erality of the laws of learning. Psychological 
Review, 1970, 77, 406-418) presents convincing 
arguments for the position that the laws of 
learning are not general, but rather vary with 
the organism's preparedness for a particular 
association. Herein, he succeeds in his stated 
goal of bringing animal learning concepts into 
the mainstream of psychological interest. 
Associative preparedness is similar to the hu- 
man skill learning concept of s-r compatibility 
(Fitts, P. M. Skill learning. In A. W. Melton 
(Ed.), Categories of human learning. New 
York: Academic Press, 1964). But more can 
be done. 

Seligman ties preparedness to evolutionary history: inherited biological structure. Associability must depend on structure, but personal as well as evolutionary history is involved. Encounters between the organism and his environment determine the direction of development of inherited structures (e.g., Held, R., & Hein, A. Movement-produced stimulation in the development of visually guided behavior. Journal of Comparative and Physiological Psychology, 1963, 56, 872-876). In humans, the dependence of associability on learned structures is now recognized (Mandler, G. Verbal learning. In T. Newcombe (Ed.), New directions in psychology: III. New York: Holt, Rinehart & Winston, 1967). Underwood and Schultz (Underwood, B. J., & Schultz, R. W. Meaningfulness and verbal learning. Philadelphia: Lippincott, 1960) point out that a major factor in paired-associate learning is the meaningfulness of the relation between stimulus and response. Meaningfulness is a learned relation.

The challenge, then, is to provide an empirically testable account of the development of structure. Piaget's theory is an example. Furthermore, the nature of specific structures could be used as the basis for predicting associability, [...]ples: Can Garcia's conditioned illness data and Thorndike's scratching data both depend on a psychobiological structure involving sharp differentiation between events outside, on the periphery, and inside the organism? If so, what kinds of experiences common to most mammals contribute to the development of this structure?

The advantage of this approach is that it provides an alternative between extremes: tremendously specific inherited preparednesses for all possible associations and the oversimplified one-kind-of-learning view which Seligman justly criticizes.


SPECIFIC HUNGERS AND POISON AVOIDANCE AS
ADAPTIVE SPECIALIZATIONS OF LEARNING 1


PAUL ROZIN 2 AND JAMES W. KALAT 3


University of Pennsylvania


Learning and memory are considered within an adaptive-evolutionary framework. This viewpoint is illustrated by an analysis of the role of learning in
thiamine specific hunger. Consideration of the demands the environment 
makes on the rat and the contingencies it faces in the natural environment, 
appreciation of the importance of the novelty-familiarity dimension for 
these animals, and the realization of two new principles of learning, permit 
a learning explanation of most specific hungers. The two new principles, 
"belongingness" and "long-delay learning" specifically meet the peculiar 
demands of learning in the feeding system. In conjunction with the importance 
of the novelty dimension, they are discussed in an attempt to develop the laws 
of taste-aversion learning. It is argued that the laws or mechanism of learning 
are adapted to deal with particular types of problems and can be fully under- 
stood only in a naturalistic context. The “laws” of learning in the feeding 
system need not be the same as those in other systems; manifestation of a 
learning capacity in one area of behavior does not imply that it will be accessible in other areas. This notion leads to speculations concerning the evolution
and development of learning abilities and cognitive function. Full under- 
standing of learning and memory involves explanation of their diversity as 
well as the extraction of common general principles. 


1 This manuscript is an expanded version of an invited address by the first author to APA Division 6 in 1969. The preparation of this article and some of the research reported in it were supported by National Science Foundation Grant GB 8013. We would like to thank John Garcia, Henry and Lila Gleitman, Elisabeth Rozin, Martin Seligman, W. J. Smith, and Richard Solomon for their helpful comments and criticisms.

2 Requests for reprints should be sent to Paul Rozin, Department of Psychology, University of Pennsylvania, Philadelphia, Pennsylvania 19104.

3 Supported by National Institute of Health training grant. Now at Duke University.


The application of the basic principles of evolution and adaptation to learning and memory offers a hopeful means of conceptualizing and ordering the increasingly diverse data of these fields. Learning and memory, being the result of natural selection, should be expected to be best developed in situations and species where other solutions to the problems at hand are less adaptive. Furthermore, when learning or memory capacities are brought to bear on a particular type of problem or situation, it stands to reason that these capacities should be shaped by or adapted to the situation.

We propose to treat learning and memory as any other biological characteristic, subject to natural selection and therefore adapted to handle specific types of problems. Insofar as an important environmental problem (e.g., obtaining adequate foods) presents unique demands or contingencies, we would expect to find appropriate modifications of learning and memory abilities, closely articulated with one another and with the natural behavior of the organism. Biologically speaking, there is no reason to assume that there should exist an extensive set of generally applicable laws of learning, independent of the situation in which they are manifested. This is not to say that there might not be some general laws, at a minimum resulting from basic constraints and features of the operation of the nervous system, and perhaps reflecting general principles of causality in the physical world. However,
if we look at learning within an adaptive- 
evolutionary framework, we should seek 
not only to uncover some of the greatest 
common denominators among the be- 
haviors we study, but also to explore the 
plasticity of the mechanisms themselves, 
as they are shaped through selection to 
deal with particular types of problems. 
For many years, the leading ethologists 
(e.g., Lorenz, 1965; Tinbergen, 1951) have 
espoused a position consistent with this. 
They have emphasized the importance of 
considering learning within a naturalistic 
context; learning is viewed as being 
genetically programmed to occur at specific 
points in an ongoing behavior sequence. 
The thesis presented here is in harmony 
with the ethologists’ position, but em- 
phasizes differences in learning mechanisms 
themselves, as a function of the situations 
in which learning occurs. 

In the face of recent evidence, coming most significantly from the work of Garcia and his colleagues (Garcia, Ervin, & Koelling, 1966; Garcia & Koelling, 1966), challenging two important generalities in learning, a framework for ordering the suggested diversity of laws of learning is desirable. We propose such a framework in terms of the notion of adaptive specializations. Consideration of learning from the adaptive point of view offers two advantages: (a) it provides a heuristic, for ordering and predicting the types of learning one will see in different situations, and (b) it provides a type of explanation of behavior, in that one aspect of understanding [illegible] can be considered to be explication of its adaptive value.

Adaptive considerations have proven useful in many areas of biology and are a significant part of our understanding of
sensory psychology and physiology. Spe- 
cializations in sensory function, as shown 
in the classic works of Lettvin, Maturana, 
McCulloch, and Pitts (1959) in the visual 
system of the frog, Capranica (1966) on 
the remarkable tuning of the frog auditory 
system to mating calls, and Roeder (1963) 
on detection of bat sonar by the moth 
auditory system, can be fully understood 
only in the context of adaptations to 
natural problems. The incredible diversity 
of the visual system (e.g., variations in the
proportion of rods and cones in the retina 
as a function of species and ecology) can 
be ordered and comprehended in a phy- 
logenetic adaptive framework, as shown 
in the classical work of Walls (1942). We 
would like to suggest that there are similar 
possibilities in the area of learning and 
memory. 

In the major portion of this paper, we 
shall discuss specific hungers and poisoning 
in rats. We shall argue that some basic
features of learning and memory as applied
to food selection in the rat are strikingly
different from features characterizing the
rat's learning in more traditional laboratory
situations, that these differences make
sense in terms of evolutionary adaptation,
and that an understanding of the role of
learning and memory in food selection involves
an elucidation of specifically adapted
learning mechanisms and an integration
of these with genetically determined behavior
patterns.


In the remaining portion of the paper, we 
shall discuss whether the feeding system 
is a unique case or representative example, 
relate our position to somewhat similar 
theoretical positions, including ethology 
and the recent papers of Garcia (Garcia & 
Ervin, 1968; Garcia, McGowan, & Green, 
1971), Revusky (Revusky, in press; Re- 
vusky & Garcia, 1970), Seligman (1970), 
Lockard (1971), and Shettleworth (1971), 
and discuss the implications of our position 
in the areas of comparative psychology of 
learning and human function. 


ADAPTIVE SPECIALIZATIONS OF LEARNING 


SPECIFIC HUNGERS 


Basic Phenomenon and Problems with a 
Learning Interpretation


The phenomena of specific hungers were 
beautifully demonstrated in the classical 
work of Richter (1943). The question 
raised by this work was, How is the animal 
(in particular, the rat) able, when deficient 
in a particular nutrient, to select those 
foods containing it? In the case of sodium, 
it is quite clear that sodium deficiency re- 
leases an innate preference for substances 
containing sodium (Nachman, 1962; Richter,
1956; Stricker & Wilson, 1970). However,
it is hard to believe that the rat comes
equipped with prewired recognition systems 
for each of the many substances for which 
it can show a specific hunger. The al- 
ternative view holds that rats can learn 
about what foods "make them better" 
when they are ill and in this manner come 
to select the proper nutrients (Harris, 
Clay, Hargreaves, & Ward, 1933; Scott 
& Quint, 1946). Rats would, in their 
lifetime, learn preferences for the tastes of
only those foods associated with recovery 
from the deficiencies they happen to have 
experienced. This notion has the distinct 
advantage of simplicity in that it accounts 
for all of the specific hungers (except sodi- 
um) with one basic mechanism. 

There was, in fact, some evidence for a 
learning interpretation of specific hungers. 
Scott and Verney (1947) offered a dis- 
tinctively flavored vitamin supplemented 
food and an unflavored deficient food to 
deficient rats. After a preference developed 
for flavored, enriched food, the flavor was 
switched to the deficient food. By and 
large, rats now preferred the flavored defi- 
cient food, suggesting a learning mechanism. 

The problem with a learning interpreta- 
tion, so far as psychologists were concerned, 
was the long delay of reinforcement. The 
recovery (reward) effects produced by an 
ingested nutrient would not occur until 
many minutes or hours after ingestion. 
This poses serious problems for a learning
mechanism within the context of traditional
learning theory. If specific hungers were
in fact learned, then some new type of
learning had to be involved. This served
as the impetus for our investigation of 
specific hungers. 

The first demonstration of vitamin B 
hunger was that of Harris et al. (1933), 
who found that given a choice of three 
foods, one supplemented with B vitamins, 
B vitamin deficient rats would quickly 
come to choose the correct food. For 
convenience, Rozin, Wells, and Mayer 
(1964) decided to study a more simplified 
situation. One vitamin, thiamine, was 
selected, since a clear specific hunger for 
thiamine had been reported (Richter, 
Holt, & Barelare, 1937; Scott & Quint, 
1946). The most pronounced symptoms 
of thiamine deficiency in the rat are weight 
loss and anorexia. (More details about this 
and other aspects of thiamine hunger are 
available in a review by Rozin, 1967a). 
Following injection or ingestion of thiamine, 
deficient rats show clear signs of reversal 
of symptoms within a few hours, at most. 
The basic design consisted of raising wean- 
ling rats on a thiamine deficient diet for 
21 days, at which point they showed clear 
signs of deficiency. At this point they 
were offered a choice between this deficient 
diet and the same diet supplemented with 
thiamine. 

Thiamine deficient animals strongly pre- 
ferred the thiamine enriched choice, while 
control animals who were not deficient, but 
otherwise treated identically, did not. A 
few rats with a choice between deficient 
and highly enriched diet showed no clear 
preference during the first few choice days, 
but in the meantime ingested great amounts 
of thiamine and showed marked recovery. 
Subsequently, these rats developed a clear 
preference for the thiamine rich diet. It 
seemed highly unlikely that a rat would 
show a preference for what it would have 
needed a few days ago without showing 
that same preference when the thiamine
presumably had reinforcing value. This 
anomalous observation was confirmed in an 
experiment in which rats were made 
deficient in thiamine and then recovered 
from deficiency by injection of high 
amounts of thiamine while consuming the 
deficient diet. After a period of recovery,
rats were offered a choice of thiamine
enriched and deficient diets. Preferences
for the thiamine enriched diets in rats that
had been deficient emerged at all the recovery
intervals studied (Rozin, 1965).

This result raises a second problem for a
learning interpretation of thiamine hunger:
How could rats develop a learned preference
for something that, at the time the
preference appeared, had no known reinforcing
effects? If the preference developed
because of the initial reinforcing effects
of the thiamine, why did it not appear
until much later? We shall call this
problem Preference after recovery.

Careful consideration of the specific
hungers situation and contingencies gave 
rise to additional problems that could not 
be easily solved within the traditional learn- 
ing framework. Given that a rat might 
learn with the delays of reinforcement that 
seem to hold here, how would the recovery 
be specifically associated with a particular 
food? If the animal had eaten from a 
number of the choices available (or both 
choices in the two-choice situation), how 
could one of these choices, presumably the 
one containing thiamine, show a specific 
increase in preference? In other words, 
if an ingestion of two or more foods is 
followed by recovery, how does one of these 
foods acquire positive properties? We 
shall call this third problem, Which food?
A logical extension of this notion raises
the question of how food or the eating of
food becomes associated with recovery 
when many other potential candidates for 
association exist in the environment. After 
all, following ingestion of foods including 
the vitamin enriched food, and before 
recovery can ensue, the rat performs many 
acts, such as chewing, grooming, exploring, 
and sleeping, and receives many stimuli. 
By what principle does he elect to respond 
to food stimuli instead of other stimuli? 
We call this fourth problem, Why food? 
We have, then, four problems in the
application of traditional learning principles
to the explanation of specific hungers:

1. Delay.
2. Preference after recovery.
3. Which food?
4. Why food?


Reinterpretation of Specific Hungers 


The first step in the resolution of the 
problem as formulated came with the 
appreciation of the importance of novelty 
in the determination of food preferences 
in deficient rats (Rodgers & Rozin, 1966). 
Deficient rats show an immediate and 
almost complete preference for new foods, 
even when the new food is thiamine 
deficient and the old food has a thiamine 
supplement. In this case, the new-food 
preference reverses after a few days. The 
results suggested that the “specific hunger” 
for thiamine may be simply a reflection of 
the thiamine deficient rat’s preference for 
new foods: If the new food contains 
thiamine, a learned preference could de- 
velop (mechanism still unknown). 

Rodgers (1967a) succeeded in demonstrating
quite conclusively that there is no
specificity to thiamine specific hunger.
Reasoning that the novelty response might
overwhelm an existing tendency to select
the vitamin, Rodgers offered deficient rats
the choice of a deficient novel diet or the
same novel diet supplemented with thiamine.
The usual strong specific hunger
did not appear. Furthermore, when the
choice was between two different novel
diets, one enriched, preferences for the
enriched source did not develop rapidly.
Finally, separate groups of thiamine deficient
rats and pyridoxine deficient rats
were offered their basal diet in two forms:
one supplemented with thiamine, the other
with pyridoxine. If there were any specificity,
one would expect each group to
show a preference for the vitamin that
would produce recovery. There were no
significant differences between the groups.

There seemed to be two types of explanations
for the novelty effect. One is
that the deficiency experience triggers an
innately programmed "neophilia." A more
attractive hypothesis is that the novelty
effect comes about through learning. The
paradigms described all pit the new food
against the familiar food that has been
associated with deficiency. The new-food
preference could be a reflection of an
aversion to the old food. Put more colloquially,
we could ask whether preferences
for new foods appear because the rat
“loves the new stuff or hates the old
stuff.” The issue can be clarified by
watching the behavior of deficient animals 
toward different foods (Rozin, 1967b). 
Rats housed in relatively large individual 
cages were observed during the first 15 
minutes of an 8-hour food presentation 
each day. Their responses were classified
into a number of categories, such as groom- 
ing, sleeping, eating, and chewing. The 
experimental rats were placed on a thi- 
amine deficient diet. As deficiency de- 
veloped, less time was spent eating during 
the first 15 minutes, and two striking be- 
haviors, quite rare in normals, appeared. 
One was spillage of the food: the rat would 
approach the food cup, sniff at the food, 
and then paw at the food in a scooping 
motion, spilling it out of the cup, so that 
it fell through the wire mesh floor. (This 
spillage phenomenon in deficient rats was 
actually quite familiar, since we have 
always had some difficulty measuring the 
food intake of deficient rats on this ac- 
count.) The other new behavior was what 
we call “redirected feeding.” Following 
this initial investigation of the food cup, 
rats would occasionally move over to the 
wooden barrier separating the nest area 
from the rest of the cage and begin to chew 
it vigorously. They would also, occasion- 
ally, chew on the cage wires. These two 
behaviors suggested an aversion to the 
familiar diet. The redirected feeding sug- 
gests conflict between desire to eat and 
the aversiveness of the food offered. 
Spillage is often seen in normal rats with 
highly unpalatable foods, such as quinine 
adulterated diets.

When offered the old deficient food in a
new container (metal instead of glass) in
a new location, these deficient rats showed
little potentiation of eating and continued
to show the behavior described above.
Apparently the vessel and its location were
not controlling the aversion. However,
when a completely new deficient food was
offered in the familiar vessel and location,
uninterrupted avid eating ensued, suggesting
that the aversion was specific to
the food. The deficient rats were subsequently
allowed to recover on a new,
vitamin enriched diet. Following one week
of recovery, and after a 16-hour period of
food deprivation, they were presented again 
with the familiar deficient food for the 
first time since the onset of recovery. These 
food deprived rats, showing no signs of 
deficiency at this time, responded to the 
familiar deficient food as they had before, 
with minimal ingestion and occasional 
spillage and redirected feeding. In this 
case, a "normal" hungry rat prefers staying 
hungry to eating its original diet which 
provides, in fact, perfectly adequate nutri- 
tion for him at the time, and is normally 
quite palatable. Preference of hunger
(eating nothing) to ingestion in hungry
rats and the similarity in the rat's behavior
toward deficient and highly unpalatable
(quinine adulterated) diets suggest
strongly that we are dealing with an
aversion to the familiar food.

The learned aversion interpretation 
places specific hungers in a new perspec- 
tive. We can consider the deficient diet 
as a CS and the nausea or other ill effects 
produced by its ingestion as a UCS. 
Presumably, the classically conditioned 
"ill effects" lead to avoidance of the 
familiar food. The mechanism suggested 
for the specific hunger phenomenon, as 
seen in the standard two-diet choice, would 
then be that the rat learns an aversion to 
the familiar deficient food. Before the 
time of choice, he has already done a most 
significant part of his learning: He knows 
what not to eat. The initial preference 
for the new food follows. Its maintenance 
when the new food is enriched could be 
accounted for as an additional learned 
preference for the new food or as a failure 
to develop an aversion to it (see section 
on learned preference).

To the extent that specific aversions 
play a key role in specific hungers, there 
is an obvious parallel between specific 
hungers and poisoning. Both involve 
learned aversions; vitamin deficient diet 
is a slowly acting poison. The aversion 
experiment thus suggests that these two 
sets of phenomena are closely related.
Furthermore, the same basic problems
raised here (Delay, Which food? Why
food?) arise with respect to poisoning.
Since poisons are designed and synthesized
by man, it is unreasonable to hold that
rats show innate aversions to them. In 
comparing the literature on poisoning (see 
Barnett, 1963; Richter, 1953; Rzoska,
1953) to that on specific hungers, there is 
one apparent contrast. We have reported 
an increased preference for new foods in 
white rats, while the poisoning literature 
which focuses on wild rats strongly in- 
dicates the opposite: an exaggerated neo- 
phobia following poisoning. That is, wild 
rats, who show a much greater base-line 
tendency to avoid new objects or events 
than do white rats (e.g., see Galef, 1970), 
show a further exaggeration of this tend- 
ency, often to an extreme (Richter, 1953) 
following poisoning experiences. This dis- 
parity can be accounted for as a pro- 
cedural difference, since the new-familiar 
choices we offered to our domestic rats 
were different from those usually offered 
poisoned wild rats. In particular, in the 
novelty experiments, which were done 
before we realized that specific aversions 
were involved, the rat was offered a choice 
between a familiar food associated with 
deficiency and a new food. Even if the 
white rat were somewhat neophobic, this 
might not appear since the alternative 
choice was a strongly aversive familiar diet. 
In order to provide a meaningful com- 
parison between poisoning and specific 
hungers, both sets of phenomena would 
have to be demonstrated under the same
sets of conditions and in the same strain 
of rats. In an experiment meeting these 
criteria and comparing half-wild and do- 
mestic rats, a paradigm was employed that 
allowed fuller expression of the rat's neo- 
phobic or neophilic tendencies. Rats were 
raised on Diet A, prior to induction of 
aversive consequences. Deficiency or poi- 
soning (or nothing in the case of controls) 
occurred in the presence of Diet B. In 
the final test, rats were offered these two 
diets and a completely new one, Diet C.
Therefore, rats were faced with a choice
among a familiar safe diet (A), a familiar
aversive diet (B), and a completely new
diet (C) (Rozin, 1968).

The single most important result is that
all rats suffering poisoning or deficiency
showed an increased preference for the
familiar safe food, that is, a neophobia.
Half-wild rats showed a stronger neophobia 
following the aversive experience, but 
half-wild controls also showed a higher 
base-line neophobia. All experimental rats 
almost completely avoided the familiar 
aversive food (B), but ate some of the 
completely new food. There were no 
major differences between the specific 
hunger and poisoning groups. Therefore, 
it appears that we can consider specific 
hungers as a parallel to poisoning. Again, 
the behavior makes sense in an adaptive 
framework: following an unpleasant food- 
related experience, the rat tends to return 
to a known, safe food. 


Resolution of the Inconsistencies with a 
Learning Interpretation 


The specific aversion explanation of 
specific hungers, the realization of the 
importance of the familiarity-novelty di- 
mension, and the appearance of two 
articles of major significance by Garcia 
and his colleagues enable us to resolve 
the four basic problems with a learning 
interpretation. 

Preference after recovery. This can be
accounted for as a retained aversion to the
familiar food. Rats made deficient in
thiamine and then recovered by injection
for 12 hours or 3 days (while continuing
to eat the same deficient diet) show a
strong initial novel-food preference when
offered a novel-familiar food choice, with
thiamine absent from both choices (Rozin &
Rodgers, 1967). In these experiments,
recovery took place in the presence of the
familiar food. The existence of a novelty
effect suggests that the aversion had not
fully extinguished by the time of testing.
It is significant in this regard that in most
of our experiments (some unpublished)
the recovery effect showed some diminution
as time of recovery increased. The
recovery effect should be maximal when
recovery is associated with a new food, so
that the old diet aversion does not extinguish.
Strong "recovery" effects are
in fact seen under such circumstances in
the aversion experiment, with a single
choice (Rozin, 1967b) and in typical
two-choice experiments with long recovery
periods (Rozin, unpublished data).

Which food? In the "standard" two-choice
situation, this problem is easy to
deal with; the rat learns to avoid the 
deficient food when it is the only diet 
available. There is no Which food? 
problem here. Because of this learned 
aversion, rats show an immediate, vir- 
tually complete preference for the novel 
food, thus making it possible for them to 
learn about its consequences. 

The testing situations described up to 
this point, with the partial exception of 
the testing environment in the aversion 
experiment, have been very limited, well 
defined, and unnatural. Rats normally 
live in a much larger area than was permitted
to them in these studies, have an
elaborate social life, and seldom face 
simple binary food choices. It is not un- 
reasonable to assume that one or only a 
few foods might be available to the rat 
during the deficiency period in nature, 
since with a wide variety of foods available, 
a deficiency would be quite unlikely. In 
their classic work on B vitamin hungers, 
Harris et al. (1933) offered deficient rats 
a choice among a large variety of foods, 
with only one containing B vitamins in 
significant amounts. By and large, they 
found that with large numbers of choices 
(6-10 foods), rats were unable to select 
the vitamin enriched source, and they 
found it necessary to "educate" the rats 
by offering only the enriched food for a 
period of days in order to establish a main- 
tained preference for this food. Examina- 
tion of the day-to-day intake of these 
animals before the education period sug- 
gests a distinctly nonrandom food selection 
pattern: food intake seemed to be restricted 
to only a few of the many choices on any 
particular day. This seemed to be an 
adaptive way of unconfounding the situa- 
tion and suggested an analysis of the meal 
patterns of deficient rats faced with the 
choice of a number of new foods (Rozin, 
1969a). A thiamine supplement was placed 


in one of four diet choices for each rat. 


Feeding was restricted to eight hours a day, 
and the food intake from each cup was 
measured at hourly intervals. Four of the 
10 deficient rats studied developed clear 
preferences for the enriched choice within 
a few days, and two other rats developed 
strong preferences for two of the four 
choices, where one of the preferred choices 
was enriched. Analysis of the meals in- 
dicates a characteristic pattern, both on a 
daily and hourly basis. Meals, except for
an initial daily sampling of many or all of 
the choices, tended to be restricted to one 
food. The rat’s feeding pattern maximizes 
the possibility of associating each diet 
with its appropriate consequences, since 
meals tend to be isolated in time and to 
consist of a single food. The emergence of 
a strong preference for the enriched food is, 
in each case, preceded by a clearly defined 
meal of that food. Furthermore, no rat 
failed to develop a clear enriched food pref- 
erence if it ate an isolated meal of at least 
1 gram from the enriched food. The rats 
that failed to show adaptive preferences 
in the initial part of the experiment failed 
to sample significant amounts of the en- 
riched food. Similar, though less well 
defined, sampling patterns are seen in 
normal rats. Normals mix meals more, 
but this may be a direct consequence of the 
fact that their meals are larger. The 
anorexia of deficiency may, in and of itself, 
exaggerate in an adaptive way the normal 
feeding pattern of the rat. It is clear that 
we have here another part of the answer 
to the Which food? problem.

Finally, social factors have become implicated
in food selection in some significant
recent experiments. Galef and
Clark (in press) studied responses to poison- 
ing in colonies of wild rats observed in the 
laboratory under seminaturalistic condi- 
tions. They offered a group of wild rats 
two foods, one of which was poisoned.
After a short period of time, all rats
avoided this food. The poison was removed,
but the rats still completely
avoided the base. A litter was born, and 
the behavior of the young toward these new 
foods was observed. The parents and 
young were fed for three hours a day and 
were constantly under observation during
this period. During a two-week period
(Days 14-28 of life in the pups), the pups
came out to eat, but ate only the safe 
food (possibly because this was the only 
food being eaten by the parents). When 
the young were fully weaned and placed 
in a new cage, separated from their parents, 
they continued for a six-day period to eat 
only the safe diet and ignore the formerly 
poisoned diet. Further experiments (Galef,
1971; Galef & Clark, in press) suggest a 
straightforward explanation for this effect. 
Rat pups tend to approach areas where 
adults are located and begin feeding there. 
In this way, they become familiar with the 
foods eaten by the adults and avoid al- 
ternative diets as a result of their neo- 
phobia. More recent work (Galef & Clark, 
1971; Galef & Henderson, in press) sug- 
gests an additional mechanism: chemical 
cues from mother's milk seem to function 
to familiarize the young with certain char- 
acteristics of the mother's food and produce 
initial postweaning preferences. 

Why food? Rats in the aversion experi- 
ment (Rozin, 1967b) did not show an 
aversion to the container or the location of 
the deficient food. While introduction of 
a new food dramatically increased ingestion,
change to a new location and container
did not. Apparently what the
animal learned was specific to the food. 

In an independent and much more compelling
experiment, Garcia and Koelling
(1966) provided evidence that there was a 
specific tendency for taste "CSs" to be 
associated with certain types of visceral 
"UCSs," while exteroceptive CSs such as 
light and sound would be preferentially 
associated with exteroceptive UCSs, such 
as shock. They paired light, sound, and 
taste simultaneously with either electric 
shock or poisoning in different groups of 
animals. The shocked animals developed 
an avoidance of the light and sound, but 
not of the taste. The poisoned animals,
subjected to the same light-sound-taste
pairing, avoided the taste and not the
sound or light.

Delay. The reinterpretation of specific
hungers, in itself, does not bring us much
closer to solving this important problem.
It does not appear to be possible to rein- 
terpret what appeared to be long-delay 
learning in terms of short delays. However, 
a critical experiment by Garcia et al. (1966) 
demonstrates that long-delay learning can 
occur in this system. They induced an 
aversion to saccharin in rats by injections 
of apomorphine, a drug that presumably 
produces gastrointestinal upset. Aversions 
were produced when the interval between 
termination of drinking and injection of 
the drug was one-half hour or more. Only 
a few trials were necessary to establish a 
clear aversion. 


A New Problem—Are There Learned Pref- 
erences as well as Learned Aversions? 


Before we leave the problem of specific 
hungers, there is one new issue to resolve. 
Much of the specific hunger phenomenon 
can be explained in terms of aversion. Is
there a positive side also? 

Up to this point we have provided 
evidence for three categories of foods based 
on the animal's experience (Rozin, 1968). 
These are unexperienced or novel, familiar- 
safe, and familiar-aversive. The question
is, Is there a fourth category, familiar- 
positive? That is, is the experiencing of 
positive consequences any different from 
the experiencing of neutral consequences 
when these experiences are contingent upon 
ingestion of a particular substance? 

Much of the evidence which appears to
demonstrate positive preferences can be
reinterpreted in terms of learned aversions.
For example, Harris et al. (1933) found
that when only one of a large number of
diets contained adequate thiamine, most
deficient rats were not successful in selecting
the correct diet. However, if the rats
were “educated” by being offered only
the adequate food for several days, they
showed a preference for this food when the
larger number of diets were again offered.
This experiment does not distinguish between
learned-preference and learned-aversion
interpretations, since the rats could be
avoiding the nonthiamine diets by a combination
of learned aversions and neophobia. That
is, after “education” the rats have learned
that the thiamine-containing diet is "safe," 
but it is not certain whether they have 
learned that it has any special “recovery 
from deficiency” properties which dis- 
tinguish it from other safe diets, since we 
know that deficient rats prefer “old safe” 
foods to either old aversive or novel foods 
(Rozin, 1968). Similarly, in the sampling 
experiment (Rozin, 1969a), the fact that 
the rat eventually prefers the only en- 
riched diet in a four-choice situation 
could mean that he had developed aver- 
sions to the other choices. 

There are some recent experiments 
which present more serious challenges to a 
pure aversion model. Garcia, Ervin, 
Yorke, and Koelling (1967) repeatedly 
made rats thiamine deficient and produced 
recovery by thiamine injection. Just prior 
to thiamine injection, rats drank saccharin 
solution. At all other times, water was 
the only fluid available. When thiamine 
deficient, the rats showed an increase of 
saccharin intake over trials, both with 
respect to their own water intake and to 
the saccharin intake of similarly treated 
animals where thiamine injection was not 
contingent on saccharin ingestion. Camp- 
bell (1969) has also demonstrated that 
thiamine deficient rats show an increase in 
their absolute intake of a sucrose solution 
which has been associated with recovery 
from deficiency. Zahorik and Maier (1969) 
used Garcia et al.'s (1967) procedure but 
with a modified test and found that rats 
prefer the taste associated with recovery 
to both the taste associated with deficiency 
and a novel taste. Furthermore, this 
preference was apparent in both deficient 
and recovered rats.

These experiments, which certainly pro- 
vide evidence for learned preferences, can 
nonetheless be explained in terms of the 
three basic categories of food. A more 
decisive experiment would have to show
that rats prefer a “recovery” solution to
an old "safe" solution, that is, one they
drank without aversive consequences at
a time when they were not vitamin deficient.

Revusky (1967; Revusky & Garcia, 
1970) performed a series of experiments to 
demonstrate that food with clear positive
consequences would be preferred to foods 
with relatively neutral consequences. In 
his simple design, rats were fed one nutrient 
solution while hungry and a different one 
when satiated. After five days of this 
training, a significant preference developed 
in a two-bottle test for the solution drunk 
while deprived. This result is interpreted 
in terms of the greater (delayed) rein- 
forcing effect of the solution drunk during 
deprivation. Since both solutions were 
equally familiar and the rats drank the 
solution offered in satiated conditions 
voluntarily (so that an acquired aversion 
would be unlikely), this experiment meets 
the requirements for demonstrating a 
learned preference. The effect is clear, 
though not large by comparison to the 
aversion data (see Revusky & Garcia, 
1970, for additional data). 

It is noteworthy that the positive prefer- 
ence effects reported have been rather 
small by comparison with learned aversions, 
and difficult to obtain (Kalat & Rozin, 
unpublished; Revusky & Garcia, 1970). 
We cannot satisfactorily explain this asym- 
metry. Possibly the rat is better prepared 
to learn aversions because rapid learning 
there has particular survival value; that is, 
mistakes are very costly. This remains, for 
the moment, an intriguing problem, with 
possibly great implications for the regula- 
tion of food intake. 


Other Specific Hungers 


We have offered a theory of thiamine 
specific hunger. We believe that it holds 
for other learned specific hungers as well. 
Novel-food preferences, which imply a 
learned aversion mechanism, have been 
demonstrated in calcium, sodium, and 
magnesium deficient rats (Rodgers, 1967a)
and in pyridoxine and riboflavin deficient 
rats (Rozin & Rodgers, 1967). The pre- 
ference after recovery phenomenon appears 
in identical form in thiamine, riboflavin, 
and pyridoxine deficiency (Rozin & Rod- 
gers, 1967). 

It seems reasonable to assume that the
anorexia characteristic of most
deficiencies reflects, at least in part, a
learned aversion to the deficient food. This
is dramatically clear in the case of thiamine 
deficiency, where the anorexia symptom 
disappears precipitously when a new diet 
is offered. On the other hand, there is 
relatively little anorexia in vitamin D or
vitamin A deficiency, and in both cases
there has not been a clear demonstration of
a specific hunger.

There is an impressive body of research
on amino acid specific hungers and re-
sponses to amino acid imbalance (Harper,
1967). These "deficiencies" are char-
acterized by anorexia and seem to fit into
the scheme we have described. Recently,
Rogers and Harper (1970) presented evi-
dence for a positive preference for a solu-
tion that corrects an amino acid imbalance.

We cannot complete this reconsideration
of specific hungers without mentioning one 
particularly serious shortcoming of all the 
mechanisms we have discussed. Although, 
in principle, they can account for most of 
the individual vitamin or mineral specific 
hungers, it is not clear how the classic 
"cafeteria" self-selection of Richter (1943) 
can be accounted for. Rats appear to 
self-select an extremely well balanced diet. 
Unless we assume that they develop 
incipient deficiencies of a variety of nu-
trients and learn aversions and prefer-
ences on the basis of these minimal symp-
toms and their abatement, we have no ex-
planation of this remarkable phenomenon.
At this time, long-delay learning mech- 
anisms appear inadequate to the task, 
since we cannot identify obvious candidates 
for unconditioned stimuli.4


4 Another remaining problem concerns the diffi-
culty in demonstrating specific hungers with the
nutrients in an aqueous solution (Appledorf &
Tannenbaum, 1967; Rozin, Wells, & Mayer, 1964).
This contrasts with Richter, Holt, and Barelare's
(1937) dramatic demonstration of a specific hunger
for thiamine in solution. We are inclined to believe
that the difficulty frequently encountered with
[illegible] solutions derives from pure water being one
of the choices available. Water's great familiarity
[illegible] it from becoming aversive (Garcia &
[illegible]; [illegible] Zahorik, & Albin, 1971;
Rogers & Harper, [illegible]). Palatability differences in the
various choices might also be involved.


WHY IS SODIUM HUNGER INNATE?


An alternative to the mechanism we have 
discussed is an almost completely genetic- 
ally preprogrammed specific hunger. So- 
dium hunger seems to be such a case. 
Rats sodium deficient for the first time 
show a strong preference for sodium im- 
mediately upon tasting it (Nachman, 
1962; Richter, 1936, 1956; Stricker & 
Wilson, 1970). They also show strong 
preferences for similar-tasting salts which 
will not actually alleviate their deficiency 
(Nachman, 1962), and they will work 
at a high rate in extinction at a lever which 
previously produced a salt solution which 
they drank while sodium replete (Krieck- 
haus & Wolf, 1968). 

We do not know why sodium differs 
from other specific hungers, but at least 
three explanations are available. First, 
the great importance of sodium in body 
fluid balance, its common scarcity in the
environment (witness salt licks and salt 
wars), and the large amount needed by the 
individual organism might place particular 
selection pressure in favor of a surefire 
sodium hunger. The various physiological 
adaptions directed toward regulation of 
body sodium are impressive. Second, it is 
conceivable, as Rodgers (1967b) has sug- 
gested, that sodium ingestion by sodium 
deficient organisms may have initial nega-
tive effects5 associated with large-scale
electrolyte shifts. There is not much 
evidence on this point. Sodium deficient 
animals do develop an aversion to sodium 
deficient diet (Rodgers, 1967a). However,
injections of sodium into sodium deficient
animals have not successfully produced
preferences for foods ingested just prior
to injection (Rodgers, 1967b). These
possible initial negative events may pre- 
vent the operation of the learning mech- 


5 Scott, Verney, and Morissey (1950) noted that
magnesium deficient rats selected magnesium defi- 
cient food in preference to magnesium enriched food. 
Rodgers (1967a) confirmed this “inverse” specific 
hunger and suggested that the initial ingestion of 
magnesium by magnesium deficient rats produces 
aversive consequences. Here, those consequences 
might be considered as “coming down” from the
hyperirritability or “high” characteristic of mag-
nesium deficiency.
anisms normally implicated in specific
hungers. Third, sodium as a stimulus is 
well defined by the taste system. It would 
be easy for a genetic program to tie into 
the proper class of stimuli. (Conversely, 
one might argue that the fact that sodium 
recognition seems to be so fundamental to 
the taste system suggests again the great 
importance of sodium as a directive force 
in evolution.) Although a unique learning 
ability might have evolved to meet this 
particular problem, the actual genetically 
programmed solution seems preferable. 
However, important biological functions are 
often overdetermined; this may be one ex-
ample. It has been shown that normal
rats can learn the location of sodium 
sources in their environment and go to them 
promptly when a sodium need is induced 
(Krieckhaus & Wolf, 1968). 

But now we must turn the question 
around and ask, Why are all specific 
hungers not innate? One basic learning 
mechanism can provide an adaptive solu- 
tion for almost all specific hungers (and 
also poisoning). Instead of programming 
specific innate recognitions of a variety of 
nutrients, the organism can rely on the 
fact that the malaise produced by most 
deficiencies (and possibly the consequent 
recovery on ingestion of the needed nu- 
trient) is adequate to establish aversions 
(and preferences). There remains the 
fascinating question of whether long-delay 
learning originally evolved in response to 
proper selection of nutrients or the regula- 
tion of food (caloric) intake, or both. 

Sodium hunger raises the general issue 
of the factors influencing the role learning 
will play in solving a particular problem.
With alternative mechanisms available to 
deal with an environmental problem, the 
solution achieved by a species should be 
determined by factors of economy (cost) 
dictated by the actual situation and fea- 
tures of the development of the organism. 
We can see the interplay of these factors 
in the case of what Stricker (personal 
communication, 1971) has pointed out is
another specific hunger: thirst for water.
Morgan (1894) studied the [illegible]
drinking in young chicks and found that
they apparently had no innate recognition
of water. For example, they would run 
through water trays without drinking. At 
some point in the first few days of life, 
the bird would ingest water by pecking at a 
water drop or at an irregularity on the 
surface or in the bottom of the water pan. 
As soon as water entered the bird's mouth, 
it would adopt characteristic drinking 
movements, and after only one or two 
such experiences, would show a clear visual 
recognition of water. Hunt and Smith 
(1967) repeated and extended these results. 

This system is highly predetermined: all 
components seem to be genetically pro- 
grammed, except the recognition of the 
visual stimulus for water. Even a reason- 
ably precise regulation of water intake is 
programmed, ready to come into operation 
as soon as water is experienced. Stricker 
and Sterritt (1967) have shown that on its 
very first drink, a chick's water intake is 
proportional to its deficit. We suggest 
that the visual recognition of water is 
learned because this is an efficient way of 
accomplishing the task at hand. The situa- 
tion in the environment is such that a bird 
looking for food (defined as small irregu- 
larities) will certainly end up with water in 
his beak, and since he is prepared to learn 
this and has all of the other genetic equip- 
ment necessary to handle water ingestion 
and balance, he is "home free." We em- 
phasize that this mechanism is successful 
because the bird's normal pecking and 
eating behavior (genetically programmed) 
invariably brings it into a position where 
water enters the beak. Hunt and Smith 
(1967) show that the first time a bird pecks 
at a dew drop (likely to be his first ex- 
perience with water in the real world), it 
responds with a “feeding” peck, which is 
characteristically different from an ap- 
proach to water. Only after the water 
enters the beak does drinking-type be- 
havior appear, and subsequently, the peck 
at dew drops can be seen to be qualitatively 
different from the peck at food. 

We are suggesting that there are two 
types of advantages (and therefore evolu- 
tionary pressures) to learned solutions to 
problems. The first, and most obvious, 
has to do with the variable environment.
In the face of a changing and variable en- 
vironment, it would be maladaptive to pro- 
gram specific modes of response, since these 
would often be inappropriate. Clearly, a
plastic organism can do better in a plastic 
environment. (We expect that mono- 
phagous animals are less capable of learn- 
ing about food than omnivorous ones.) 
The second factor predisposing to learning 
has to do with the “cost of preprogram- 
ming." This has two components. First, 
preprogramming of a particular behavior or 
stimulus configuration preempts a portion 
of the total genetic material, which could 
otherwise be used for some other purpose. 
Second, there may be greater costs as-
sociated with the evolution of genetic-
ally programmed solutions, insofar as
built-in solutions involve more genetic 
reorganization. 


Of course, in some cases, a genetically 
programmed recognition may be replaced 
by a learning mechanism. If an animal 
had the capability of learning something 
that was genetically programmed, other- 
wise favorable mutations in the genetic 
material responsible for this behavior could 
be selected for, even though the genetic 
basis for the behavior would be destroyed. 
Given a general learning capability, it 
could often be cheaper to allow the en- 
vironment to supply the appropriate stimu- 
lus configurations, even if these will not 
vary much throughout life. We suggest 
that this “balance sheet" type of approach 
may be fruitfully applied to all specific 
hungers. In the case of the chick, we sug- 
gest that it is necessary to genetically 
specify those portions of the feeding and 
drinking system which are distinctively 
different (the ingestive responses and 
regulation). Since the chick's behavior 
leads to water in the beak anyway, be- 
cause the preprogrammed food recognition 
overlaps with this aspect of drinking, a 
cheap way out can be found. The costs 
are minimal, and the genetic savings,
significant.

SUMMARY 
[Illegible] of specific hungers rests
on three fundamental points: (a) The
actual contingencies in the feeding situa-
tion are not what they were thought to be.
(b) The novelty-familiarity dimension is 
of particular importance to the rat. (c) 
To account fully for the phenomenon, two
new "principles" of learning were needed: 
belongingness and long delay. It is re- 
markable that each of Garcia et al.'s two 
"classic" experiments (Garcia & Koelling, 
1966; Garcia et al., 1966) dealt directly 
with one of these problems at just the time 
that these two issues became the two prob- 
lems in specific hungers. 

In the context of the problems of specific 
hungers, it seemed clear that the basic 
principles demonstrated in these experi- 
ments must be essentially correct. Both 
principles, belongingness and long-delay 
learning, seem highly adapted to the pro- 
perties of the feeding system. Tastes are 
in fact causally linked to gastrointestinal 
events, and there is a long inherent delay 
between the taste and its consequences. 
We suggest that specific learning mech- 
anisms have evolved in response to specific 
problems. In the following section, we 
shall consider taste-aversion learning in
greater detail, in order to demonstrate the
extensive ways in which learning mech- 
anisms may be modified to satisfy particular 
demands. 


TASTE-AVERSION LEARNING: AN 
EXAMPLE OF ADAPTIVE SPECIAL- 
IZATION OF LEARNING 


In this section we shall consider learning 
about food. In considering whether it is a 
"new type" of learning, we shall examine
both the belongingness and delay principles 
in some detail, consider the importance of 
familiarity and novelty, and explore other 
possible differences between learning in
the feeding system and more traditional
types of learning.


Principles of Stimulus Selection 


We can presume, from the material on
specific hungers, that when faced with a 
bewildering array of stimuli as candidates 
for association with a gastrointestinal 
event, the rat has available principles with 
which to sort them out. One concerns his 
past experience with these stimuli, the 
novelty-familiarity dimension, and the
other certain presumably built-in tend- 
encies to associate certain categories of 
stimuli with certain relevant events (be- 
longingness). We shall consider each in 
turn. 


Novelty-Familiarity Dimension and 
Associability 


We have argued that the novelty (or 
familiarity) of a stimulus is of particular 
importance for a rat (see Galef, 1970; 
Rozin, 1968). This distinction has par- 
ticular significance in determining the 
magnitude of a learned aversion to a given 
taste. Revusky and Bedarf (1967) and 
Wittlin and Brookshire (1968) showed that 
rats learn aversions much more readily to 
novel than to familiar solutions, even when 
the familiar solution is drunk after the 
novel solution (and, of course, before 
poisoning). In these experiments, familiar- 
ization occurred over a period of days, but, 
in fact, a 20-minute exposure to a solution 
followed by neutral consequences will pro- 
duce virtually the same effect, for such a 
solution is then quite resistant to associa-
tion with poisoning (Kalat & Rozin6).
This minimal (single) experience can have 
occurred three weeks before the poisoning 
without significant attenuation of the
effect.


Belongingness


The “belongingness” (Garcia & Koelling,
1966) or "preparedness" (Seligman, 1970) 
or "stimulus relevance" (Capretta, 1961) 
phenomenon—that is, the tendency to as- 
sociate tastes with aversive internal con- 
sequences as opposed to associating either 
element with anything else—seems emi- 
nently sensible from an adaptive point of 
view. Gastrointestinal and related internal 
events are, in fact, very likely to be in- 
itiated or influenced by substances eaten, 
and taste receptors, by virtue of their loca- 
tion, provide information about these same 
substances. It is essential for an omnivore
to have the ability to learn rapidly which 


6 J. Kalat and P. Rozin. Learned safety as an
explanation for taste-aversion delay of reinforcement
gradients. In preparation.


substances to eat and which to avoid. 
However, an equal ability to associate 
lights and sounds with gastrointestinal 
consequences would be far less adaptive; 
in fact, the common result would be 
“superstitious” learning. The belonging- 
ness phenomenon receives support not only 
from these adaptive arguments but also 
from neurological considerations. The 
gustatory receptors and the gut receptors 
are similarly classified as visceral sensory 
inputs and show close neurological re- 
lationships, specifically in the medulla and 
possibly in the hypothalamus. 

It appears that under some circum- 
stances, exteroceptive cues can become as- 
sociated with poisoning (Garcia, Kimel- 
dorf, & Hunt, 1961; Rozin, 1969b). 
Rather rapid learning of aversion to loca- 
tions and shape of drinking vessel can be 
demonstrated if the UCS (apomorphine) 
is introduced during drinking from the 
appropriate vessel (Rozin, 1969b). On the 
other hand, even with a modest 30-minute 
CS-UCS interval and the use of a con- 
catenation of nontaste cues including 
vessel shape, location, solution texture, and 
temperature, virtually no aversion appears 
after many trials (Rozin, 1969; but see 
Nachman, 1970a, for a demonstration that 
under proper circumstances oral tempera- 
ture—a nontaste but interoceptive cue— 
alone may be an effective cue). The sug- 
gestion here is that at least part of the 
"weakness" of exteroceptive cues derives 
from a very rapid decay of their associ-
ability with time.

The belongingness principle in relation 
to taste-aversion learning is elaborated by 
Garcia and Ervin (1968) and Garcia et al.
(1971). Further extensions of the principle 
to areas outside of feeding have been dis-
cussed by Seligman (1970) and Shettle-
worth (1971).7


7 The principle of belongingness makes some pre- 
dictions about what aversive stimuli should be em- 
ployed in various types of aversion therapy. For 
example, it would be expected that nausea-producing 
drugs such as apomorphine should be more effective 
than electric shock in treating alcoholism. At the 
present time, the data are not clear on this issue
(Rachman & Teasdale, 1969). The issue is com-
plicated because it may be the case that humans
learn different things with electric shock

Salience. There is evidence for "intra-
modality" belongingness. Rats tend to
associate poisoning with some novel solu-
tions more than others (Kalat & Rozin,
1970). Rats drank one novel solution
briefly, 15 minutes later drank a second
novel solution, and another 15 minutes
later were poisoned. The following day,
the rats were offered both solutions simul- 
taneously. Under these conditions, certain 
solutions, which we describe as highly 
salient, became more aversive than others. 
The salience of a solution proved to be a 
more potent predictor of amount of 
acquired aversion than temporal proximity 
to poisoning. It was found possible to
rank novel solutions in a stable, transitive
"salience" hierarchy, such that each solu-
tion proved more salient (associable with
poisoning) than all solutions lower on the
list. Evidence for a salience effect on the
"positive" side (recovery from thiamine
deficiency) has been reported (Campbell,
1969).

As yet it is not known what constitutes 
the defining characteristics of the salience 
dimension. It is probable that the “rela- 
tive novelty" of these operationally novel 
solutions contributes to the effect. By this
we mean that more salient solutions may 
be more different from previously ex- 
perienced solutions. There may be addi- 
tional factors associated with intrinsic prop- 
erties of the solutions. Certain tastes (e.g., 
bitter) may have a particular tendency to 
be associated with aversive consequences. 

A functional definition of belongingness. 
At this time we know that some category 
of CSs, including tastes, preferentially 
associates with some category of UCSs, in- 
cluding “gastrointestinal distress." We 
would like to define these categories more 
precisely. The unconditioned stimuli em- 
ployed have been described variously as 
"poisoning," “nausea,” and "gastrointesti- 
nal upset." But the category of effective 
stimuli may include stimuli not suggested 
by these terms. Ingested foods can cer- 
tainly produce significant internal effects 
outside the gastrointestinal system, and
it is indeed possible that the primary action
of some of the UCSs presently employed is
outside of the gastrointestinal system. On
the other hand, it is hard to understand, 
from an adaptive point of view, why pain 
in the lungs or heart, for example, should 
be selectively associated with taste. As a 
matter of fact, it is not known whether 
gastrointestinal pain is selectively as- 
sociated with taste. Much more research 
is needed to better define the range of 
visceral sensations with which tastes have 
high associability. The extensive Russian 
literature on interoceptive conditioning 
(see Bykov, 1957) demonstrates that ex- 
teroceptive cues can be attached to visceral 
UCSs. Whether there is some belonging- 
ness operating here remains to be seen. 

Yet another dimension of belongingness 
concerns the temporal parameters of the 
various CSs and UCSs. In the taste- 
visceral system, stimuli tend to have slow 
onsets and to last for long periods. Ex- 
teroceptive CSs and UCSs, in contrast, are 
characteristically brief and well defined in 
time. The importance of these dimensions 
and the visceral field for taste-aversion 
learning could both be determined by using 
experimentally controlled UCSs, such as 
electric shock to the stomach mucosa in- 
stead of the ill-defined poisoning procedures. 

The category of effective CSs should 
differ from species to species. It was 
argued above that the specific ability to 
learn taste-poison associations is highly
adaptive because foods are the usual cause 
of any aversive gastrointestinal stimula- 
tion. This argument assumes that the 
animal identifies its food by taste. How- 
ever, some species use other modalities 
as well. In particular, there is reason to 
believe that birds put main emphasis on
visual cues in the identification and selec-
tion of food. A number of experiments by
Brower (1969) indicate that birds can
readily learn aversions to the sight of
food,8 and experiments by Wilcoxon, Dra-


8 One type of mimicry is based on the fact that
many birds and some other species readily learn to
avoid toxic or unpalatable prey. In these cases, a
safe, palatable species evolves coloration similar to
that of an unsafe or unpalatable one and thereby
obtains some protection due to the predator's
generalization of its learned aversion (Brower, 1969;
Rothschild, 1967; Wickler, 1968).
goin, and Kral (1971) indicate that Jap-
anese quail learn poison-based aversions 
more readily to the color (or optical 
density) than to the taste of a solution. 
Rats and bobwhite quail were poisoned 
half an hour after drinking a solution 
which was either blue, sour, or both blue 
and sour. The rats subsequently showed 
considerable aversion to the sour water, 
but none to the blue (dense) water. The 
quail, however, showed an aversion to 
both; furthermore, the quail poisoned on 
blue-sour water generalized their aversion 
to blue water and not to sour water. To 
show that the aversion to blue water was 
not based on the taste of the blue coloring, 
the investigators showed that quail could 
form an aversion to plain water in a dis- 
tinctively colored tube. 

We suggest that the critical dimension 
for poison-based aversion learning may 
not be "taste versus other modalities" but 
"eating-related cues versus other cues." 
This type of functional categorization of 
input is in harmony with Gibson's (1966) 
view of perceptual systems. Eating-related 
cues would be whatever type of cue— 
gustatory, visual, or otherwise—a particu- 
lar species uses to identify food. Because 
of the intimate and relatively invariant 
relationship between taste receptors and 
food ingestion, and because of the neuro- 
logical association between taste and vis- 
ceral receptors, it is likely that taste cues 
would always be classified as eating- 
related so that taste-poisoning belonging- 
ness should be practically universal. How- 
ever, in some species, other modalities 
may also be eating related.

Nontaste sensory modalities could in- 
clude both eating-related and non-eating- 
related cues. For instance, it may be that 
birds can form poison-based aversions to 
the sight of potential prey, but not to 
other sights. Also, it is known that odors
are less effective than tastes in poison-
based aversions in rats. Possibly odors
are more effective when they emanate from
a food source or if they are experienced
simultaneously with a taste. It would be
interesting to investigate the effective cues
for poison-based aversion learning in other
species in which food recognition is known
to occur partly via nongustatory cues. 
For instance, frogs have specific visual cells 
which respond maximally to flying insects 
and similar stimuli (Lettvin et al., 1959). 
The above analysis suggests that stimuli 
which excite these “bug detectors” might
be more easily associated with poison than 
other types of visual stimuli. 


LONG-DELAY LEARNING


Until recently it appeared that learning 
of both classical and operant types was 
almost impossible with delays of rein- 
forcement exceeding a few seconds. The 
importance of close temporal contiguity 
has been demonstrated in a considerable 
variety of experimental paradigms, and
apparent exceptions seemed to depend on 
secondary reinforcement (Kimble, 1961). 
There were cogent theoretical and adaptive 
reasons for assuming the universal im-
portance of close temporal contiguity.
Premonitions existed, however, of a pos-
sible problem with learning about food
ingestion.


History 

The first such reference is Pavlov (1927). 
Using morphine as a UCS, Pavlov noted 
signs of nausea and profuse salivation in 
response to the touch of the experimenter, 
which preceded injection. Needless to say, 
the UCS (i.e., nausea, etc.) did not occur 
until many minutes after the morphine was 
administered. Pavlov evidently regarded 
the effect as a species of conditioning, thus 
implicitly accepting the possibility of 
learning over long delays: 


This experiment provides a clue to the well-known 
fact that dogs will eat meat the first time it is 
offered them, after removal of their parathyroids, or 
after an Eck fistula and tying of the portal vein, but 
on all subsequent occasions will refuse it. Evidently 
in these cases the appearance and smell of meat 
produce of themselves a reaction identical with that 
produced through direct pathological action in the 
absence of the parathyroids or the portal circulation, 
by those toxic substances resulting from digestion of
the meat [p. 36].


A similar recognition of the phenomenon 
of long-delay learning is present in the 
biological literature on poisoning, regula-
tion of food intake, and specific hungers. 
Thus, Harris et al. (1933), Scott and 
Verney (1947), Rzoska (1953), and Le-
Magnen (1969) explicitly implicated learn-
ing mechanisms; to biologists without any
particular commitments to psychological 
theory, this explanation seemed perfectly 
plausible. 

One way of reconciling these data to 
traditional S-R learning theory was to 
assume that animals associated a food with 
the consequences of a previous meal of the 
same food, thus achieving temporal con- 
tiguity (Rozin et al., 1964; Smith & 
Capretta, 1956). But rats need only a 
single exposure to a toxic food to learn to 
avoid it (Rzoska, 1953); thus some other
explanation is necessary. 

The discovery that long CS-UCS in- 
tervals are possible in learning about the 
consequences of foods occurred gradually. 
Garcia et al. (1961) in their early work 
found that rats learned food aversions on 
the basis of simultaneous exposure to X 
rays. Although it was known that X rays 
had their main effect with a considerable 
delay ("radiation sickness"), it was as-
sumed that some immediate effect of X
rays was involved in the learning, and 
slight evidence was offered to support this 
position. McLaurin (1964) was the first 
to operationally manipulate CS-UCS inter- 
vals over a wide range in taste-aversion 
learning with long delays of reinforcement. 
However, a methodological flaw precluded 
meaningful interpretation of this result; 
McLaurin tested for aversion immediately 
after exposure to X rays, and it was later 
demonstrated (Smith & Schaeffer, 1967) 
that the rats were learning aversions to the 
test solution during the test itself. That
is, the drinking of the solution during
testing was simultaneous with the develop-
ment of aversive consequences of the
X rays, and temporal contiguity was
achieved. Garcia et al. (1966) avoided
this problem by giving the test for sac-
charin aversion three days after exposure
to apomorphine. This experiment suc-
cessfully demonstrated learning with long
delays of reinforcement of the order of
one hour. Smith and Roll (1967) found
similar results using X rays and either 
saccharin (up to 12-hour delays) or sucrose 
(up to 6-hour delays). Replications by
Revusky (1968), using sucrose CS and
X-ray UCS, and Rozin (1969b), using
saccharin or casein hydrolysate as CSs and
apomorphine as UCS, confirmed this effect.


Hypotheses to Explain the Long Delay 


Peripheral—the aftertaste theory. Be- 
cause of the revolutionary nature of this 
finding, there was considerable interest in 
the possibility that the apparent absence of 
temporal contiguity was illusory. Although 
the CS-UCS interval was ostensibly long, 
some peripheral trace of the CS might re- 
main throughout the interval in the form 
of an aftertaste, regurgitation, or a high 
concentration in the blood. It could be 
this trace that was involved in the condi- 
tioning. However, a fair amount of data 
are now available to oppose this explana- 
tion. The main import of the data is that 
taste-aversion learning is possible under 
conditions that should greatly minimize 
any aftertaste. 

First, taste-aversion learning to sucrose
and saccharin solutions occurs on a single
trial with maximum delays of poisoning
of about 7 and 12 hours, respectively
(Revusky, 1968; Smith & Roll, 1967). It
is very doubtful that an aftertaste or
perceptible change in blood concentration
persists for such long periods. It is even
less plausible to suggest that enough
sucrose or saccharin remains in the stomach
at this time to be tasted by regurgitation. 
Actually the regurgitation model never 
had much applicability to rats anyway, 
since rats rarely if ever vomit. A second 
argument against the aftertaste and re- 
lated models is the fact that rats can learn 
aversions to a .05% HCl solution with a 
one-hour delay of reinforcement (Garcia, 
Green, & McGowan, 1960). A litmus
paper test showed no measurable amount
of HCl on the tongue two minutes after
the animal stopped drinking. Thus, the
likelihood of a conventional peripheral
aftertaste an hour later is minimal. Fur-
thermore, the solution was less concentrated
than the HCl already in the stomach.
Consequently, this experiment is peculiarly 
effective in eliminating regurgitation, stom-
ach-tasting, and blood-tasting models. A
third line of evidence is Rozin's (1969b) 
demonstration that rats can learn an 
aversion to a specific concentration of a 
solution as opposed to other concentra- 
tions of the same solution. The animal 
learned the aversion just as easily to the 
lower as to the higher concentration. Pre- 
sumably after the 30-minute delay used in 
this experiment, the aftertaste of a more 
concentrated solution should taste more 
like a less concentrated solution of the 
same substance. Similarly, the blood or 
stomach concentration of the substance at 
the time of poisoning should not be uniquely 
related to a particular concentration of 
original solution by a function known in- 
dependently by the rats. A fourth line 
of evidence is Nachman's (1970a) demon-
stration that rats can learn an aversion to
a particular temperature of water; an
aftertaste of a temperature is difficult to 
imagine. Fifth, rats can learn an aversion 
to a solution even if one or more solutions 
is drunk between the first solution and 
poisoning (Kalat & Rozin, 1970, 1971a; 
Revusky & Bedarf, 1967). The interven- 
ing solutions would surely minimize any 
aftertaste of the first. Finally, it has been 
argued (Revusky & Garcia, 1970) that 
even if the aftertaste model were correct, 
it would be difficult to reconcile the taste- 
aversion learning results to conventional 
learning theories. Even if there is an 
aftertaste, it would have been present and 
declining in intensity for such a long 
period that there would be no precedent 
for expecting learning to occur—let alone 
in one trial. 

It may be argued that none of these
arguments completely demolishes an after-
taste theory, and that certain other tests
could be conducted, for example, attempt-
ing to produce aversions to solutions
intubated intragastrically. Some recent
evidence suggests that if such aversions
occur at all, they are much smaller than
taste-mediated aversions (Smith, 1970).
However, even if demonstrable, intragastric
aversions would not solve the delay prob-
lem, as the HCl experiments (Garcia et al.,
1969), concentration aversions (Rozin, 
1969b), and solution temperature aversions 
(Nachman, 1970a) are no easier to explain 
in terms of contiguity with intragastric 
stimuli. 

Bradley and Mistretta (1971) have 
demonstrated the development of aversions 
to intravenously introduced solutions (sac- 
charin) in rats, using X rays as the UCS. 
A circulating "slug" of high-concentration 
saccharin stimulates taste receptors in the 
tongue. This interesting experiment pro- 
vides another mechanism of learned aver- 
sions, but it cannot be the only mechanism 
involved in orally mediated aversions. 
The blood concentrations used in these 
experiments are much higher than any 
which would occur naturally or in the 
long-delay experiments, and the same ex- 
periments discussed above as raising prob- 
lems for an intragastric-tasting mechanism 
would be equally troublesome for a "blood-tasting
mechanism."

At present there is no evidence in favor 
of an aftertaste theory and a considerable 
body of evidence against it. Establish- 
ment of an alternative theory seems a more 
appropriate strategy than accumulation 
of still further evidence against aftertaste 
interpretations. Of course, from the point 
of view of this paper, long-delay learning 
is exactly what should be expected in this 
situation, and the central-mediation alter- 
native appears quite acceptable. 

Central mechanisms—interference. Re-
vusky (in press; see also Revusky &
Garcia, 1970) proposes that the maximum 
CS-UCS interval in all types of learning 
depends not on time per se, but on the 
number of other potential CSs that in- 
tervene between the CS being tested and 
the UCS. That is, the animal tends to 
associate the UCS with the most recent 
potential CS, or perhaps the last several
such stimuli. As the CS-UCS interval
expands, the probability increases that the
organism will perceive other sights, sounds,
etc., and consequently the probability de-
creases that the animal will associate the
UCS with the CS in question. In taste-
aversion learning, the argument holds, the
range of potential CSs is much more re- 
stricted. Only tastes have a substantial 
tendency to be associated with aversive 
gastrointestinal stimulation, and typically 
the test solution is the most recent taste 
the animal experienced prior to poisoning. 
Since no other potential CSs are present, 
there is nothing to interfere with learning 
over long delays. In taste aversion as in 
other learning, the animal tends to as- 
sociate a UCS with the last potential CS; 
the only difference is that in taste-aversion 
learning, the potential CSs are fewer and 
less frequent. We would like to point out 
that the taste modality differs from the 
commonly studied exteroceptive modalities, 
in that virtually all stimulation comes from 
contacts initiated by behavior. Rats do 
not taste unless they approach something 
and introduce it into their mouth. 

This theory is very attractive because 
it proposes that the differences between 
taste-aversion learning and other types 
of learning may all be derived from the 
general principle of belongingness, without 
postulating an independent difference in 
the delay of reinforcement function. 

Nevertheless, the theory, if taken as the 
sole or primary explanation of delay, faces
two serious problems. It predicts that in
the absence of interfering taste cues, there
should be little or no decrement in learning
as the CS-UCS interval is increased. This
is not the case. Garcia et al. (1966),
Nachman (1970a), Revusky (1968), and
Kalat and Rozin (1971a) have all found
an orderly decrease of aversion with in-
creasing CS-UCS interval. Furthermore,
all experiments have found a maximal
interval, varying from about 2 to 12 hours,
beyond which no learned aversion can be 
demonstrated. Kalat and Rozin (1971a) 
deprived rats of both food and water during 
the CS-UCS interval, and still observed 
an orderly decrease in learned aversions 
with increasing CS-UCS interval.

Not only is there a decrement in learn-
ing in the absence of explicit interference,
but the addition of explicit interfering cues
does not markedly reduce learning. The
consumption of two or even three salient
novel solutions during a 30-minute delay
between sucrose consumption and poison 
does not eliminate the sucrose aversion 
(Kalat & Rozin, 1971a). The maximal 
interference effect we observed using three 
novel interfering solutions following sucrose, 
and one poisoning, left sucrose equally 
palatable with water, to which it is normally 
strongly preferred. 

These experiments, then, suggest that 
retroactive interference can clearly weaken 
a potential association, but it is highly 
unlikely that the normal delay function can
be largely accounted for in these terms.
What could interfere with drinking of a test
solution during six hours of no eating or
drinking that would be more effective than
three novel solutions? Proactive interfer-
ence is an unlikely explanation. Effects
from past taste experiences should be
minimal, since most rats have experienced
only highly familiar rat chow, water, and
mother's milk.

Recent findings by Wilcoxon et al.
(1971) pose further problems for Revusky's
theory. Unlike rats, bobwhite quail poi-
soned 30 minutes after drinking unflavored
water from a blue tube learn an aversion
to drinking from that tube. Since the
quail presumably saw a great many visual
stimuli in the 30-minute delay, their
ability to learn over long delays in this
situation cannot be explained simply in
terms of absence of interference.

Although it seems unlikely that inter-
ference represents the only basis for the
CS-UCS delay gradient, it does appear to
be a contributing factor. Since a rat
normally experiences fewer tastes than
other stimuli within a given period, the
Revusky theory may account for part of
the difference in the delay gradients be-
tween taste-aversion and other types of
learning.

Slow decay of associability as an inherent
property of the taste system. An alternative
to the view that the delay-of-reinforce-
ment gradient is a function of interference
is the view that the delay gradient is an
inherent property of the system itself;
memory or associability decays much
more slowly for taste than for other
modalities. This leaves unanswered the
question, What accounts for this decay? 

One possibility is that the delay func- 
tion represents the decay of short-term 
memory. According to this hypothesis, a 
stimulus is associable with other events 
only while it is held in short-term memory, 
and tastes remain in short-term memory 
for unusually long periods. (Although
there are extensive data on short-term
memory in humans, virtually none has 
been collected on taste, and it is con- 
ceivable that tastes do not compete with 
exteroceptive cues for space in short-term 
memory.) 

One way of testing this hypothesis is by 
means of electroconvulsive shock, which 
has been assumed to eradicate short-term 
memories or to block their conversion to 
long-term memories. If taste associations 
must be made from short-term memory 
extended in time, it should be possible to 
demonstrate disruptive effects of electro- 
convulsive shock, presented within the 
CS-UCS interval but with longer delays 
following the CS than are ordinarily 
effective in disrupting other types of 
memory. In a very careful study, Nach-
man (1970b) found amnesic effects of
electroconvulsive shock in some rats when
electroconvulsive shock was presented im-
mediately after 10 seconds of drinking but
not after 30 seconds of drinking. He also
found some amnesic effects from electro-
convulsive shock presented 25 seconds
after a 5-second drinking period. Kral
(1970) also found small amnesic effects
with electroconvulsive shock, using delays
of 2 or 25 minutes following a 10-minute
drinking period. As is usual with elec-
troconvulsive shock experiments, the tem-
poral parameters seem to vary from
one experiment to another. However, in
both cases, the effect of electroconvulsive
shock is small; it impeded but did not
prevent learning. And in both cases, the
effective electroconvulsive shock times were
within the range of times reported for
electroconvulsive-shock-amnesic effects in
other systems. Thus there is no evidence
that the transfer of taste stimuli into long-
term memory is unusually slow, or that
tastes remain in short-term memory for a
long time.

Another possible mechanism for the delay
function is that some central long-term
"trace" of the taste is decaying over time.
Unfortunately, this is a difficult hypothesis
to test. One experiment (Rozin & Ree, see
Footnote 9) at least puts certain constraints
on the type of decay which is possible. Rats
were anesthetized for 6-8 hours during the
interval between consumption of a sucrose
solution and LiCl poisoning. These rats
showed strong learned aversions at delays
of poisoning considerably longer than those
which are effective in the absence of
anesthesia. Thus anesthesia seems to
retard whatever process mediates the 
delay of reinforcement gradient. If this 
process is to be described as "decay," it is 
evidently an active rather than a passive 
type. 

Central mechanisms—learned safety. The 
suggestion of an “active decay” process 
raises still another possibility, which is not 
a decay mechanism at all. Perhaps the 
delay gradient should be viewed not as a 
forgetting curve but as a learning curve. 
That is, in the absence of unfavorable 
gastrointestinal events, as time passes 
following consumption of a novel solution, 
the animal learns that the solution is safe
(Kalat & Rozin, see Footnote 6). 

As evidence for this, it has been demon- 
strated that an animal does learn some- 
thing about a solution when consumption 
of that solution is followed by no negative 
consequences. We have already discussed 
the evidence indicating that rats learn less 
aversion to familiar solutions than to novel 
solutions (Revusky & Bedarf, 1967) even 
when the familiar solution was experienced 
once for only 20 minutes, three weeks 
before poisoning (Kalat & Rozin, see 
Footnote 6). This interpretation goes on 
to make a stronger claim: The learning of 
"safety" takes place within the period 
measured by the maximum delay of 
reinforcement. At the end of that period,
the “trace” has not decayed; it has merely
been associated with the absence of aversive
consequences. At intermediate delays, the
intermediate amount of learned aversion
reflects the fact that the animal has
learned an intermediate amount of safety.
Although this interpretation is grossly
different from (and not inconsistent with) the
interference or trace-decay interpretations,
it is not easy to separate the alternatives
experimentally. The results of Rozin and
Ree (see Footnote 9), which suggest that
indefinitely long CS-UCS intervals will
support aversion learning if the rat is
anesthetized during the interval, are con-
sistent with learned safety, since it is
reasonable that the rat would be unable to
learn safety while anesthetized.

9 P. Rozin and P. Ree. Long extension of effective
CS-UCS interval by anesthesia between CS and US


WHAT ALL THIS MEANS TO THE RAT


We are now prepared to describe how a 
rat can handle some of the complex prob- 
lems in food selection. The first thing to 
realize is that the situation is probably less 
complex than it might appear when the 
rat's natural behavior is considered. For 
example, the rat who gets sick in the gar- 
bage dump probably did not recently 
sample all the choice delicacies available 
(Rozin, 1969a). His choice behavior
itself will help to unconfound the situation. 
He may have eaten a few different foods. 
He “knows” it was a food that made him 
sick (the belongingness principle) and 
can discount any familiar safe foods (the 
novelty principle). With the capability 
of forming associations over long delays, 
he is now likely to associate his illness 
with the last relevant (as defined above) 
thing or few things he ate over the last 
few hours. Although some of these foods 
may become more aversive than others be- 
cause of their intrinsic properties (salience 
effect), the rat will acquire a significant 
aversion to each of them (small inter- 
ference effect), with those closer in time 
to the aversive event picking up somewhat 
more aversion (temporal contiguity). Simi-
lar mechanisms would be employed to
account for important aspects of the regula-
tion of food intake.


What Kind of Learning Is This?


We have already described a number of 
differences between taste-aversion learn- 
ing and traditional learning. We shall now 
consider some additional evidence in order 
to determine how fundamental this dis- 
tinction is. Recent experiments have 
suggested two additional differences. First, 
it appears that rats can learn taste aver-
sions when poisoned under anesthesia
(Berger, see Footnote 10; Roll & Smith, in press). Anes-
thesia was continued for a long period fol- 
lowing UCS administration, so that it was 
highly likely that the learning occurred 
under anesthesia. Second, one of the more
"complex" characteristics of classical con-
ditioning, the Kamin blocking effect (Ka-
min, 1969), is either nonexistent or rela-
tively weak in taste-aversion learning
(Kalat & Rozin, 1971b; Revusky, in press).


One interpretation of why taste-aversion 
learning does not clearly show the Kamin- 
type effects is that the taste does not be- 
come a signal for poison in the sense that 
a tone or light becomes a signal for shock.
In taste-aversion learning, the animal's
perception of the taste itself or of its
affective value may change (suggested by
H. Gleitman, personal communication,
1971). The taste itself may become
aversive or unacceptable, as if it were un-
palatable (Rozin, 1967b). By contrast,
stimuli associated with shock do not them-
selves become aversive; they evoke little
avoidance outside the training situation.
This difference seems related to the re-
ported “irrationality” of learned taste
aversions; humans commonly maintain
an aversion to foods they ate prior to
becoming nauseous, even when they are
sure that some other agent was responsible
for the nausea.

Seligman (1970) has suggested that
taste-aversion learning may be a particu-
larly primitive type of learning. The
evidence mentioned above is quite con-
sistent with this view. Furthermore, in a
biological context, it makes sense that this
ability should be primitive. The problems
of food selection and the regulation of food
intake are pervasive ones, and both are
likely to involve long-delay learning. The
essential problems are relatively invariant
across species: any new food has some
probability of being toxic, and for almost
all species, the caloric density of foods will
vary. Therefore, it makes sense that taste
aversion and related types of learning
should be more or less the same throughout
most of the vertebrate subphylum.

10 B. Berger. Learning in the anesthetized rat.


ADAPTIVE SPECIALIZATIONS OF LEARNING: 
GENERALITY AND RELATION TO
OTHER POSITIONS 


The work on specific hungers and poi- 
soning suggests that there are two aspects 
of adaptive specializations of learning. 
First, some mechanisms of learning may 
differ markedly (at least in terms of large 
quantitative differences in basic parameters) 
in the feeding system, as compared to other 
systems. Second, understanding of an 
adaptive specialization includes delimita- 
tion of the situations in which it applies 
and elaboration of its relationship and 
interaction with the animals’ natural be- 
havior (e.g., sampling, neophobia) in the 
relevant situation. The variation in inter- 
play among naturalistic context, genetic 
programming, and learning is clearly il- 
lustrated in the contrast between water 
or sodium hunger, and other specific
hungers. 

Our emphasis has been on the first aspect 
of adaptation, while the ethologists have 
focused more on the context of learning 
and interplay of prestructured and ac- 
quired components: 


The student of innate behavior, accustomed to
studying a number of different species and the entire
behavior pattern, is repeatedly confronted with the
fact that an animal may learn some things much
more readily than others. . . . In other words,
there seem to be more or less strictly localized
"dispositions to learn." Different species are
disposed to learn different parts of the pattern. So
far as we know, these differences between species
have adaptive significance [Tinbergen, 1951, p. 145].


. . . innumerable observations and experiments
tend to show that modifiability occurs, if at all, only
in those preformed places where built-in learning
mechanisms are phylogenetically programmed to
perform just that function. How specifically these
mechanisms are differentiated for one particular
function is borne out by the fact that they are very
often quite unable to modify any but one strictly
determined system of behavior mechanisms [Lorenz,
1965, p. 47].


We are aware of only two well-studied
systems showing adaptive specializations of
learning. One is feeding, which we have 
already discussed, and the other involves 
imprinting, as a mechanism of species 
recognition. 

Two of the prominent features of im- 
printing—the special sensitivity during a 
critical period early in life and the great 
resistance to extinction—can be seen as 
adaptations to limit the likelihood of 
errors in species recognition. The learning 
should take place soon after hatching, since 
the probability of exposure to a conspecific 
(i.e., the mother) is highest at this time,
and the bird should be less sensitive to 
later experiences, since the frequency of 
contact with members of other species is 
likely to increase greatly after the nestling 
period. 

Feeding and imprinting can be con-
sidered as two exceptions to an otherwise
correct "general process" view of learning, 
or they can be considered as examples 
of a basic adaptational principle pervading 
much or all of learning. We prefer the 
latter alternative and believe that the 
absence of additional known instances of 
adaptive specializations may reflect learn-
ing psychologists’ reluctance to study
potentially learned behaviors which do not 
fit into the general process paradigms. 
(Significantly, the case for adaptive spe- 
cializations in both imprinting and feeding 
came from outside the psychology of 
learning.) 

At this point one can only speculate 
about what other systems will show special 
adaptations of learning. Bees may possess
a wide variety of adaptive specializations 
(von Frisch, 1953, 1967). A particularly 
promising example is their navigational 
ability. First, there is evidence for
“belongingness.”


Honey bees can learn to use irregular forms, like
those of trees or rocks, as landmarks by which to
steer a course to and from the hive; but they cannot,
even by the most subtle conditioning technique, be
taught to use the same forms as positive or negative
signals indicating the presence or absence of food in
a tray, as von Frisch (1914) has shown. As signals
for food, bees can distinguish different forms only if
they are geometrically regular, preferably radially
symmetrical (Hertz, 1937) [Lorenz, 1965, p. 47].


Second, a limited amount of experience in 
observing a piece of the sun's arc enables 
the bee to “project” the rest of the arc. 
That is, bees raised without the oppor-
tunity to see the sun have great difficulty 
compensating for sun movement during 
navigation. But bees that have observed 
sun movement only during a limited period 
in the morning are fully capable of com- 
pensating for sun movement in the after- 
noon (von Frisch, 1967). In this situa- 
tion, the environmental input produces a 
long-term change, and does not act "as-
sociatively," but rather provides a refer-
ence point. Other examples of such
"calibrational learning" (Lorenz, 1965)
might include adaptation to visual dis-
placement produced by prisms and caloric
regulation (see Footnote 11).

The naturalistic literature is replete with
other examples of surprising abilities of
animals, such as digger wasps’ memory of
the location and state of their nests
(Baerends, 1941), gobies’ latent learning
of the location of tidepools (Aronson,
1951), salmon's recognition of home-stream
odor (Hasler, 1966), doves' individual mate
recognition (Morris & Erickson, 1971),
sparrows' acquisition and storage of song
dialects (Marler, 1970), etc. These have
not been thoroughly studied as possible
specialized learning mechanisms.

11 We can suggest a role for "calibrational learn-
ing" in the regulation of food intake. It has been
known for some time that rats and other mammals
respond to changes in caloric density of food by
appropriate modulation of volume intake. In the
rat, this compensation occurs largely as an increase
in meal size, rather than in number of meals
(Snowdon, 1969; Teitelbaum & Campbell, 1958).
When a standard diet is diluted, the rat ends up
eating larger meals: he has "recalibrated" his meal
size on the basis of the metabolic aftereffects of his
meals. It is possible to imagine a mechanism which
compares some measure of the amount ingested with
its delayed metabolic effects, and adjusts future
meals downward or upward so that the metabolic
consequences will approach some preferred or ideal
value. The demonstration of a long-delay learning
mechanism makes this type of explanation feasible
and subject to investigation.

The "heuristic" value of an adaptive 
evolutionary point of view can be suggested 
by considering the types of predictions 
that might be made about some basic 
learning and memory relationships. For 
example, an organism’s memory of some 
aspect of the environment is useful only 
if that aspect is predictable or controllable; 
otherwise, rapid forgetting might be ad- 
vantageous. It is probably of no use for 
bees, birds, or other organisms with com- 
plex navigational abilities to remember 
whether it was cloudy yesterday or which 
way the wind was blowing, though both 
may have been important at the time. 
Under these circumstances one might 
expect to find a specialization in short- 
term memory such that information could 
be stored for longer periods and in greater 
quantities than usual without entering 
long-term memory. 

Another example concerns extinction. 
Extinction, from an adaptive point of 
view, allows an organism (a) to correct for 
mistakes (fortuitous conditioning) and (b) 
to constantly reshape itself to adapt to a 
variable environment. One might expect 
the rapidity of extinction to depend on the 
probability that either of these events 
would occur. In the case of imprinting, we 
have great resistance to extinction in a 
case where clearly the environment will 
not vary (i.e., the species will not change), 
and the proper imprinting object is almost 
certain to be present at the time of im-
printing. The great resistance to extinc-
tion of avoidance conditioning (Solomon &
Wynne, 1954) may be an adaptively
selected feature of this learning: the costs
of errors of omission here are high. One
might expect rapid extinction of the
learned location and stimulus properties
of food sources, where these sources are
subject to marked seasonal or other tem-
poral variations. An example might be
extinction of responses to particular types
and locations of flowers in foraging bees.
One might observe slow learning, but high
resistance to extinction in a situation where
the environment is stable but a critical
difference is relatively hard to detect
(e.g., prey density), and "averaging" must 
be done before a clear choice or preference 
is established. One might observe rapid 
learning and rapid extinction in a rapidly 
changing environment (see Shettleworth, 
1971, for additional comments on this 
point). 

Recently, a number of authors have ex- 
pressed positions related to ours (Garcia & 
Ervin, 1968; Garcia et al., 1971; Revusky, 
in press; Revusky & Garcia, 1970; Selig- 
man, 1970). We differ from all of these 
authors in the sense that they see be- 
longingness, in one form or another, as 
the unique phenomenon to be explained, 
whereas we see it as an example of the 
general adaptational principle; animals 
may not only learn some things more
easily than others, but they may also 
learn some things in a different way than 
others. The contrast is clearest in the 
case of Revusky, who attempts to pre- 
serve traditional learning theory intact, 
with the introduction of a new “belonging- 
ness” assumption. 

Seligman (1970) proposes a new di-
mension, “preparedness,” based on the
belongingness relationship, to incorporate
recent findings into a more viable learning
theory. Preparedness represents the tend-
ency of certain inputs (and/or outputs) to
be associated with one another, this
tendency resulting from natural selection.
Seligman proposes that highly prepared
associations are established with a minimal
input (e.g., number of pairings). In
addition to very rapid learning, prepared
associations would tend to show learning
with long delays of reinforcement and per-
haps high resistance to extinction. We
are certainly in accord with the general
flavor of Seligman's position, but we feel
that in his desire to reorder the phenomena
of learning, he has not fully appreciated
the diverse natural forces that can shape
behavior and learning mechanisms. In
imprinting, for example, although Seligman's
view of rapid learning (and high resistance
to extinction) fits very well, the presence
of the critical period cannot be accom-
modated without additional assumptions.
More significantly, the long-delay learning
found in feeding is probably not character- 
istic of other "prepared" associations, and 
in our view, it should not be, since in most 
cases close temporal contiguity is the best 
predictor. Furthermore, we see no reason 
to expect a consistent relationship between 
rapid learning and high resistance to ex- 
tinction (see discussion of extinction above). 

In short, we disagree with Seligman that 
diversity can be ordered along any single, 
operationally meaningful dimension, that 
is, preparedness. If preparedness meant 
adaptedness to situational demands, it 
would be an acceptable but not clearly 
meaningful dimension. Seligman has given 
it operational meaning, but narrowed the 
scope of the phenomena he can account for 
in the process. In a more recent statement 
of the preparedness position, Seligman and 
Hager (in press) have acknowledged some 
of these possible limitations. 

Shettleworth (1971), in a paper written 
concurrently with this one, has presented 
a position very similar to ours. It focuses 
on belongingness, broadly conceived, but 
also describes learning in general in terms 
of adaptive specializations, and provides 
examples from the naturalistic literature. 
A recent paper by Lockard (1971) also 
explicitly discusses the diversity of learn- 
ing mechanisms as an important feature 
ignored by most psychologists and implies 
that the search for common elements in 
learning across species and situations is 
almost hopeless. 

We differ slightly from Shettleworth and 
markedly from Lockard with respect to 
our optimism about the possibility of find- 
ing order within the diversity of learning 
mechanisms. Given the constraints on 
adaptations produced by basic properties 
of the nervous system, the cost of evolving 
specializations, and the fact that most 
species face a common set of problems, we 
doubt that a separate learning mechanism 
would exist for every situation, or that 
there would be separate laws for each 
species. It may yet be possible to formu-
late laws of some degree of generality,
taking ecological factors into account (see
Walls, 1942, for an example in the area of
vision).

It follows from the point of view pre-
sented here that an organism may have an 
ability that manifests itself in only a small 
number of the total possible situations in 
which an experimenter might test for it. 
The ability in question might be inac- 
cessible to or "unconnected" with the 
machinery for modulating and controlling 
most behavior. Using a computer analogy, 
we might suppose that a particular routine 
is designed (evolved) to handle a specific 
problem. At this point in time, it is 
physically connected only into the relevant 
inputs, outputs, or systems, and is in- 
accessible to the rest. Under these cir- 
cumstances, it would be difficult to describe
a species' learning capacity.

The interesting demonstrations by Bit- 
terman (1968), Gonzalez and Bitterman 
(1969), and Mackintosh (1965) of dif- 
ferences in learning abilities in certain 
situations between a few species of fish 
and mammals do not in themselves in- 
dicate complete absence of these abilities 
in any species of fish (see Gleitman & 
Rozin, in press, for a general review of this 
issue with respect to learning and memory 
in fish). A species should be tested for 
any ability in those situations where its 
existence would have the greatest survival 
value. The failure of a few fish species to 
demonstrate abilities such as progressive 
improvement in habit reversal in a few 
laboratory situations does not meet this 
demand. 


However, the data gathered by Bitter-
man, by Gonzalez and Bitterman, and by
Mackintosh do suggest some interesting
phylogenetic generalizations about learn-
ing capacities. Since the laboratory ap-
paratus used to study rats and other
mammals is often not ideally suited to
their natural behavior, just as the fish
apparatus is not, it is quite interesting that
mammals reliably show the greater plas-
ticity in these "unnatural" situations. One
explanation for this, offered by
Bitterman (1964), is that the rat possesses
certain higher learning abilities that "fish"
do not possess. Another not mutually ex- 
clusive possibility is that the rat and 
probably most other mammals are “gen- 
eralists," compared to the majority of other 
vertebrates. That is, abilities initially 
evolved to handle a specific situation and 
limited in application to the appropriate 
system may turn out to be useful in others 
and may through evolution be “connected” 
into new systems. To what extent do the 
higher abilities of the mammals represent 
an increase in accessibility of specialized 
capacities, an “emancipation,” to borrow 
a word from the ethologists, of a capacity 
from its original tight motivational system? 

The proposed increase in accessibility of 
capacities in phylogeny may have a par- 
allel in ontogeny. Within the Piagetian 
framework, it is apparent that particular 
cognitive structures may have only limited 
applicability at any point in development. 
Piaget applies the name "decalage" to this
feature of development (Flavell, 1963,
pp. 21-24). For example, in the Piagetian
scheme, the same cognitive structure is 
necessary for the achievement of mass and 
weight conservation, yet the latter occurs 
about two years after the former. Cogni- 
tive development may consist, in part, of 
the extension of existent capacities to new 
situations, in parallel to our scheme for
phylogeny. 

We have argued for the existence of
adaptive specializations in learning and
memory. Since, by their nature, such
specializations are limited to a relatively
narrow range of situations, we have pointed
out that they must be inaccessible to most
functioning systems. We believe that this
general formulation has a wide application,
and we are presently applying it to an
understanding of the difficulties in initial
acquisition of reading (Rozin & Kalat,
in press; Rozin, Poritsky, & Sotsky, 1971).

It is the basic thesis of this paper that in
a biological-evolutionary framework, spe-
cifically adapted abilities are extremely
important and should not suffer from
neglect as a consequence of the search for
great generalities. To understand a set
of phenomena, within humans or across
the animal kingdom, is to be able to de-
scribe and explain diversity, as well as to
extract common elements.


Columbia University

This paper outlines a more complete explanation of how people solve ordering
syllogisms than has previously been given. Earlier, one of the authors proposed
that people combine information from the two syllogistic premises using a
strategy which involves spatial imagery. A more complete explanation of the
total process of solving the syllogisms would include an account of the
comprehension of the adjectives and comparative sentences which must be
understood to solve the problems, as well as an account of question answering.
The present paper is concerned with these issues and, in addition, with an
evaluation of the relative explanatory value of the spatial imagery proposal
as against more abstract formulations.


The present paper is concerned with the 
mental processes involved in solving order-
ing syllogisms or "three-term series" prob-
lems. These problems typically include
two comparative sentences which specify 
the order of three items on some dimension, 
plus a question about order; for example, 
“Tom is taller than Harry. Joe is shorter 
than Harry. Who is tallest?" Since the 
information about item order is given in 
two separate sentences, each of which 
describes the relation between two of the 
items, people must put together informa- 
tion from these separate sentences to de- 
termine the order of all three items. In 
order to put that information together, 
they must be able to understand the com- 
parative sentences, and to do this, in turn, 
they must understand the adjectives within 
these sentences. Finally, in order to solve 
the problem, they must also understand the 
question. 
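
The task structure just described can be made concrete with a short sketch. It is offered only as an illustration of what a correct solution requires, not as a model of how people solve the problems: it simply enumerates every possible ordering, which is not a claim about human strategy. The function name, the tuple encoding of premises, and the two small lexicons are assumptions introduced for this example.

```python
from itertools import permutations

# Hypothetical lexicons (assumptions for illustration only).
# +1: the first-named item has MORE of the property; -1: it has LESS.
COMPARATIVES = {"taller": +1, "shorter": -1}
# +1: the question asks for the top of the ordering; -1: for the bottom.
SUPERLATIVES = {"tallest": +1, "shortest": -1}


def solve_three_term(premises, question):
    """Return the item the question picks out, or None if it is undetermined.

    premises: list of (A, comparative, B), e.g. ("Tom", "taller", "Harry")
    question: a superlative such as "tallest"
    """
    items = sorted({name for a, _, b in premises for name in (a, b)})
    consistent = []
    # Try every ordering (most -> least) and keep those that satisfy
    # every premise.
    for order in permutations(items):
        rank = {name: i for i, name in enumerate(order)}  # 0 = most
        satisfied = True
        for a, comparative, b in premises:
            a_has_more = rank[a] < rank[b]
            if a_has_more != (COMPARATIVES[comparative] == +1):
                satisfied = False
                break
        if satisfied:
            consistent.append(order)
    end = SUPERLATIVES[question]
    answers = {order[0] if end == +1 else order[-1] for order in consistent}
    return answers.pop() if len(answers) == 1 else None


premises = [("Tom", "taller", "Harry"), ("Joe", "shorter", "Harry")]
print(solve_three_term(premises, "tallest"))
```

Each step in the sketch corresponds to one requirement named above: the lexicon lookups stand in for understanding the adjectives and comparatives, the consistency check for combining the two premises, and the final selection for understanding the question.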

In explaining the mental processes of the 
problem solver, one may deal only with the 
issue of how information from the two 
premises is combined, as Huttenlocher 
(1968) did. This process is traditionally 
what is meant by reasoning, as separable 
from merely understanding the individual 
premises. A more complete explanation of
the mental processes people use in getting
from the statement of the problem to the 
answer would, in addition, include an ac- 
count of the comprehension of adjectives 
and comparatives as well as an account of 
question answering. In an earlier paper, 
Clark (1969) attempted to do the latter, 
while basically ignoring the question of how 
information from the two premises is com- 
bined. In this paper, we consider all three 
aspects of the problem-solving process. 


ADJECTIVES 


Adjectives fall into various semantic 
classes (e.g., height, color). The members 
of such classes (e.g., for height: tall, short; 
for color: red, yellow; etc.) have many 
elements of meaning in common. Such 
semantic classes can be grouped into differ- 
ent types which vary with respect to how 
the member adjectives are related to one 
another. Our concern in this paper is with 
the type of semantic class where the 
member adjectives fall into single continu- 
ous dimensions allowing for the ordering of 
various items within them. Examples are 
tall-short, beautiful-ugly, hot-cold, etc. 
Such dimensions account for a large number 
of adjectives and can be contrasted with 
somewhat different adjective classes: di-
chotomous classes (e.g., dead-alive, male-
female, etc.) and classes where the members
cannot fit on a single dimension (e.g., shape,
with the members circular, square, etc.;
color, with the members red, yellow, etc.).


Different Kinds of Continuous Dimensions


There are many kinds of continuous di- 
mensions, involving all those ways people 
rank objects, persons, and events with 
respect to particular properties, qualities, 
and values. One kind of dimension which 
is easily distinguishable from others is the 
kind which describes quantitative proper- 
ties of material objects and of space, time, 
etc.; for example, length (long-short), 
width (wide-narrow), speed (fast-slow). 
The domain described by the adjectives in 
such dimensions usually involves a metric 
(inches, pounds, miles per hour, etc.) as 
well as a true zero point (no length, weight, 
or speed). Thus the dimension describes a 
ratio scale, and one can make statements 
like “John is five feet tall," “Building A is 
twice as high as Building B.” 

Outside this set of metric dimensions, 
many of which are spatial, the largest 
number of other continuous dimensions 
lack both a metric and a zero point, thus 
forming only ordinal scales. Therefore,
items can only be rank ordered within them.
Included in this set are various dimensions
of value and quality; for example, beauty
(beautiful-ugly), kindness (kind-unkind),
happiness (happy-sad), and evaluation
(good-bad). There are, in addition, some
dimensions which lack a true zero point 
but nevertheless allow one to consider 
relative differences between different pairs 
of points, thus forming interval scales; for 
example, temperature (hot-cold) and intel- 
ligence (intelligent-unintelligent). 
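
The three kinds of scale distinguished above differ in the kinds of statement they license. The following sketch tabulates that distinction using the example dimensions given in the text; the names `Scale`, `PERMITTED`, `DIMENSIONS`, and `licenses` are hypothetical and serve only this illustration.

```python
from enum import Enum


class Scale(Enum):
    ORDINAL = "ordinal"    # no metric, no zero point: rank order only
    INTERVAL = "interval"  # relative differences, but no true zero point
    RATIO = "ratio"        # a metric and a true zero point


# Kinds of statement each scale type supports, following the text.
PERMITTED = {
    Scale.ORDINAL: {"rank"},
    Scale.INTERVAL: {"rank", "difference"},
    Scale.RATIO: {"rank", "difference", "ratio"},
}

# Example dimensions taken from the passage above.
DIMENSIONS = {
    "length": Scale.RATIO,
    "width": Scale.RATIO,
    "speed": Scale.RATIO,
    "temperature": Scale.INTERVAL,
    "intelligence": Scale.INTERVAL,
    "beauty": Scale.ORDINAL,
    "kindness": Scale.ORDINAL,
    "happiness": Scale.ORDINAL,
    "evaluation": Scale.ORDINAL,
}


def licenses(dimension, statement_kind):
    """True if the dimension's scale type supports this kind of statement."""
    return statement_kind in PERMITTED[DIMENSIONS[dimension]]


# "Building A is twice as high as Building B" is a ratio statement.
print(licenses("length", "ratio"))
print(licenses("beauty", "ratio"))
```

On this tabulation a statement such as "twice as high" is supported for length but not for beauty, where only rank ordering is available.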

Metaphoric usage of adjectives is com-
mon, and one cannot conclude
that an adjectival expression involving a 
spatial dimension is necessarily being used 
in its mundane measurement sense; for 
example, “He felt twice as tall after winning 
the award," "He has a shallow knowledge 
of foreign affairs." It also must be pointed 
out that the above division of adjectives 
into semantic sets of certain characteristics 
does not cover every case. At least one 
spatial dimension, that of overall size as 
encoded in the terms "big" and "small," 
has no metric although it surely has a zero 
point. A dimension which would seem to 
have an obvious zero point and an obvious
metric, that of wealth ("rich" or "poor"),
is not often used in that way (“He is 
$250,000 rich" is not an acceptable sen- 
tence) but rather is used chiefly as an ordi- 
nal scale which seems to reach out un- 
bounded in both directions. 


Marked and Unmarked Adjectives


The polar adjectives for the two ends of 
most continuous dimensions are asymmet- 
rical in various ways; one of these polar 
adjectives is typically designated as 
"marked" and the other as “unmarked.” 
In his analysis of spatial adjectives (a 
category which is largely synonymous with 
our category of ratio adjectives), Bierwisch 
(1967) has proposed that the meanings of 
such polar pairs might be broken up into 
various elements, most of which the two 
adjectives share, but on one of which they 
differ, namely, which pole each designates. 
He suggested assigning to one adjective a 
"+" and to the other a "-." The former
is typically called the unmarked and the 
latter the marked adjective. 

For the spatial adjectives he has dealt 
with, there are two bases for deciding which 
adjective should be assigned a "+": First,
the unmarked adjective indicates the pres- 
ence of a property which can extend in- 
definitely in an upward direction. The 
other adjective, which is assigned the "-"
pole, indicates the absence of that property, 
the extreme lower bound being zero. To be 
tall, wide, etc., is to possess something, 
whereas to be short, narrow, etc., is to lack 
rather than to have something. Second, 
as Bierwisch (1967) and Clark (1969) have 
pointed out, while both the marked and the 
unmarked adjective may be used contras- 
tively to attribute to an item or class a 
particular position along the dimension, 
(e.g., "A is tall," "A is wide," "A is
narrow," "A is short"), the unmarked ad-
jective also has a nominal usage, for
which there is no parallel with the
marked adjective. That is, the unmarked
adjective can be used to name the
entire dimension, as in "How tall is the
girl?" "How wide is [...]?" Used in this way,
the marked adjective is either
anomalous or seems to imply a prior judg-
ment about the item one is referring to, as
in "How short is the girl?"

These two sources of asymmetry between 
the polar adjectives from ratio scales are 
obviously related. That is, a dimension
designates a property which can vary in 
extent. Since the unmarked adjective 
designates the presence of the property, 
whereas the marked adjective designates its 
absence, the former would seem the more 
sensible way to name the dimension than 
the latter. That is, there are units of 
length, width, etc., but not of shortness, 
narrowness, etc. 

The dimension name is not always a form 
of the unmarked adjective; for example,
height, speed, etc. Nevertheless, for these 
dimensions as well, the unmarked adjective 
can be used in the same neutral sense as 
when these words do coincide, whereas the 
marked adjectives cannot; for example, one 
can neutrally say “How tall is the girl?” or 
"How fast can he run?" but not "How 
short is the girl?" or “How slow can he 
run?" 

For adjectives from ordinal scales, the
asymmetry between marked and unmarked
adjectives does not derive from the dimen-
sion having a zero point and extending
indefinitely in an upward direction. Rather,
most such scales seem to extend indefinitely
in both directions, as in good-bad and
happy-sad. Therefore, the marked adjec-
tives from such scales do not literally
indicate complete absence of a property
defined by the unmarked adjective. That
is, whereas narrow means the absence of the
property of width, sad does not so much
mean the absence of happiness as it means
the presence of a property of sadness.
Nevertheless, as is shown below, people do
treat the marked adjectives from ordinal
scales as if they designated the absence of
the property exhibited by the unmarked
adjective.

The other basis for the asymmetry be-
tween the marked and unmarked adjectives
from ratio scales was that the unmarked ad-
jective could be used to refer to the di-
mension. This is true of the unmarked
adjective for ordinal scales as well, as in
"How happy is he at his new job?" or
"How good is that restaurant?" although
it could be argued that the unmarked adjec-
tive from ordinal scales does not cover the
whole dimension in the same sense as the
unmarked adjective from ratio scales.
That is, as was said above, sadness is in
some sense a property in itself rather than
just the absence of happiness. As with ratio
scales, the marked adjective from ordinal
scales cannot be used nominally in a neutral 
sense; for example, “How sad is he at his 
new job?" 

Given that ordinal scales, unlike ratio 
scales, extend in both directions from what 
might be called a zero point, one could in 
some sense regard them as separate dimen- 
sions; for example, happiness and sadness. 
It is not clear, therefore, why one of the 
adjectives is chosen over the other to be 
used neutrally. One common factor uniting 
those adjectives which can be used neu- 
trally is that they designate the affectively 
or evaluatively positive poles of these or-
dinal scales; for example, kind, happy,
good, etc.

Not only is the affectively positive adjec- 
tive from ordinal scales used neutrally, and 
taken to indicate the presence of a property, 
but, conversely, the unmarked adjective 
from ratio scales is quite commonly used to 
indicate positive affect, as in such meta-
phoric phrases as "broad interests." Simi-
larly, the marked adjective from ratio scales
can be used metaphorically to indicate
negative affect, as in "low down deal,"
"short end of a bargain," etc.

For both ordinal and ratio scales, the di-
mension designated by the polar pair, x and
y, includes a region around the midpoint
between the two poles where an item is
neither "x" nor "y." Thus "not-x"
includes not only "y" but this neutral
region. Even those marked adjectives
which are formed by adding a negative
prefix to their unmarked counterparts do
not simply mean "not-x"; it is possible to
be "not wise" without being "unwise," or
"not kind" without being "unkind." [...]

The meanings of unmarked and
marked polar opposites differ in complexity,
in that the meaning of the marked adjective 
seems to be built on the meaning of the 
unmarked adjective plus a logical operation 
of opposition. That is, “John is short" 
seems to mean "John is the opposite of 
tall." Furthermore, the logical operation
by which the meaning of the marked adjec- 
tive is built upon the meaning of its un- 
marked counterpart is in some sense nega- 
tive. That is, if "x" is the unmarked and 
"y" the marked adjective, "y" seems to 
mean "no x." "No x" here includes only 
the marked end of the dimension, in con- 
trast to the negation of "x," “not-x,” 
which includes all portions of the dimension 
except the "x" portion. For ratio scales, 
the marked adjective literally designates 
the absence of a property, and the neutral 
region designates some average amount of 
the property. For ordinal scales, as was 
noted above, one could argue that such an 
interpretation of the scale is only meta- 
phorical. Nevertheless, the evidence for the 
logical negativity of the marked adjectives 
involves both ratio and ordinal scales and 
comes from several sources: from the forms 
of definitions given for both types of adjec- 
tives, from internal linguistic evidence, and 
from differences in comprehension difficulty. 
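
The contrast between "no x" and "not-x" can be stated compactly by treating a dimension as three regions. This is only a schematic restatement of the distinction drawn above; the region labels and function names are assumptions made for the illustration.

```python
# A dimension pictured as three regions, from the marked end to the
# unmarked end (labels are hypothetical).
REGIONS = ("y", "neutral", "x")


def no_x():
    """'No x': the marked adjective y, covering only the marked end."""
    return {"y"}


def not_x():
    """'Not-x': plain negation, covering everything except the x portion."""
    return set(REGIONS) - {"x"}


# The marked end is a proper subset of what simple negation covers.
print(no_x() < not_x())
print(sorted(not_x() - no_x()))
```

The region left over when "no x" is removed from "not-x" is the neutral region around the midpoint.
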
Mann (1968) reports an experiment
where subjects (Ss) were asked to define
certain "favorable" and "unfavorable" ad-
jectives. Generally, the "favorable" and
"unfavorable" adjectives come from ordi-
nal scales and could be classified as "un-
marked" and "marked" adjectives, respec-
tively. The Ss were more likely to define
"unfavorable" than "favorable" adjectives
by denial. The vast majority of the denial
definitions for "unfavorable" adjectives in-
volved denial of the presence of their
"favorable" opposites.

We did an analogous study specifically
involving marked and unmarked adjectives
from both ratio and ordinal scales. We ex-
tracted definitions from dictionaries in order
to compare the extent to which marked
adjectives were defined as negations of the
unmarked form and vice-versa. Twenty
pairs of common polar adjectives were
chosen (see Table 1). Of these 40 adjec-
tives, 31 were in the highest frequency
category (AA) of the Thorndike and Lorge
(1944) word frequency count.


TABLE 1

TWENTY PAIRS OF POLAR ADJECTIVES FOR WHICH
DICTIONARY DEFINITIONS WERE OBTAINED

Unmarked      Marked

big           small
hot           cold
high          low
noisy         quiet
deep          shallow
good          bad
rich          poor
tall          short
wide          narrow
dirty         clean
kind          cruel
happy         sad
old           young
heavy         light
fast          slow
strong        weak
beautiful     ugly
hard          soft
fat           thin
smart         stupid


We randomly assigned these adjective pairs to
four groups of five pairs each, and then as-
signed each group to one of four major dic-
tionaries: Webster, Random House, Ox-
ford, and Funk and Wagnalls. To make our
analysis, we used the first dictionary defini-
tion which was specifically applicable to
the dimension we were considering. If this
first definition was stated in terms of the
negative of the other member of the pair,
we gave it one point. Otherwise, it was
given a zero. We set a very stringent
criterion for negativity, excluding even
those adjectives defined relative to the norm
or standard of the dimension using words
such as "less than," "having little." For
instance, "bad" was defined as "not good"
and received one point. "Small" was de-
fined as "comparatively less than another
or than a standard" and received a zero.
Not one unmarked adjective was defined
as the negation of the marked
member of the pair, whereas [...] marked
adjectives from ordinal and ratio
scales were defined as the negation of the
unmarked member of the pair. Thus
marked adjectives are often defined in
terms of "lacking" or "wanting" in the
property designated by the unmarked
member of the bi-polar pair. Additional
evidence that the marked adjective is
thought in terms of the unmarked plus
some additional operation, but not the
reverse, is found in a memory experiment 
by Clark and Card (1969) where Ss more 
often recalled a marked adjective as un- 
marked than vice-versa. 
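
The dictionary procedure described above (random assignment of pairs to dictionaries, then a one-or-zero score for each first applicable definition) can be sketched as follows. This is a minimal sketch under stated assumptions: the function names are hypothetical, the random seed is arbitrary, and the stringent criterion is approximated by counting only an explicit "not" followed by the opposite term. The original scoring was presumably done by hand, and the exact cues that counted as negation are not fully specified in the text.

```python
import random
import re


def assign_to_dictionaries(pairs, dictionaries, seed=0):
    """Randomly split the pairs into equal groups, one group per dictionary.

    Assumes len(pairs) is a multiple of len(dictionaries), as with
    20 pairs and 4 dictionaries.
    """
    shuffled = list(pairs)
    random.Random(seed).shuffle(shuffled)
    size = len(shuffled) // len(dictionaries)
    return {
        name: shuffled[i * size:(i + 1) * size]
        for i, name in enumerate(dictionaries)
    }


def negation_score(definition, opposite):
    """Score 1 if the definition explicitly negates the opposite term, else 0.

    Assumption: only "not <opposite>" counts; relative wording such as
    "less than" or "having little" is deliberately excluded.
    """
    pattern = rf"\bnot\s+{re.escape(opposite)}\b"
    return 1 if re.search(pattern, definition.lower()) else 0


# The two worked cases given in the text.
print(negation_score("not good", "good"))
print(negation_score("comparatively less than another or than a standard", "big"))
```

Summing `negation_score` separately over the marked and the unmarked adjectives would give the two totals compared in the study.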

Another type of evidence about the nega- 
tivity of marked adjectives comes from lin- 
guistic data showing that marked adjectives 
function in some ways like negatively 
prefixed adjectives. Klima (1964) dis-
cusses sentence frames which are acceptable
only when a negatively prefixed word is 
inserted in the blank. He notes that certain 
"adversatives" or "inherent negatives" 
function like negatively prefixed words. 
At least some marked adjectives seem to be 
adversatives in this sense, since they fit 
meaningfully into sentences with "any," 
which require a negative element, whereas
their unmarked counterparts do not. For 
example, (a) He would be stupid (*smart) 
to take any more. (b) It was cruel (*kind) 
to do any more. (c) He was sad (*happy) 
to see any more conflict. 

Additional linguistic evidence also sug-
gests that marked adjectives from both
ratio and ordinal scales may incorporate
an implicit negative. Zimmer (1964) points
out that negative prefixes are rarely at-
tached to the marked adjective, whereas
they are commonly attached to the un-
marked form; for example, unhappy, un-
attractive, but not *unsad, *unugly, etc.
If the marked adjective contains an implicit
negative, then adding a negative prefix
would result in a double negative, which one
would expect people to avoid, since they
generally avoid expressions with multiple
negatives. For instance, if "sad" is de-
fined as "unhappy," "unsad" would mean
"ununhappy." 

It might seem reasonable at first glance
to give an alternative argument: that it is
the negative affect of the marked adjective
rather than its logical negativity which
makes it unacceptable to add the negative
prefix. This seems unlikely, however, 
since there are examples of adjectives with 
negative affect which do take a negative 
prefix: unspoiled, unbroken, uninjured, etc. 

Further evidence of the logical negativity 
of marked adjectives comes from a psycho- 
logical study by Sherman (1969). He 
showed that negative sentences with 
marked adjectives are more difficult to com- 
prehend than identical sentences with their 
unmarked counterparts. The addition of 
a marked adjective increased the difficulty 
of sentence comprehension in much the 
same way as the addition of another explicit 
negative did. As is shown below, it is also 
more difficult to comprehend comparative 
expressions involving marked adjectives 
than those involving their unmarked 
counterparts. 

An alternative explanation for the differ- 
ence in the difficulty of marked and un- 
marked adjectives is that the unmarked 
adjective is often more frequent than its 
marked counterpart. However, there is 
some evidence that frequency is not the 
determining factor. In a few cases, the 
marked adjective is more frequent than 
the unmarked adjective, according to 
the  Thorndike-Lorge frequency count 
("short" and "poor" are more frequent 
than "tall" and "rich," respectively) and, 
for several other pairs, the frequencies of 
marked and unmarked pairs are equal. 
The results for such pairs parallel the results 
for other pairs (Huttenlocher, 1968; also
Huttenlocher²). In fact, one reason marked
adjectives are generally less frequent than
unmarked adjectives may be that they
are negative and thus more complicated,
rather than vice-versa.


Other Adjectives from Continuous 
Dimensions 

For certain dimensions, there are in ad-
dition to the two adjectives which can be
designated as marked and unmarked, other
adjectives which are not simply synonyms
of these two polar adjectives. For example,
with respect to size there are, in addition
to the polar pair "big-small," the terms
"tiny," "huge," "minuscule," and "gigantic."
With respect to human girth there are, in
addition to the polar pair "fat-thin," the
terms "obese," "skinny." For temperature
there are two commonly used polar pairs;
namely, "warm-cool," and "hot-cold,"
and, in addition, terms like "tepid" and
"lukewarm." Many of these additional ad-
jectives, like "huge" and "tiny," refer to
portions of the dimension more extreme
than those ordinarily used to designate the
poles. We will have reason to discuss pairs
of extreme adjectives of opposite meaning
below when we consider the synonymy of
different types of expressions involving
pairs of polar adjectives.

² J. Huttenlocher. On answering questions. In
preparation.


EXPRESSIONS INVOLVING ADJECTIVES 


We consider three types of assertions 
about items, classes, or events (which we 
designate as A, B, C, etc.) involving adjec- 
tives (which we designate as x and y for a 
polar pair). We also consider the syn-
onymy of different adjectival expressions
which arise from the semantic relations 
among polar opposites. 

A is x


Examples: John is short. Elephants are
big.
It has often been claimed that assertions 
of this type are implicitly comparative; 
that the meaning of the first example 
actually is "John is short compared to the
average boy of his age," and, of the second 
example, “Elephants are large compared 
to the average of all classes of animals.” 
Certainly, these assertions only indicate 
A's position on the dimension with respect 
to some particular superordinate class, 
rather than that A meets some absolute 
standards. Thus, it is perfectly acceptable 
to say, “This elephant is small" or “This 
turtle is fast." While the entire class of 
elephants is large and that of turtles is slow 
in comparison to various other classes, the 
particular items can be small or fast com- 
pared to other members of the class. 
These adjectival expressions, however, 
can be used to assert the relation of an item 
to a superordinate class independent of
absolute standards only in certain cases;
namely, when they include the common 
polar adjectives from ratio scales. They 
cannot be used independently of absolute 
standards with extreme adjectives or with 
polar pairs from many ordinal scales. 
Consider, for example, the case of extreme 
adjectives from ratio scales. One cannot
say "This elephant is minuscule"; the ad-
jective seems to apply only within an
absolute range of sizes from which even the 
smallest of elephants is excluded. The 
same is true for extreme adjectives from 
interval scales. For instance, “hot” cannot 
be used to apply to objects normally con- 
sidered cold, such as ice water, if one only 
means to imply that the object is less cold 
than the average of its superordinate class. 

In addition, one cannot use ordinal scale 
adjectives like “happy,” "sad," "pretty," 
"homely," etc., to compare some person, 
object, or event with the norm of a super- 
ordinate class without taking into con- 
sideration the absolute qualities of the 
items involved. For instance, in a beauty 
contest, it could not be asserted, “That 
beauty contestant is homely" just because 
the girl was less beautiful than the average 
beauty contestant. For ordinal and ex-
treme adjectives then, expressions of the
form "A is x" are not comparative as are
those expressions with ratio
scale adjectives. To say "The water is
cold," "Joan is pretty," asserts that an
item meets certain absolute standards, and
in that sense such adjectivals are more like
assertions from noncontinuous adjective
classes like "The apple is red," or "The
patient is alive" than are expressions
involving adjectives from ratio scales, like
"This elephant is small." Thus, as one
might expect, the usage of ordinal and ratio
scale adjectives does differ, even though
the marked adjectives from both types of
scales are treated as designating the
absence of the property exhibited by
the unmarked adjective. That is, despite
the evidence about the logical negativity
of the marked adjective from both types of
scales, we have here an indication that, as
was mentioned above, the ordinal scale in
some sense extends in both directions from a
zero point and thus involves two separate
properties rather than simply different 
amounts of the same property. 


A is too x, B is not x enough 


Examples: The child is not old enough to
go to school. The man is too short to be
a professional basketball player.

Such expressions assert that an item, 
class, or event is excluded from the portion 
of the dimension which would enable it to 
be a member of some category, or to be 
appropriate for certain purposes. Bier- 
wisch (1967) proposes that these two types 
of expressions are synonymous for pairs of 
antonymous polar adjectives; for example, 
"A is too short" implies “A is not tall 
enough" and “A is too tall" implies “A is 
not short enough." 

For adjectives from ratio scales, which 
compare an item to other items of its class 
independent of absolute standards, these 
expressions can indeed be used freely to 
exclude an item from any portion of the 
dimension; for example, while the example 
about the basketball player above excluded 
an item from only a small area at one ex- 
treme of the dimension, such expressions 
can also be used to exclude an item from a 
very large portion of the dimension, as in 
“The man is too short for his feet to reach 
the floor when he sits on a chair." How-
ever, for those adjectives which indicate
that an item meets some absolute standard,
these expressions cannot be used freely in all
portions of the dimension. One cannot say,
for instance, "The elephant is not minuscule
enough to fit through the temple doors"
(note that the expression is acceptable if
the word "minuscule" is replaced by
"small"). For such extreme adjectives, as
well as for many polar adjectives from
ordinal scales, the two forms of expression
are not synonymous for pairs with opposite
meaning as they are for the bipolar pairs
from ratio scales; for example, "A is too
cold" is not really synonymous with "A is
not hot enough" nor is "A is not pretty
enough" really synonymous with "A is too
homely."


A is more x than B, C is not as x as D


Examples: Tom is more agile than Harry. 
Jane is not as smart as Mary. 

The most common way to compare two 
items, classes, or events is the “more” form, 
as in “more beautiful," “more friendly," 
etc. For many adjectives, the compared 
"more" form is expressed in a single word 
involving the adjective plus "er" which 
means "more," as in "taller," "happier," 
etc. There is a symmetrical form of ex- 
pression for explicitly comparing two items; 
namely, “A is less x than B.” The latter 
is used less frequently than the synonymous 
form "A is not as x as B" which itself is 
less frequently used than the “more” form. 
The "A is not as x as B" form is called a
"negative-equative" and derives from the 
equative "A is as x as B” which asserts the 
equality of the two items being compared. 
However, the negative-equative is not, in 
fact, the negation of the equative, since 
the negation would only assert that the two 
items are not equal without specifying di- 
rection. Rather, the "not" applies as in 
many statements of quantity to mean 
“lacking in” or “less.” 
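
The difference between the negative-equative and a plain logical negation of the equative can be shown by writing each form as a truth condition over the amounts of x possessed by the two items. The numeric encoding and the function names below are assumptions made only for this illustration.

```python
def equative(a, b):
    """'A is as x as B': the two items are equal on the dimension."""
    return a == b


def negated_equative(a, b):
    """Logical negation of the equative: unequal, with no direction given."""
    return a != b


def negative_equative(a, b):
    """'A is not as x as B': A has less x than B (directional)."""
    return a < b


def less_form(a, b):
    """'A is less x than B', synonymous with the negative-equative."""
    return a < b


# Arbitrary illustrative amounts in which A exceeds B.
a, b = 5, 3
print(negated_equative(a, b))   # the items are unequal
print(negative_equative(a, b))  # but A is not the lesser one
```

When A exceeds B, the logical negation of the equative holds while "A is not as x as B" does not, which is the sense in which the "not" means "less" rather than simple denial.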

It has often been claimed that the “A is 
more x than B" form is a short form for a 
sentence which says, in full, "A is x to a
greater extent than B is x." One possible
way to interpret such statements is to say
that they are made up of two nuclear propo-
sitions joined by the relation "more than,"
one asserting "A is x," and the other that
"B is x." Such an analysis would imply
that one cannot make a comparison be-
tween two items, A and B, without attrib-
uting the quality x to both items. That
this is not the general case is shown by the 
following examples: "A hippopotamus is 
smaller than an elephant," “A jet plane is 
slower than a rocket ship." According to
the proposed analysis, the first comparative
would presuppose the truth of the assertions
"An elephant is small" and "A hippo- 
potamus is small," and the second compar- 
ative would presuppose the truth of the 
assertions “A jet plane is slow" and “A 
rocket ship is slow." Clearly, these com- 
paratives do not presuppose any such thing, 
but only that both items can be evaluated
with respect to the dimension to which
those adjectives belong. That is, one can
compare two items using "bigger" or
"smaller" as long as they are material objects,
or using "faster" or "slower" as long
as they move. Insofar as two items can be 
compared using either of the two polar ad- 
jectives, no matter where the items are lo- 
cated along a dimension, one can describe 
the same relation between the items in two 
alternative ways using the "more" form of 
comparison: “A is more x than B" or “B 
is more y than A." 

As these examples of comparatives with 
ratio scale adjectives show, one must reject 
any general proposal that the comparative 
“A is more x than B” can be broken up into 
the two assertions “A is x” and "B is x." 

Nevertheless, comparatives with extreme 
adjectives or polar adjectives from ordinal 
scales which involve absolute standards do 
seem to presuppose that both items possess 
the quality x. To say "A is more obese 
than B" does seem to presuppose that “A is 
obese" and "B is obese." Similarly, to 
say "C is homelier than D” does seem to 
presuppose that "C is homely” and “D is 
homely.” 

We would tentatively propose the follow- 
ing: If an adjective indicates that an item 
meets certain absolute standards, then it is 
only appropriate to use it when the items 
meet that standard; alternatively, if an 
adjective can be used independent of 
absolute standards, then it can also be used 
to describe the relative positions of the two 
items no matter where along the dimension 
those items fall. Thus, for adjectives where 
absolute standards are involved, the two 
expressions “A is more x than B” and “B 
is more y than A” with “x” and “y” being 
adjectives of opposite meaning, are not 
synonymous; for example, “A is prettier 
than B" and “B is homelier than A" are 
not really synonymous. However, unmarked
adjectives from ordinal scales can
probably be used more neutrally than their 
marked counterparts; this may be related 
to the fact, noted above, that the unmarked
adjective from these scales can be used
nominally.


Even for comparatives involving the 
polar adjectives from ratio scales, the location
of the items along the dimension may
well have some influence on which polar 
adjective a speaker chooses; that is, if the 
items do possess the quality x to a notable 
extent, then, other things being equal, the 
speaker might use “x” in a comparative de- 
scribing the relation between them. How- 
ever, the location of items on the dimension 
is only one of the factors which determines 
which adjective is used. To describe a 
particular relation between two items using 
a "more" comparative, a decision about the 
grammatical subject will determine which 
polar adjective must be used. That is,
when one of the items is the topic of conver- 
sation and is selected as grammatical sub- 
ject, this determines the adjective. Other 
aspects of the speaker's focus of concern 
may also influence which adjective is 
chosen; for example, if a basketball coach 
has an extra player and must reject one, he 
may ask, "Is player A worse (shorter, 
slower) than player B?" Therefore, the 
adjective used in a "more" comparative in- 
volving ratio adjectives and some ordinal 
adjectives is not a sufficient clue as to the 
location of the items; one must also take 
the conversational context into account. 
We would guess that there would be more 
reason for the listener to infer the location 
of the items along a dimension from a 
“negative-equative” than a "more" com- 
parative. The reason is the following: The 
“more” form is the common way to com- 
pare two items and one would guess that a 
speaker would use it unless it did not allow 
him to express what he wanted to say. 
This could happen if a speaker both had 
reason to make one of the two items the 
grammatical subject, and also wanted to 
use a particular adjective. For example, in 
a discussion of the nation’s poor where one's 
topic was migrant farm workers, one might 
say, "Migrant farm workers are not as
poor as the urban unemployed" to avoid 
the adjective "rich" which would appear 
in the "more" comparative. 
This tentative formulation of what is
assumed about the items in "more" and
"not as" comparatives, and in comparatives
[...]. For example: one can
have Ss formulate comparatives
under varying conditions to determine the 
effects of these conditions on the type of 
comparative they will use; one can ask 
listeners to infer the location of items along 
a dimension given different comparative 
sentences in varying conversational 
contexts. 


SYLLOGISMS 


Comprehension of the Premises of the Syllogism

The ordering syllogism is a special con- 
text in which to comprehend comparatives. 
Syllogisms require the problem solver to de- 
termine the order of the three items de- 
scribed in the premises. Even if the com- 
parative premises do provide him with 
information that the items are at a particu- 
lar end of the dimension, such information 
is totally irrelevant to the reasoning task, 
which is to determine item order. In fact, 
Ss can solve syllogisms even when the two 
premises involve two extreme adjectives of 
opposite meaning, both of which would 
ordinarily require items to meet certain 
absolute and opposing standards. For 
example, the premises "A is more saintly
than B. C is more wicked than B" are 
admittedly odd, but within the context of 
the syllogism, people can regard them as 
descriptions of the relative positions of the 
three items on a single dimension. In 
fact, as we show below in the section on 
answering questions, there is no evidence 
that Ss' representation of item order preserves
any information about which adjectives
were used in the premises.

The Ss must combine information from
two sources to determine the order of items
in a comparative: (a) the adjective, which
designates one of the poles, and (b) the
type of comparative, "more" versus "not
as" or "less," which indicates how the adjective
is used to make the comparison.
Both factors affect the difficulty of ordering
items from a comparative, because for each
source of information, there is both a positive
and a negative form. That is, the unmarked
adjective and "more" designate the
presence of a property, whereas the marked
adjective and "not as" or "less" designate
its absence. Since the notion of absence
seems to be more psychologically complex
than the notion of presence, both factors 
affect the difficulty of a premise. 
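
The two sources of information just described can be illustrated with a short sketch. It is offered only as an illustration under stated assumptions, not as a procedure from the studies discussed here: the adjective pairs, the function name `decode`, and the convention that the "higher" item has more of the property named by the unmarked adjective are all hypothetical. The sketch counts one negative for a marked adjective and one for a "not as" or "less" comparative, and treats two negatives as cancelling.

```python
# Illustrative sketch; the adjective pairs and all names are assumptions,
# not materials from the experiments described in the text.
MARKED_TO_UNMARKED = {"short": "tall", "bad": "good", "slow": "fast"}
UNMARKED = set(MARKED_TO_UNMARKED.values())
NEGATIVE_COMPARATIVES = {"not as", "less"}


def decode(subject, comparative, adjective, obj):
    """Decode one comparative into (higher, lower, negatives).

    "higher" is the item with more of the property that the unmarked
    adjective names; "negatives" counts the marked adjective and the
    negative comparative form, each of which designates absence.
    """
    if adjective not in UNMARKED and adjective not in MARKED_TO_UNMARKED:
        raise ValueError(f"unknown adjective: {adjective}")
    if comparative != "more" and comparative not in NEGATIVE_COMPARATIVES:
        raise ValueError(f"unknown comparative: {comparative}")

    adjective_is_negative = adjective in MARKED_TO_UNMARKED
    comparative_is_negative = comparative in NEGATIVE_COMPARATIVES
    negatives = int(adjective_is_negative) + int(comparative_is_negative)

    # No negatives, or two negatives that cancel, put the subject higher.
    if adjective_is_negative == comparative_is_negative:
        return subject, obj, negatives
    return obj, subject, negatives


if __name__ == "__main__":
    print(decode("A", "more", "tall", "B"))
    print(decode("A", "less", "short", "B"))
```

On this sketch, "A is less short than B" yields the same order as "A is taller than B" but carries two negatives, which corresponds to the "double negative" case discussed in the text.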

"Not as" and "less" comparatives are 
used less frequently than "more" comparatives,
but it seems most reasonable to suppose
that these forms are infrequent because
they are inherently hard to interpret,
rather than that they are hard to interpret 
because they are infrequent. In fact, we 
proposed in a previous paper (Hutten- 
locher, Higgins, Milligan, & Kauffman, 
1970) that “not as" comparatives are so 
difficult to comprehend that Ss tend to re- 
organize them into the familiar "more" 
form in the process of understanding them. 
Which "more" comparative they use (“A 
is not as x as B" might be reorganized either 
as "B is more x than A" or as "A is more y 
than B") depends, we hypothesized, on 
which item seems to them to be the topic, 
what type of adjective is involved, etc. 
When a premise has both the “not as" 
form and a marked adjective, there would 
be the greatest tendency to change it to 
the familiar “more” form, because such a 
statement, we would argue, involves a
double negative. Indeed, Ss often spontaneously
describe sentences like "A is less
short than B" as "double negatives." 

Our Ss frequently report that they 
represent information from both the adjec- 
tive and the comparative form spatially 
when their task is to determine the order of 
the items, either from three-term series 
problems or single comparatives (as in the 
problem “Sam is worse than Harry. Who 
is better?"). The individual items them- 
selves are generally reported as points in 
space with names. It is the order of these 
points which Ss claim to imagine as a spa- 
tial array. DeSoto, London, and Handel 
(1965) have pointed out that people tend to 
think about adjectival dimensions spatially, 
with the unmarked adjective frequently 
thought of as being at the top and the 
marked adjective at the bottom. Our Ss 
also report that they tend to think about
"more" and "less" in terms of directional
movement, with "more" being a movement
in an upward direction and "less" a movement
in a downward direction.

Our Ss report that imagery is intimately
tied up with determining the order of items
from comparative expressions. According 
to Ss’ introspections, they construct im- 
aginary spatial arrays from comparatives 
by imagining placing the grammatical 
subject and then moving the grammatical 
object with respect to it, using spatially 
represented information about the adjective 
and the comparison term. 

DeSoto et al. (1965) have proposed that 
people prefer a premise to proceed from the 
top downward. They argue that Ss find a 
syllogism easy when its premises contain 
adjectives which they assign to the top 
because then the grammatical object goes 
beneath the grammatical subject, allowing 
Ss to proceed downward. It just so happens,
though, that for those adjectival dimensions
where people have strong spatial
preferences, it is the unmarked adjective 
which they prefer at the top. DeSoto et al. 
did not discuss the meanings of the polar 
adjectives per se; their proposal seems to 
suggest that Ss interpret the premise before 
they give spatial assignments to the adjec- 
tival dimension, and that it is the way these 
spatial assignments are made which de- 
termines the difficulty of the problem. At 
least on the face of it, this proposal differs 
from the proposal that a syllogism is easier 
when its premises contain unmarked ad- 
jectives because these are less complex in 
meaning than marked adjectives. 

As we have seen, there is evidence that 
the meanings of marked and unmarked 
adjectives differ in complexity apart from 
any spatial assignments these adjectives 
may be given. People report spatial images
only when they order items along a di- 
mension. The evidence that marked adjec- 
tives are more complex than unmarked 
adjectives because they include an implicit 
negative comes from a variety of sources, 
many of which do not involve spatial 
imagery; for example, from the way people
define marked adjectives, from the way
marked adjectives can be combined with
other words, and from the difficulty of
interpreting sentences with marked adjectives
where ordering is not involved. Since
the proposal that unmarked adjectives are
simpler than marked adjectives covers all
three cases, including ordering problems,
one need not invoke spatial assignments as
an explanatory principle. 

To conclude that one need not postulate 
spatial preferences to explain the differ- 
ences in difficulty between comparatives 
with unmarked and marked adjectives is 
not to say that spatial images serve no 
function in ordering items. Furthermore, 
whether or not they serve a function, the 
very fact that people report images in which 
they give consistent spatial assignments to 
unmarked and marked adjectives is itself a 
phenomenon to be explained. There are 
two separate issues involved: why people 
report spatial images at all; and given that 
they do so, why they consistently assign 
unmarked and marked adjectives upward 
and downward positions, respectively, and 
regard "more" and “less” as upward and 
downward movements, respectively. We 
will consider each of these issues in turn.

The most plausible explanation of why 
people report spatial images when they 
decode comparatives in ordering problems 
is that they regard ordering as a spatial 
operation. To order real objects along a 
nonspatial dimension, people assign spatial 
coordinates to the poles of the dimension so 
they can use the spatial array to preserve 
order information, as, for example, in 
arranging clothing in stores by size. If 
items are ordered spatially, one can easily
determine what item has a particular
relative position; for example, find the one
which is "most x," "most y," etc. It is no
more surprising that Ss should imagine
themselves to be making spatial arrays
when they must order items in a syllogism
than that they should imagine themselves
to be moving chessmen when evaluating
alternative moves in a chess game. There
is, however, a question about the function
of such imaginative activities in either case;
an issue considered later in this paper.

The second question is why people report
consistent spatial preferences rather than
simply assigning spatial axes arbitrarily.
We speculate that both people's spatial
preferences and the fact that they encode
marked adjectives in a more complex way
than unmarked adjectives derive from the
underlying way they think about adjectival
dimensions. As we have seen, adjectival
dimensions for ratio and even ordinal scales
seem to designate single properties which 
vary in extent, with the unmarked adjec- 
tive designating notable presence and the 
marked adjective notable absence of the 
same property. 

Consider the possibility, then, that for 
both ordinal and ratio scales, people think 
of the extensiveness of a property in terms 
of mass, as if the property were a pile of 
some material substance. We could then 
explain people's spatial preferences in 
ordering problems—both for unmarked and 
marked adjectives, and for the comparative 
terms "more" and "less." The unmarked 
adjective would be toward the top because 
it designates the presence of a space-occupy- 
ing property, and the marked adjective 
toward the bottom because it designates 
the absence of that property. “More” 
would be an upward movement because it 
designates addition, and "less" would be a
downward movement because it designates 
subtraction. (Our pilot studies show that 
the unmarked adjective is represented as 
upward because it indicates the presence 
of a property, not because it doubles as the 
name of the dimension. The evidence 
comes from extreme pairs where neither 
adjective can be used neutrally, like “gor- 
geous-hideous" and  "saintly-wicked." 
People regard "gorgeous" and “saintly” as 
spatially upward and “hideous” and 
"wicked" as spatially downward. In addi- 
tion, syllogisms with the adjectives “gor- 
geous” and “saintly” in both premises are 
easier than those with the adjectives "hideous"
and "wicked" in both premises
[Higgins¹].)

In conclusion, any spatial assignments of
polar adjectives are, logically speaking,
equally good for preserving information
about order, as long as these spatial assignments
are made consistently within a
problem. The fact that Ss exhibit consistent
spatial preferences across different
problems with the unmarked adjective on
top, and that unmarked adjectives are
simpler than marked, suggest that Ss'
assignments of the poles are related to the
meanings of the adjectives. We presented
a tentative hypothesis to account for these
findings; namely, that adjectival dimensions
are represented as properties which
accumulate mass as they increase in extent.
According to this hypothesis, a marked
adjective, y, is difficult and is imagined
downward for the same reason; that is,
because it designates the absence of property x.

¹ E. T. Higgins. A look at extreme adjectives in
relation to syllogisms. In preparation.


Combining Information from the Two 
Premises 


An earlier paper (Huttenlocher, 1968) 
proposed a mechanism whereby people 
combine information from the two premises 
to determine the order of all three items. 
This proposal was based on a behavioral 
test of Ss’ introspections about how they 
solve these problems. The Ss claim that 
they imagine the described items to be 
material objects to be arranged in space, 
and the premises to be instructions about 
how to arrange these imaginary items. 
According to their introspections, they use 
the first premise to create an imaginary 
array of the first two items. They claim 
to use the second premise as an instruction 
about how to fit the third item in among 
the two items from the first premise. They 
say that the second premise is not decoded 
into a separate spatial array, but rather, is 
used to find the position of the third item 
with respect to one of the items which is 
already positioned in the array. 
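
The procedure that Ss describe can be sketched in a few lines. The sketch below is a hypothetical rendering under stated assumptions, not the authors' model: the names `add_premise` and `solve` are introduced here, each premise is assumed to be already decoded into a (higher, lower) pair, and the array is written from top to bottom. The first premise sets up two items; the second premise is used only to place the one new item with respect to an item already in the array.

```python
# Illustrative sketch; function and variable names are assumptions.
# Each premise is given already decoded as a (higher, lower) pair.

def add_premise(array, higher, lower):
    """Fit the one new item of a premise into a top-to-bottom array.

    Returns the extended array, or None when the premise does not fix
    the position of the new item relative to every item already placed.
    """
    if (higher in array) == (lower in array):
        raise ValueError("premise must contain exactly one item already placed")
    if higher in array:
        # The new item goes below an old item; its place is fixed only
        # if that old item is currently at the bottom.
        return array + [lower] if array[-1] == higher else None
    # The new item goes above an old item; its place is fixed only
    # if that old item is currently at the top.
    return [higher] + array if array[0] == lower else None


def solve(first, second):
    """Order three items from two decoded premises, or return None."""
    array = [first[0], first[1]]  # the first premise sets up two items
    return add_premise(array, second[0], second[1])


if __name__ == "__main__":
    # "Sam is better than Harry. Joe is worse than Harry."
    print(solve(("Sam", "Harry"), ("Harry", "Joe")))
```

A return value of None marks a pair of premises that leaves the order of the three items undetermined, for example when both premises place a different item above the same third item.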

Our original proposal about the construc- 
tion of spatial images reported a test of 
this claim. We had found in other experi- 
ments that when people arrange real items 
according to instructions, the difficulty of 
arranging a mobile item with respect to 
one or more fixed items depends on its 
grammatical role in a sentence. That is, 
when a mobile item must be arranged with 
respect to fixed items, it is easiest to place
when it is described as (a) the grammatical
subject of a “more” comparative (Hutten- 
locher & Strauss, 1968), or as (b) the actor 
in either an active or a passive sentence 
with a transitive verb (Huttenlocher, 
Eisenberg, & Strauss, 1968). We also 
found that for "not as" comparatives, the
difficulty of placing a real mobile item with
respect to a real fixed item varies with the
adjective, depending, as we argued earlier,
on how Ss reorganized the negative- 
equative in the process of understanding it. 
We tested Ss’ claims about constructing 
imaginary spatial arrays by presenting 
them with syllogisms which were broken 
into two parts. First, Ss were given the 
first premise and were asked questions 
about the order of the first two items. 
Next, they were given the second premise 
and were asked questions about the order 
of all three items. The pattern of results 
for the syllogisms exactly paralleled the 
pattern of results for the arrangement tasks. 
The difficulty of answering questions about 
the order of all three items varied with the 
grammatical status of the new item in the 
second premise, just as the difficulty of 
placing a real mobile item with respect to 
real fixed items varied with its grammatical 
status in the instruction. That is, a syllo- 
gism is easier when the new item in the 
second premise, which Ss imagine to be
mobile, is described as (a) the grammatical
subject in a “more” comparative, or as (b) 
the actor in either an active or passive sen- 
tence with a transitive verb (we used 
"leads," “is led," "trails," “is trailed”). In 
addition, the results for syllogisms with 
negative-equatives also paralleled those for 
arrangement tasks. That is, the results for 
both syllogisms and arrangement tasks 
varied with the adjective used, and both 
varied in exactly the same way. 

The occurrence of a parallel between the 
difficulty of arrangement tasks and syllo- 
gisms is consistent with but does not prove 
that people imagine syllogisms to be ar- 
rangement tasks. The question of how to 
interpret the observed parallel requires a
consideration of the reasons for the findings
of the arrangement tasks. We have argued
elsewhere that the arrangement studies
suggest that the grammatical role of an
item in the description of a transitive relation
affects the difficulty of understanding
how that item is related to the other item
(Huttenlocher & Weiner, in press). It is
for this reason that the difficulty of arranging
a real item that one is trying to find out
about ("new" item) with respect to an
item which is fixed in an array ("old"
item) varies with the grammatical role of 
the "new" item. 

For syllogisms, there are no real items, 
so Ss need not take account of the fact that 
one of the items in the second premise is 
"new" and the other is “old” when they de- 
code the second premise. Rather, they 
could order the two items from the second 
premise, and only afterward combine this 
information with that from the first premise. 
However, the parallel between the relative 
difficulty of ordering the “new” item from 
the second premise in a syllogism and of
ordering the "new" item in an arrangement
task strongly suggests that Ss do treat the
third item in the second premise as “new” 
and the other item as “old.” This is clearly 
consistent with their constructing spatial 
images. However, the grammatical role of 
the "new" item would affect the difficulty 
of understanding how it was to be ordered 
even if Ss did not construct a spatial array. 
Thus, one could argue that it is not neces- 
sary to postulate that Ss represent order 
spatially in these reasoning problems; that 
is, that they actually imagine adding a new 
item into a spatial array of fixed items. One 
could instead describe the process of combining
information from the two premises
noncommittally, without postulating the
construction of spatial images. That is,
one could simply postulate that Ss order the
set of three items by determining from the
second premise the relation of the "new"
item to the "old" item, while remembering
at the same time that the "old" item is itself
ordered with respect to yet another
item.

To conclude that one could just as well 
reformulate our imagery proposal in some 
more abstract logical fashion would be to 
ignore its major strength. That is, the
imagery hypothesis provides a single overall
account of our experimental findings
and, in addition, links Ss' behavior to their
introspections. It accounts not only for
Ss' introspections about the nature of the
solution process, but also their introspections
about why they make errors. That is,
the spatial imagery hypothesis provides Ss
with an algorithm for obtaining answers to
all soluble syllogisms, and for recognizing
which syllogisms are insoluble. To attribute
this strategy to our Ss, who make many
errors, we must postulate performance 
factors which limit their success. If Ss do 
construct spatial images, they must remember
both the names and relative positions
of the first two items while decoding
the second premise; so they might some- 
times remember where they placed an 
item, but forget the names. The situation 
would be analogous to doing long division 
without pencil and paper; a person might 
have no problem about what procedures to 
follow, yet have trouble keeping track of all 
the necessary information. Our Ss complained,
for example, "I figured it out, but
then I forgot the name of the best guy." 

It is not obvious at present whether any
theory which postulates imagery as a mech- 
anism for solving problems can or cannot, 
in general, be reformulated in an abstract 
logical fashion that, nevertheless, makes the 
same behavioral predictions. We know 
from people’s overt behavior that they do 
use various kinds of symbolic schemes to 
represent information; they talk, make 
graphs, build scientific models, etc. However,
one can only infer the forms of people's
mental representations of information inso- 
far as these produce different behavioral 
consequences. If alternative forms of representation
serve equivalent functions in
preserving and transforming information
(for example, if graphic representation is
equivalent to algebraic formulation, or
diagrammatic representation to verbal
description), one could not identify these as
distinct forms of mental representation.
In that case, such questions as whether 
thought involves imagery, implicit speech, 
etc., might prove meaningless. Rather, 
one could only give an abstract logical 
specification of Ss' strategy. The general
issue of whether different types of symbolic
schemes vary in their potential for representing
and transforming different kinds
of information has not yet been explored in
depth.

A different type of behavioral evidence 
for imagery has been reported by Brooks 
(1967, 1968). He found mutual interference
between visual or auditory imagery
during problem solving and the intake of
sensory information in the modality in
which the imagery occurs. This clearly 
demonstrates the "psychological reality" of 
imagery and, in addition, provides a tool for 
evaluating whether such imagery has oc- 
curred in particular cases. Shaver (1970) 
used such an interference technique to 
investigate whether imagery is involved in 
solving ordering syllogisms. The Ss were 
presented problems either visually or orally, 
with presentation time controlled for the 
two modes. (The Ss either heard problems 
on tape or viewed a film where one word was 
presented at a time. The time between 
words was the same for both modes of 
presentation.) The Ss did significantly 
better with oral rather than visual presen- 
tation. Shaver offers evidence that this 
finding is due to interference rather than 
inherent differences in the difficulty of 
auditory versus visual presentation. While 
more evidence is needed, his findings sug- 
gest tentatively that imagery is indeed 
involved as a mechanism for solving order- 
ing syllogisms. 

Even if there proves to be good reason to 
postulate that people's mental represen- 
tations are isomorphic with the physically 
realized representations they use in solving 
analogous problems (graphs, maps, etc.), 
the major issues about human problem 
solving will remain to be resolved. One 
wants to explain how people use various 
symbolic schemes to preserve and trans- 
form information, regardless of whether 
that information is represented in thought 
or on paper. That is, it seems to us that 
the really challenging issue is raised by the 
fact that people are able at all to use 
spatial arrangements to preserve information
about the order of items along nonspatial
dimensions.


Answering Questions 

The spatial imagery proposal does not 
predict differences in the difficulty of re- 
trieving the information pertinent to alter- 
native questions. The proposal is that, 
before answering any question, the problem 
solver creates a spatial array of all three 
items on the basis of which he can answer 
any question about the order of the three 


items (except, of course, when he has failed 
to construct the array correctly). One 
would expect questions to differ in difficulty 
depending on how hard they themselves 
are to understand. Thus, questions with 
marked adjectives should be more difficult 
than those with unmarked adjectives. Indeed,
they are.
The difficulty of answering a question is 
also affected by the relation between the 
way the question is stated and the way the 
premises are stated. After examining this 
interaction, we believe that it is best ex- 
plained by postulating that the meaning 
of an adjective becomes temporarily more 
accessible immediately after it has been 
used. Our reasons will become apparent 
from the analysis below. (Since we did not 
originally investigate the effect of question 
form on problem difficulty, we use the data 
from Clark, 1969.) 
For both "more" and "not as" syllogisms,
the first and second premises may involve
either the same or different adjectives. In
the former case, the adjective in the question
can either match or fail to match the
adjectives in the premises. In the latter
case, the adjective in the question can
either match the first premise and not
match the second, or can match the second
premise and not match the first. Regardless
of whether the two premises have the
same or different adjectives, Clark's data
show that a question is easier to answer
when the adjective in the question matches
that in the second premise.
This match-mismatch effect apparently
derives entirely from the second premise,
since the size of the effect is equally large
whether or not the adjective in the first
premise also matches the one in the question:
When the adjective is the same in
both premises, the average reaction time
(RT) in seconds is 5.77 when the adjectives
from both premises match that in the
question, and 6.37 when they do not match
it; when the adjective is different in the
two premises, RT is 5.62 when it is only
the adjective in the second premise which
matches that in the question, and 6.24 when
the second premise adjective does not
match. Indeed, when the answer to the
question is in the first premise, it is totally
irrelevant whether or not that question
uses the same adjective which was in the 
first premise. That is, when the answer is 
in the first premise, RT is 6.12 when the 
adjective in the question matches that in 
the first premise, and 6.00 when the adjec- 
tive in the question does not match that in 
the first premise. (Since the effects for 
question answering do not differ for “more” 
and “not as" problems, we combined these 
for all our comparisons.) 
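
The size of the match-mismatch effect can be checked directly from the reaction times quoted above. The short computation below is only a worked restatement of those figures; the dictionary keys and the helper name `effect` are labels introduced here for illustration, and the times are the means in seconds given in the text (from Clark, 1969).

```python
# Mean reaction times in seconds, copied from the text above (Clark, 1969).
# The dictionary keys are labels introduced here for illustration.
rt = {
    "same_adjective_match": 5.77,
    "same_adjective_mismatch": 6.37,
    "second_premise_match": 5.62,
    "second_premise_mismatch": 6.24,
    "first_premise_match": 6.12,
    "first_premise_mismatch": 6.00,
}


def effect(mismatch_key, match_key):
    """Return mismatch RT minus match RT, rounded to hundredths of a second."""
    return round(rt[mismatch_key] - rt[match_key], 2)


same_adjective_effect = effect("same_adjective_mismatch", "same_adjective_match")
second_premise_effect = effect("second_premise_mismatch", "second_premise_match")
first_premise_effect = effect("first_premise_mismatch", "first_premise_match")

if __name__ == "__main__":
    print(same_adjective_effect, second_premise_effect, first_premise_effect)
```

The cost of a mismatch with the second premise is about 0.60 sec. when both premises share an adjective and 0.62 sec. when they differ, whereas a mismatch with the first premise alone shows a difference of -0.12 sec., that is, no cost at all.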

This match effect is not restricted to 
cases where the item which is the answer 
is contained in the second premise. Even
if the answer is in the first premise, it is
still easier to answer the question if it
involves the adjective used in the second
premise, although not as much so as when
the answer is contained in the second
premise. When a question uses the same
adjective as the second premise, RT is 
5.42 when the answer is in the second pre- 
mise and 5.97 when it is in the first, as 
opposed to 6.31 when the adjective in the 
question does not match the adjective in 
the second premise. (The fact that a 
question about an item in the second pre- 
mise is easier than one about an item in the 
first premise may well be due to another 
type of recency effect.) 

The fact that only the adjective in the
second premise affects question difficulty
suggests that the only function of the particular
adjective, over and above its function
in determining the order of items, is to
make the meaning of the adjective in the
question more accessible. That is, if a
person has just determined the meaning of
a particular adjective in order to comprehend
the second premise, he finds it easier
to determine its meaning again when that
adjective appears immediately afterward in
the question. If this accessibility factor
is temporary, it should be sensitive to any
separation between the second premise and
the question. Smith, Coriell, and
McMahon² find that the effects of match
between a question and a sentence are
indeed reduced if S must say three digits
between presentation of the sentence and
the question. These results suggest that
there is no reason to postulate that people's
mental representation of item order includes
information that the items are at a
particular end of the dimension. In fact,
there is no reason to postulate that information
about the particular adjective used
is stored more than momentarily.

² K. Smith, J. Coriell, and L. McMahon. The role
of active and passive voice in answering questions.
In preparation.


CLARK'S PROPOSAL ABOUT HOW
PEOPLE SOLVE ORDERING
SYLLOGISMS


In this section, we review a paper by 
Clark (1969) which proposes three princi- 
ples to explain how people solve syllogisms. 
The first principle, the primacy of func- 
tional relations, asserts that the primitive 
conceptual relations out of which sentences 
are constructed are stored in a more readily 
available form after comprehension than 
other kinds of information. Clark claims 
that all comparative sentences, such as 
“Joe is worse than Sam” can be analyzed 
into: (a) two base strings, “Joe is bad" and 
"Sam is bad," which he takes to be the 
functional relations, and (b) the comparison
“more than”; and that a is more available
than b. We rejected the claim that comparatives
with ratio scale adjectives can be
broken into two such propositions. We
did not reject this analysis for ordinal scale
adjectives or extreme adjectives. However,
as we noted above, in the context of the
syllogism it is only necessary to postulate
b, not a. That is, it is not necessary to
postulate that Ss retain, more than momentarily,
any information from the premises
concerning at which end of the dimension
the items are being compared.

Since Clark's first principle asserts that 
comparatives with “good” and “bad” have 
different underlying meanings, and no
specification is given of the relation between
these meanings, it should be impossible to
solve a syllogism which has different polar
adjectives in its two premises. It is only
because “good” and “bad” can both be
used to order items along the same dimension
that it is possible to solve a problem
where the two premises are “Sam is better
than Harry, Joe is worse than Harry.”

The second principle, the principle of
lexical marking, attempts to explain the
fact that comparatives involving unmarked 
adjectival forms are easier to interpret 
than those involving marked forms. It is 
based on the fact that unmarked adjectives 
can be used in either a nominal sense to 
indicate a dimension or in a contrastive 
sense to indicate one pole of the dimension, 
whereas marked adjectives can only be 
used contrastively. He points out that 
the nominal use can be considered lexically 
simpler than the contrastive use in that it 
indicates the dimension without indicating
the pole and concludes that the unmarked 
adjective is easier than the marked because 
only it can be used nominally. First, there 
is no empirical evidence that nominal usage 
of unmarked adjectives is easier for people 
to understand than their contrastive usage. 
The factors which might lead linguists to 
represent contrastive usage as more com- 
plex might or might not be paralleled by 
factors which would lead the psychologist to 
conclude that contrastive usage is cogni- 
tively more complex. Second, as noted 
above, syllogisms with “+” pole extreme 
adjectives in both premises are easier than 
those with " —" pole extreme adjectives in 
both premises even though the “+” pole 
extreme adjectives cannot be used nominally.
Finally, if the differences in difficulty
between expressions involving marked
and unmarked adjectives result from the 
unmarked adjective being used in a nominal 
sense, then expressions which involve the 
contrastive senses of unmarked and marked 
adjectives should be equal. However, 
Sherman found significant differences between
marked and unmarked adjectives
where both were used contrastively; for
example, “The man is not sad” is more
difficult than “The man is not happy.”

To pursue his argument, Clark adopts a 
peculiar position which he mistakenly 
takes to derive from linguistic theory; 
namely, that comparatives with unmarked 
adjectives involve nominal use and com- 
paratives with marked adjectives involve 
contrastive usage. This would suggest that 
the unmarked form could be used to make 
comparisons anywhere within the scale but
that the marked form could only be used
near one end of the scale; for example, that
“taller” or “faster” could be used to compare
items throughout the dimensions of
height or speed, but that “shorter” or
“slower” could only be used to compare
items that were “short” or “slow.” This is
a false claim; recall our example above,
“A jet plane is slower than a rocket ship.”
The reader can easily verify for himself
that, for adjectives from ratio scales, there
is no such asymmetry between the breadth
of applicability of marked and unmarked
forms in “more” comparatives.

In any case, this second principle con- 
flicts with the analysis of the comparative 
construction in the first principle. Recall 
that Clark claims that these include two 
underlying propositions, each clearly in- 
volving contrastive usage, regardless of 
whether the adjective is marked or un- 
marked. Clark never resolves this contra- 
diction by retracting his first principle for 
the case of unmarked adjectives and sug- 
gesting some other analysis. 

Given this contradiction, it is difficult to 
consider the predictions to which the analy- 
sis might lead. Since the second principle 
only provides a rationale for the finding 
that comparatives are easier with unmarked 
rather than marked adjectives, let us con- 
sider the predictions from the first principle. 
This principle has two parts: that the com- 
parative is analyzed into two simple propo- 
sitions and that the information in each of 
the two is more accessible than the com- 
parative information about the relation 
between the two items. The second part, 
the matter of relative accessibility, is never 
used to make any predictions. The first 
part, that Ss do analyze comparatives into 
such underlying assertions, is tested in 
conjunction with his third principle, 
“congruence.” 

The principle of congruence asserts that 
the form of a question must match the form 
of stored information concerning the correct 
answer in order for S to be able to answer. 
Clark does find that in the case of single 
comparative sentences it is easier to 
answer “better” questions for comparatives 
involving “good” (both “A is better than
B” and “B is not as good as A”) and
“worse” questions for comparatives involving
“bad.” This fact is taken to verify
both the first and third principles, namely,
that the underlying form of “good” comparatives
must include “good” and that
congruence between statement and question 
is necessary. As was pointed out above, 
when we proposed a "temporary accessi- 
bility" factor, what is important is the re- 
lation between the adjective in the question 
and the adjective in the second premise, not 
whether the adjective of the question 
matches the adjective of the premise which 
contains the answer. In addition, this 
congruency principle would not predict 
that match effects between a question and 
a premise would be reduced by the inter- 
polation of digits as found by Smith et al. 
(see Footnote 5). 

These three principles concern the storage 
and retrieval of information from individual 
comparative sentences. They do not in- 
volve any proposal about how information 
from these individual comparative sen- 
tences is combined to yield the order of all 
three items. In order to explain effects 
due to combining information from two 
sentences—notably, the fact that reordering 
a particular pair of premises has marked 
effects on problem difficulty—Clark in- 
vokes the notion of "compression," which 
is in no way related to his other three 
principles. This notion is introduced as if 
it were an empirical generalization which is 
of no theoretical significance. It is nevertheless
made to bear the major burden in
predicting differences between alternative 
forms of the three-term series problems. 

“Compression” refers to a postulated tendency
to retain only one of two items from
the first premise; for example, from the
premise “John is better than Bill,” the
claim is that Ss retain only “John is the
better one.” When the second premise is
either “Sam is better than John” or “John
is worse than Sam,” the name retained from
the first premise is mentioned in the second.
Although Clark does not indicate how Ss
combine this information about John with
the information about John and the third
person in the second premise, it seems possible
that such a mechanism might be
worked out; but, even then, only one of the
two possible questions could be answered.
For instance, in our example above, the
question “Who is worst” is unanswerable,
as information about Bill has been lost.
When the second premise is either “Sam is 
worse than Bill” or "Bill is better than 
Sam," however, the item S retains from the 
first premise is not even mentioned in the 
second. In this case, the information about 
John from the first premise bears no relation 
to the information about Sam and Bill from 
the second premise. Clark (1969) is aware 
of this problem and proposed that 

in this case S must try to recover the whole first
proposition or back-track with the information that
“Bill is the better one” to apply it to the first
proposition or revert to some other time-consuming
strategy [p. 397].

Since Clark does not tell us what this time- 
consuming strategy might be, he cannot be 
said to have provided a mechanism by 
which Ss can solve the problems in these 
cases. Furthermore, the claim that these 
problems require a time-consuming strategy 
is contradicted by Clark’s data as well as 
that of other investigators. The problems 
which are easiest overall do not mention in 
the second premise the item Clark claims 
that Ss retain from the first; namely, 
problems with the following premises: “A
is better than B; C is worse than B” or
“C is worse than B; A is better than B.”

In addition to the above criticisms of 
Clark's principles, there is further empirical 
evidence indicating that one or more of
his principles must be incorrect, since 
together they make predictions which are 
contradicted by Clark's own data. 

As an example, consider the two forms (a)
“A is better than B, B is better than C; who
is best?” and (b) “C is worse than B, B is
worse than A; who is worst?” and compare
them with the forms (c) “B is worse than
A, B is better than C; who is best?” and
(d) “B is better than C, B is worse than A;
who is worst?” We pick this comparison
because, within the context of Clark's
principles, one cannot possibly explain why
a and b, where the question is congruent,
should not be easier than c and d, where the
question is incongruent. First, B is
described as both “bad+” and “good+”
in problems c and d, which, as we pointed
out above, should render these problems
insoluble. Second, if Ss compress the first
premise, then a and b must be easier than c
and d because, for the former, the item re- 
tained from the first premise (A or C, re- 
spectively) is the answer, whereas, for the 
latter, the item retained (B in both cases) 
is not the answer. Third, while a and b 
versus c and d do differ in terms of the prin- 
ciple of lexical marking, this factor should 
have no overall effect on difficulty because 
a has two unmarked adjectives and b has 
two marked adjectives whereas c and d each 
have one premise with a marked and one 
with an unmarked adjective. Certainly 
then, c and d should have longer latencies 
than a and b if Clark's approach is correct.
This is not the case. (The latencies in
seconds from Clark's data are as follows:
c = 5.00 and d = 5.45 versus a = 5.42 and
b = 6.53.) The reader can find similar
contradictions in the negative-equative
problems.
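
The comparison of latencies above can be checked with a few lines of
arithmetic. The following is a minimal sketch in Python that uses only
the four latencies quoted in the text; the variable names and the
grouping labels are illustrative assumptions, not part of Clark's
analysis.

```python
# Latencies in seconds for forms a-d, as quoted in the text from Clark's data.
latencies = {"a": 5.42, "b": 6.53, "c": 5.00, "d": 5.45}

# Illustrative grouping (labels are ours): in forms a and b the question is
# congruent with the premises; in forms c and d it is incongruent.
congruent = ["a", "b"]
incongruent = ["c", "d"]


def mean_latency(forms):
    """Return the mean latency, in seconds, over the named forms."""
    return sum(latencies[f] for f in forms) / len(forms)


mean_congruent = mean_latency(congruent)
mean_incongruent = mean_latency(incongruent)

# On the reading given in the text, Clark's principles predict that the
# incongruent forms should take longer than the congruent ones.
prediction_holds = mean_incongruent > mean_congruent
print(round(mean_congruent, 3), round(mean_incongruent, 3), prediction_holds)
```

The sketch compares only the two group means; it says nothing about the
variability of the individual latencies, which the text does not report.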


CONCLUDING REMARKS 


In this paper, we have considered how 
people process various types of adjectives 
and comparative sentences; how they 
combine information from two comparative 
expressions to determine the order of all 
three items; and how they answer questions 
posed in the problems. Many of the issues 
we have considered cannot be fully re- 
solved to our satisfaction at present, par- 
ticularly the issue of whether imagery can 
be conclusively demonstrated to be superior 
to more abstract logical formulations in 
explaining the behavioral data related to 
combining information from the two syllo- 
gistic premises. In any case, the obvious 
difficulty of resolving this question raises 
the interesting and more general issue of 
whether different types of symbolic schemes
vary in their potential for representing and
transforming different kinds of information.

In attempting a rapprochement of J. [...]
the linguistic theory. This reply discus[ses] [...]
[Hutten]locher and Higgins made of the linguistic
theory fails on empirical grounds. In addition,
this reply points out several fundamental
misrepresentations Huttenlocher and Higgins made
of the linguistic theory as well as their incorrect
use of the terms “marked” and “unmarked.” It is
argued, in sum, that the available evidence
invalidates those authors' “more complete
explanation of how people solve ordering
syllogisms.”

Huttenlocher and Higgins (1971) attempted
a rapprochement between the
reasoning theories of Clark (1969a) and
Huttenlocher (1968), hereafter the linguistic
and imagery theories, respectively,
both of which were designed to account for
how people solve three-term series problems
(e.g., “John is better than Pete, John is
worse than Sam; who is best?”). In their
attempt, they accepted the imagery theory
without any fundamental change and discussed
the linguistic theory in detail,
rejecting each of its proposals and devising
others where needed. But in their paper,
they failed to consider a sizable body of
available evidence which shows that (a)
the imagery theory, far from being totally
acceptable, is empirically wrong, (b) their
rejections of the linguistic theory's principles
of congruence and lexical marking
fail empirically, (c) their rejection of the
principle of functional relations fails on
both psychological and linguistic grounds,
and (d) their notion of “temporary accessibility,”
replacing the principle of congruence,
is disconfirmed by the data. In
addition, Huttenlocher and Higgins used
the terms “marked” and “unmarked” incorrectly
throughout most of their paper.
These deficiencies in Huttenlocher and
Higgins' paper are serious, for, if real, they
overturn each of its major points and
totally invalidate their attempt at a “more
complete explanation of how people solve
ordering syllogisms.” Thus, it is important
to consider each of these deficiencies in
detail.

¹ This reply was supported in [...] Public Health
Service. Re[...] the National Institute [...]


DISCONFIRMATION OF THE 
IMAGERY THEORY 


In the imagery theory, the subject is 
seen as placing the three terms of a problem 
into a mental array. For the premise 
combination “Abel is better than Baker, 
Charlie is worse than Baker," for example, 
the subject first places Abel and Baker of 
the first premise into the array, then he 
places the “new” term of the second pre- 
mise (Charlie) into the array with respect 
to the "old" term of the second premise 
(Baker). It is the grammatical function 
of the “new” term in the second premise 
that is crucial. Whenever it is the subject
of the second premise (as Charlie is, above),
placement is easy; whenever it is not,
placement is difficult. This prediction
derives from actual placement tasks in
which people find it easier to place a “new”
object into a physical array when it is the
subject of a comparative sentence than
when it is not. This theory, however, is
disconfirmed by two independent pieces of
evidence, both derived from data not
considered by Huttenlocher and Higgins.


TABLE 1
PERCENT ERRORS FROM CLARK (1969b) IN
“DETERMINATE” PROBLEMS FAVORED
AND DISFAVORED BY THE
IMAGERY THEORY

Problems                               Favored      Disfavored   Disfavored
                                       problems     problems     minus favored

Comparative
  Premises with same adjective         [illegible]  [illegible]  [illegible]
  Premises with different adjectives   10           32           [illegible]
  Total                                13           33           +20**
Negative-Equative
  Premises with same adjective         33           24           [illegible]
  Premises with different adjectives   40           28           -12
  Total                                36           26           -10**

Note. Each nontotal percentage is based on 400 responses.
*p < .01.
**p < .001.

1. In Clark (1969a), the imagery theory
was put to an explicit test by making use
of problems containing only negative-equative
premises (e.g., “Baker isn't as good as
Abel”). This test depended on only one
assumption: Assumption 1: People find it
easier in an actual placement task to place
a “new” item into an array whenever it is
the subject of a negative-equative sentence.
Given the imagery theory plus Assumption
1, it was possible to classify every three-term
series problem in Clark (1969a) as
either favored by the theory (because its
“new” term is the subject of the second
premise) or as disfavored (because its
“new” term is in the predicate of the second
premise). The error data from Clark
(1969b), summarized in Table 1, show that
whereas the imagery theory is correct in its
predictions for problems containing comparative
premises, it is consistently incorrect
for problems containing negative-equatives.
Therefore, if Assumption 1 is correct, the
imagery theory is disconfirmed by these
data as well as by parallel data in Clark
(1969a) and Huttenlocher, Higgins, Milligan,
and Kauffman (1970).
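
The classification rule just described (a problem is favored when its
“new” term is the subject of the second premise, disfavored when the
“new” term is in the predicate) can be made concrete. The following is a
minimal sketch in Python; the function name and the representation of a
premise as a (subject, predicate term) pair are assumptions made for
illustration and do not come from Clark (1969a).

```python
def classify(first_premise, second_premise):
    """Classify a three-term series problem under the imagery theory.

    Each premise is a (subject, predicate_term) pair, for example
    ("Abel", "Baker") for "Abel is better than Baker". The adjective and
    the sentence type are ignored: under Assumption 1 only the
    grammatical position of the "new" term matters.
    """
    old_terms = set(first_premise)
    subject, predicate_term = second_premise
    if subject not in old_terms and predicate_term in old_terms:
        return "favored"      # the "new" term is the subject of premise 2
    if predicate_term not in old_terms and subject in old_terms:
        return "disfavored"   # the "new" term is in the predicate of premise 2
    raise ValueError("second premise must share exactly one term with the first")


# "Abel is better than Baker, Charlie is worse than Baker"
example_favored = classify(("Abel", "Baker"), ("Charlie", "Baker"))

# "Abel is better than Baker, Baker is better than Charlie"
example_disfavored = classify(("Abel", "Baker"), ("Baker", "Charlie"))

print(example_favored, example_disfavored)
```

Because the sketch looks only at which term occupies the subject
position, the same function applies unchanged to negative-equative
premises such as “Baker isn't as good as Abel,” whose subject is Baker.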

To explain these negative-equative data, 
however, Huttenlocher et al. more recently 
contended, instead, that Assumption 2 is 
the proper one to make: Assumption 2:
People find it easier in actual placement
tasks to place a “new” item into an array
whenever it is the second term of certain
types of negative-equative sentences (e.g.,
those with “good” or “bad”). As evidence,
they pointed to two tasks. In Task 1,
subjects solved three-term series problems
while explicitly manipulating figures on a
felt board; in Task 2, other subjects solved
the same problems in their heads. The
results of Tasks 1 and 2 were parallel. From
this, Huttenlocher et al. argued: (a) Task 1
revealed the difficulties subjects had in
manipulating objects overtly; therefore (b)
the results of Task 2 were parallel because
the Task 2 subjects were also manipulating
objects in arrays, although covertly. For
all this to work out, Huttenlocher et al. had
to make Assumption 2 based on the
supposition that subjects implicitly “convert”
premises like “Abel isn't as bad as Baker”
into “Baker is worse than Abel” in such
reasoning tasks. Logically, however,
Huttenlocher et al. could equally well have
argued for the reverse: (a) Task 2 revealed
the linguistic difficulties subjects had in
taking in and using the language of these
problems; therefore (b) the results of Task
1 were parallel because the Task 1 subjects
were also prey to these linguistic difficulties.
Thus, because all difficulties in both tasks
could be linguistic in origin, Huttenlocher
et al.'s evidence cannot distinguish between
Assumption 1 and Assumption 2. The
resolution of this issue therefore requires an
independent and more direct test of
Assumptions 1 and 2.

Evidence reported in Clark (1970a, p.
300) provides just such a test, and it shows
that Assumption 1 is correct and Assumption
2 is incorrect. The evidence is that
adults find it easier in placement tasks to
place a “new” item into an array when it is
expressed as the grammatical subject in
both the comparative sentence and the
negative-equative sentence. More recent
unpublished experiments² have replicated
this consistent result in considerable detail,
showing how subjects could not have been
making the “conversions” Huttenlocher et
al. assumed and explaining why Huttenlocher
et al.'s Task 1 and Task 2 had
parallel results. In other words, the
original disconfirmation of the imagery
theory in Clark (1969a) still holds, since it
was based on the correct Assumption 1.

² H. H. Clark. Difficulties people have in answering
the question “Where is it?” In preparation.

2. Other data from Clark (1969b), on the 
so-called “indeterminate” problems, pro- 
vide an independent disconfirmation of the 
imagery theory without the use of Assump- 
tion 1 or Assumption 2. In “indeter- 
minate" problems, the two premises (e.g., 
"Abel is better than Baker, Alan is better 
than Baker") describe the three terms in an 
indeterminate order. For this particular 
pair of premises, one cannot tell whether
Abel or Alan is better, so when asked “Who
is best?” one must answer “cannot tell,”
although when asked “Who is worst?” one
is able to answer “Baker.” The imagery
theory applies to these problems too, for,
as Huttenlocher and Higgins state, “the
spatial imagery hypothesis provides Ss
with an algorithm for obtaining answers to
all soluble syllogisms, and for recognizing
which syllogisms are insoluble [pp. 498-499].”
And like the “determinate” problems,
each “indeterminate” problem can be
classified as either favored or disfavored by 
the theory. 

As the data summarized in Table 2 
indicate, there is no overall difference in
errors between favored and disfavored
problems. And since no differences emerge
even for comparative and negative-equative
problems taken separately, the imagery
theory is disconfirmed regardless of whether
one makes Assumption 1, as was done in
Table 2, or Assumption 2. Yet if one does
make Assumption 2 along with Huttenlocher
et al., then the comparative and
negative-equative problems with the same
adjective in both premises show a significant
difference (χ² = 5.00, df = 1, p < .05)
in the wrong direction. The 0% difference
between favored and disfavored problems
in Table 2 should be compared to the large
20% difference in the comparative problems
of Table 1, the difference the imagery
theory was originally designed to account
for. The linguistic theory, in contrast,
accounts for all the major differences in
these “indeterminate” problems.


TABLE 2
PERCENT ERRORS FROM CLARK (1969b) IN
“INDETERMINATE” PROBLEMS FAVORED
AND DISFAVORED BY THE
IMAGERY THEORY

Problems                               Favored      Disfavored   Disfavored
                                       problems     problems     minus favored

Comparative
  Premises with same adjective         13           10           -3
  Premises with different adjectives   37           40           +3
  Total                                25           25            0
Negative-Equative
  Premises with same adjective         19           24           +5*
  Premises with different adjectives   53           48           -5
  Total                                36           36            0

Note. Each nontotal percentage is based on 400 responses.
*p < .10.


To summarize briefly, two independent 
pieces of available evidence, not considered 
by Huttenlocher and Higgins, show that the 
imagery theory is empirically wrong. 


ON HUTTENLOCHER AND HIGGINS' CRITICISMS
OF THE LINGUISTIC THEORY


In considering the linguistic theory, 
Huttenlocher and Higgins rejected the 
principles of congruence, lexical marking, 
and functional relations as given in Clark 
(1969a) and proposed certain fundamental 
revisions. This section details how each of 
their criticisms and revisions fails on 
empirical grounds and, furthermore, how 
they erred in their use of certain linguistic 
terms and linguistic evidence. 


The Principle of Congruence 


This principle posits, roughly, that a 
question should be easier to answer when- 
ever the question and the premise con- 
taining the answer are congruent in their 
underlying representations. After examining
certain solution times in Clark (1969a),
Huttenlocher and Higgins rejected this
principle, since they found that the question
appeared to be easier only when it was
congruent with the second premise. So, as
an alternative to congruence, they suggested
that “the meaning of an adjective
becomes temporarily more accessible
immediately after it has been used [p. 500].”


TABLE 3
TWO EXEMPLARY “INDETERMINATE”
PROBLEMS AND THEIR
PRINCIPAL ERRORS

Problem                                   Erroneous answer     Percentage

VII.  If Abel is better than Baker,       Abel (congruent)     26
      and Baker is worse than Alan,       Alan (incongruent)    6
      then who is best?                   Baker (neither)       2

VII'. If Abel isn't as bad as Baker,      Abel (incongruent)   12
      and Baker isn't as good as Alan,    Alan (congruent)     30
      then who is best?                   Baker (neither)       2

Note. Each problem was attempted by 100 subjects.


In relying on only the solution times in 
Clark (1969a) for evidence, however, 
Huttenlocher and Higgins rejected con- 
gruence and accepted their own “accessi- 
bility” notion on a single unreplicated 
result. While they made detailed use of 
the solution times, which were drawn from 
13 subjects with two observations per 
subject per problem, they bypassed the 
more precise error data collected on these 
same problems from 100 subjects in Clark 
(1969b). The latter data give complete 
support to congruence and no support to 
"accessibility." First consider only the 
"determinate" problems, the same prob- 
lems on which Huttenlocher and Higgins 
based their rejection of congruence. When 
both premises contain the same adjective, 
problems favored and disfavored by con- 
gruence are also favored and disfavored by 
"accessibility," and these produced 24% 
and 30% errors, respectively. This result 
parallels the solution times. But when the 
two premises contain different adjectives, 
congruence and “accessibility” make different
predictions. The problems favored and
disfavored by congruence produced 19%
and 36% errors, respectively, in parallel
with the solution times and in agreement
with the data from all previous experiments
(as reviewed in Clark, 1969a). But the
problems favored and disfavored by
“accessibility” produced 28% and 27% errors,
respectively, a result slightly in the wrong
direction. This difference is not parallel to
the solution times in Clark (1969a) and
suggests that the difference favoring
“accessibility” in the solution times was a chance
effect. This conclusion is confirmed by an
examination of the “indeterminate” problems.
“Accessibility” was favored 16% to
16% in problems with the same adjective
in both premises and 44% to 45% in
problems with different adjectives in the
two premises, both differences being
nonsignificant. In contrast, congruence was
supported 18% to 23% (a significant
difference) in the first type of problem,
whereas no such direct classification of
problems is possible with the latter type of
problem (but see below for other predictions).
To summarize, where the predictions
of congruence and “accessibility”
are different, congruence is supported by
(a) the latencies on “determinate” problems,
(b) the error data on “determinate”
problems, and (c) the error data on
“indeterminate” problems, with no
counterevidence. “Accessibility,” on the other
hand, is supported only by the unreplicated
difference Huttenlocher and Higgins noted
in the solution times of “determinate”
problems. Congruence is well established
by these data. “Accessibility” is not.
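
The error percentages cited in this paragraph can be laid side by side
to make the contrast explicit. The following is a minimal sketch in
Python that takes each pair of percentages exactly as quoted above and
computes the disfavored-minus-favored difference; the dictionary layout
and names are illustrative assumptions, and no significance testing is
attempted because the text does not give the underlying counts.

```python
# Percent errors (favored, disfavored) as quoted in the text from
# Clark (1969b). Keys are (problem type, premise adjectives, principle).
errors = {
    ("determinate", "different adjectives", "congruence"): (19, 36),
    ("determinate", "different adjectives", "accessibility"): (28, 27),
    ("indeterminate", "same adjective", "congruence"): (18, 23),
    ("indeterminate", "same adjective", "accessibility"): (16, 16),
    ("indeterminate", "different adjectives", "accessibility"): (44, 45),
}


def advantage(favored, disfavored):
    """Percentage-point advantage of favored over disfavored problems.

    A positive value means the principle's favored problems produced
    fewer errors, as the principle predicts.
    """
    return disfavored - favored


advantages = {key: advantage(*pair) for key, pair in errors.items()}

for key, diff in advantages.items():
    print(key, diff)
```

On these figures the differences for congruence are consistently
positive, whereas those for “accessibility” hover around zero, which is
the pattern the paragraph describes in prose.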

In their rejection of congruence, Huttenlocher
and Higgins also failed to consider
perhaps the most striking evidence supporting
congruence: the specific errors the
subjects in Clark (1969b) made on “indeterminate”
problems with different adjectives
in the two premises. Consider the two
illustrative problems in Table 3. Although
the correct answer to both problems is “cannot
tell,” subjects frequently erred, most
often giving one of the two “better” terms
(Abel or Alan) as answers. Of these
two terms, however, subjects overwhelmingly
chose the term from the premise
congruent with the question, Abel for Problem
VII and Alan for Problem VII'. Over all
eight such “cannot tell” problems, the
choice was 26% to 9% for the congruent
term, a result predicted by congruence but
not by “accessibility.” This result illustrates
one of the strengths of the linguistic
theory: it predicts not only how many errors
subjects should make, but also, in many
instances, which errors. The imagery
theory predicts only how many, thereby
failing to account for these data.

Congruence, moreover, has been demonstrated
to be a very general cognitive
principle in other studies. It has been 
shown to apply to verification and question- 
answering in tasks using negatives (Chase 
& Clark, in press; Clark, 1970b, in press; 
Trabasso, Rollins, & Shaughnessy, 1971), 
locatives of various sorts (Clark, in press), 
actives and passives (Clark, in press), as 
well as comparatives. Thus, to reject 
congruence and replace it by “accessi- 
bility” generally, Huttenlocher and Higgins 
would have to contend with this much wider 
range of phenomena. Yet the latter sup- 
port congruence and will not admit an 
explanation based simply on the “tem- 
porary accessibility" of a word's meaning. 
One must conclude that the single un- 
replicated result Huttenlocher and Higgins 
quote is insufficient to support their case. 


Principle of Lexical Marking 


A major part of Huttenlocher and 
Higgins’ paper consists of a detailed re- 
capitulation of the linguistic evidence for 
marked and unmarked adjectives presented 
in Clark (1969a) plus a discussion of some
other interesting evidence on adjectival
scales. The main goal of all this was to
consider two proposals for the advantage
of unmarked over marked adjectives (e.g.,
“good” over “bad”) when used in three-term
series problems: Proposal 1: An unmarked
adjective used in its “nominal”
sense is easier to comprehend than an un- 
marked or marked adjective used in its 
“contrastive” sense. Proposal 2: Certain 
adjectives are the "negatives" of either 
their unmarked or their positive counter- 
parts, and for this reason, the former ad- 
jectives are more difficult to comprehend. 
Huttenlocher and Higgins rejected Proposal 1
from Clark (1969a), although on
no negative evidence, and suggested Proposal 2
in its place. Proposal 2 is reasonable
and has been previously made in two
recent revisions of the linguistic theory
(Clark, 1970b, in press). On the other
hand, the rejection of Proposal 1 is wrong.

Huttenlocher and Higgins rejected Proposal 1
on the grounds that “there is no
empirical evidence that nominal usage of
unmarked adjectives is easier for people to
understand than their contrastive usage
[p. 501].” But they did not consider
the solution times, collected by Jones 
(1970), on three-term series problems in 
which the adjective of both premises was 
"thicker" or "thinner," or “fatter” or 
"thinner." As Jones pointed out, "thick- 
thin" is an unmarked-marked pair, where- 
as “fat—thin" is not, so "thick" can be used 
in a nominal sense in comparatives, whereas 
fat" cannot. But "fat-thin," according 
to Huttenlocher and Higgins, is a positive- 
negative pair. l'roposal 1 alone predicts an 
advantage of "thicker" over "thinner" in 
Jones' data, but no advantage of “fatter” 
over "thinner." Proposal 2 alone predicts 
the same advantage of “fatter” over 
"thinner" as for "thicker" over "thinner," 
since “thinner” is "negative" in both pairs. 
Jones’ results show a significant 4.5-second 
advantage of "thicker" problems over 
"thinner" problems, but onlv a nonsig- 
nificant 1.9-second advantage of “fatter” 
problems over "thinner" problems. This 
constitutes support for Proposal 1 indepen- 
dent of Proposal 2, contrary to Hutten- 
locher and Higgins’ claim of “no empirical 
evidence." Jones’ analogous results on 
"heavier-lighter" (an unmarked-marked 
pair) versus "darker-lighter" (not an un-
marked-marked pair) also constitute sup-
port for Proposal 1, no matter whether one
considers the "positive" member of the
latter pair to be "dark" or "light." 

The conclusion drawn in Clark (in press),
therefore, was that the attested advantage
of unmarked over marked adjectives in the
pairs "higher-lower," "taller-shorter,"
"longer-shorter," "farther-nearer," "thick-
er-thinner," "deeper-shallower," "bigger-
smaller," "more-less," "faster-slower,"
"warmer-colder," "better-worse," "hap-
pier-sadder," and "older-younger" (cf.
Clark, 1969a) could be accounted for by
Proposal 1 or Proposal 2 or both in con-
junction. At this point, there is evidence
for both proposals and no evidence to reject
either one. Furthermore, Huttenlocher and
Higgins' unqualified use of the word "nega-
tive" in discussing Proposal 2 leaves the
impression that marked adjectives like
"short" should behave psychologically like
the negatives "not," "no," "none," etc. This is
not the case. In timed verification tasks,
the pattern of reaction times to true and
false sentences containing marked adjec-
tives is like that for positive sentences, not
like that for sentences with "explicit nega-
tives," even though marked adjectives
without -er take longer to verify than
unmarked ones (cf. Clark, 1970b; Just &
Carpenter, 1971).

FIG. 1. A schematic diagram of the primary,
secondary, and tertiary reference points (RPs) on
the scale of height for the comparative "A is shorter
than B."

An additional serious problem in Hutten- 
locher and Higgins' discussion of Proposals 
1 and 2 is that they continually misapplied 
the terms “unmarked” and “marked,” two 
terms central to their paper. They first 
defined "unmarked" adjectives as those 
(like "long") that can take on a neutral 
sense in questions (How long is the board ?), 
in the scale name (length), or in quantified 
expressions (two feet long) ; this definition, 
also found in Clark (1969a), agrees with 
the linguistic application of these terms 
(Greenberg, 1966; Lyons, 1968, p. 466). 
But then, without indicating a change in 
definition, Huttenlocher and Higgins ap-
plied the term "unmarked" to any term
they thought was "positive." In their
Table 1, for example, they refer to "hot-
cold," "noisy-quiet," "rich-poor," "dirty-
clean," "hard-soft," "fat-thin," and
"smart-stupid" as unmarked-marked pairs,
even though none of these fulfills the proper
linguistic conditions. How they actually
did make their classification is not clear.
They might have required that the alleged
"unmarked" adjective be positive, where
"positive" is defined as an excess on the 
scale underlying the pair of adjectives. So 
"rich" would be positive since it denotes 
an excess of "amount of money." But 
even this criterion fails, for example, on
"hard-soft." Does "hard" refer to an
excess on "amount of hardness" or to a
lack on "amount of softness"? Their
choice of the first possibility seems arbitrary 
and unmotivated. It was their confusion 
of “unmarked” and “positive” perhaps that 
led them to overlook Jones’ evidence, dis- 
cussed above, on "thicker-thinner" versus 
"fatter-thinner" and its relevance to Pro- 
posals 1 and 2. 




Principle of Functional Relations 


Huttenlocher and Higgins question this 
principle mainly by questioning the analysis 
of the comparative given in Clark (1969a).
This analysis, presented explicitly as simply
a brief paraphrase of well-known linguistic
analyses by Lees (1961), Smith (1961), 
Chomsky (1965), Huddleston (1967), and 
Doherty and Schwartz (1967), noted that 
“John is better than Pete” is derived, by a 
series of linguistic rules, from the two “base 
strings" “John is good" and "Pete is good" 
or, as expressed in Clark (in press), "from
something like that which underlies John
is so good and Pete is so good, where John is
so good means something like 'John is good
to an unspecified degree.'" Huttenlocher
and Higgins' attack on this discussion is
based on several fundamental misunder-
standings of this linguistic analysis.

The main problem is that Huttenlocher
and Higgins mistakenly interpret the lin-
guistic analysis as claiming that "John is
better than Pete" implies that both John
and Pete are good in an absolute sense.
Their first remark was, "Clark claims that these
[comparative constructions] include two
underlying propositions, each clearly in-
volving contrastive usage [e.g., that found
in "John is good"], regardless of whether
the adjective is marked or unmarked
[p. 502]." But this is explicitly contra-
dicted in the following quotations from
Clark (1969a): "It will be argued that the
goods [underlying "John is better than
Pete"] can be either nominal or contrastive,
but the bads [underlying "Pete is worse
than John"] must be contrastive [p. 391]."
And: “The intelligent underlying [This idiot 
is more intelligent than the idiot] must 
therefore be nominal, so that the base string 
this idiot is intelligent really only means ‘this 
idiot has measurable intelligence’ [p. 391].”’ 
Later, Huttenlocher and Higgins correctly 
pointed out that ‘‘Jets are slower than 
rockets" does not presuppose the truth of 
the assertions Jets are slow and Rockets are 
slow, but they incorrectly claimed that this 
contradicts the linguistic analysis of the 
comparative. In making this claim, they 
have assumed that the linguistic analysis 
disallows the use of fast and slow in their 
"relative" senses in comparatives. This 
assumption is false. 

To understand this point, consider “A is 
shorter than B” and the three zero points, 
or reference points, implicit in it. As 
illustrated schematically in Figure 1, the 
primary reference point is the zero point 
for the whole scale of “height” and defines 
the zero for the "nominal" tall (as in six
feet tall). The secondary reference point 
is the temporary standard or middle point 
to which objects are compared; objects 
above the standard are "tall," and those 
below it are "short." Thus, the secondary 
reference point is the zero for both “con- 
trastive’’ tall (from there upward) and 
“contrastive” short (from there down- 
ward). Finally, the tertiary reference point 
is the standard chosen for the particular 
comparison “A is shorter than B,” where 
A is being described relative to B. In this 
schema, the secondary reference point can 
range almost anywhere on the scale, for
"tall" and "short" are relative and not ab-
solute adjectives. One can say "John is
very tall for a dwarf, but very short for a
giant," successively using two quite dis-
parate secondary reference points. In
terms of this schema, the linguistic theory
makes Claim 1, and the imagery theory,
Claim 2: Claim 1: The choice of "taller" for
comparing A and B presupposes that the
secondary reference point is at least as low
as A and B; the choice of "shorter" pre-
supposes the opposite. Claim 2: The choice
of "taller" or "shorter" for comparing A
and B makes no presuppositions about the
positions of A and B relative to the second-
ary reference point. According to Claim 1,
"A is shorter than B" presupposes some- 
thing like “A and B are short to an unspeci- 
fied, perhaps null, extent relative to the 
secondary reference point." Thus, “A is 
shorter than B” does presuppose the short- 
ness of A and B, but only relative to the 
secondary reference point, whose position 
is perhaps very high in absolute terms. 

The evidence Huttenlocher and Higgins 
quote against Claim 1—that “Jets are 
slower than rockets’ does not presuppose 
that jets and rockets are slow—is therefore 
defective. This sentence presupposes sim- 
ply that jets and rockets are slow relative 
to some very high standard speed, the 
secondary reference point. This is possible 
because the secondary reference point can 
be placed almost anywhere, as in “Jets are 
slow for modern aerial weaponry.” When 
the secondary and tertiary reference points 
are identical, in fact, one can say “Jets are 
slow relative to rockets.” The last two 
sentences both use a “contrastive” slow, as 
required by the linguistic theory, but 
neither implies an absolute “Jets are slow,’ 
contrary to Huttenlocher and Higgins’ 
interpretation of the linguistic analysis. In 
short, the evidence they cite is irrelevant to 
Claim 1. 
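
The contrast between Claim 1 and Claim 2 can be made concrete with a minimal sketch. It assumes a numeric scale whose primary reference point is zero; the class name, function names, and speed values are hypothetical and chosen only for illustration, not taken from either theory's authors.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Comparative:
    """'A is <adjective>-er than B' on a scale whose primary RP is 0."""
    a: float          # position of A on the scale
    b: float          # position of B (the tertiary RP for this comparison)
    unmarked: bool    # True for "taller"/"faster", False for "shorter"/"slower"


def claim_1_presupposition(c: Comparative, secondary_rp: float) -> bool:
    """Claim 1: the unmarked form presupposes a secondary RP at least as
    low as A and B; the marked form presupposes the opposite."""
    if c.unmarked:
        return secondary_rp <= min(c.a, c.b)
    return secondary_rp >= max(c.a, c.b)


def claim_2_presupposition(c: Comparative, secondary_rp: float) -> bool:
    """Claim 2: no presupposition about the secondary RP at all."""
    return True


# Hypothetical speeds in arbitrary units (assumed values).
JET, ROCKET = 900.0, 28000.0
jets_slower = Comparative(a=JET, b=ROCKET, unmarked=False)
rockets_faster = Comparative(a=ROCKET, b=JET, unmarked=True)
```

Under these assumptions, "Jets are slower than rockets" satisfies Claim 1 only when the secondary reference point is placed at or above the speed of rockets, which includes the case where it coincides with the tertiary reference point.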

But there is also positive evidence for 
Claim 1 and against Claim 2. Claim 1 is 
that (a) "Jets are slower than rockets" and
(b) "Rockets are faster than jets," when
referring to the same situation, have
different secondary reference points, one
high and one low, respectively. The lin-
guistic theory claims that this difference
should affect reasoning in accordance with
the principle of congruence. The imagery
theory denies Claim 1 and any possible
effect of this difference on combining
information from premises, for, as Hutten-
locher and Higgins allege, "there is no
evidence that subjects' representation of
item order preserves any information about
which adjectives were used in the prem-
ises." But all the support for the principle
of congruence detailed above contradicts 
this allegation, supporting Claim 1 and 
disconfirming Claim 2. The allegation just 
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quoted, however, is false in an even more 
obvious way, for some problems require the 
use of the "information about which adjec- 
tives were used in the premises." To solve 
the problem "If Pete is poorer than Dick,
and John is rich, then who is poorest?"
(answer: Pete), subjects are forced to use 
the presuppositional information about 
Pete and Dick's relative poorness, informa- 
tion that Huttenlocher and Higgins agree 
is found in the first premise. This problem, 
which can be solved, cannot be solved any 
other way. It is a small step from here to 
assume that the same information facili- 
tates the solution of “If Pete is poorer than 
Dick, and John is richer than Dick, then 
who is poorest?” (answer: Pete), where the 
same presuppositional information is con- 
gruent with the question. But the pre- 
suppositions hinder the solution of “If Pete 
is poorer than Dick, and Pete is richer than 
John, then who is the poorest?" (answer: 
John), where the presuppositional informa-
tion that John is rich is incongruent with
the question. This, in essence, is the
position of the linguistic theory, and it
agrees with all the data.

Although Huttenlocher and Higgins deny
any difference in meaning between (a)
"Rockets are faster than jets" and (b)
"Jets are slower than rockets," and other
similar pairs, a closer look shows that they
cannot be right. The presuppositions of a
and b, though subtle, become highlighted
when a and b are altered in minor ways.
Note, first, that (c) "Rockets are no faster 
than jets" and the slightly odd (d) “Jets 
are no slower than rockets" both mean 
"Rockets and jets have the same speed," 
yet they are not semantically equivalent. 
The addition of the simple negative empha-
sizes relative fastness in c and relative
slowness in d. To see this, contrast the
sentence "Jets and rockets are very fast,
but jets are no faster than rockets" against
the unacceptable "*Jets and rockets are
very fast, but jets are no slower than
rockets." The evens in (e) "Rockets are
even faster than jets" and (f) "Jets are even
slower than rockets" have a similar effect,
just as they do with the noncomparative
(g) "Rockets are fast even relative to
jets" and (h) "Jets are slow even relative
to rockets." Also, many people find the
problem (i) "If Dick runs slower than Pete,
and John runs fast, then who is slowest?"
determinate, just as they should if they 
keep the same secondary reference point in 
the two premises. But they do not find 
the problem (j) “If Pete runs faster than 
Dick, and John runs fast, then who is 
slowest?" determinate, again as the refer- 
ence point analysis would predict. In 
contrast, Huttenlocher and Higgins would
claim that Problems i and j are equivalent
in their indeterminacy.

Huttenlocher and Higgins made at least
two other false claims in their discussion of
the principle of functional relations. First,
they issued the following challenge: "The
reader can easily verify for himself that,
for adjectives from ratio scales, there is no
such asymmetry between the breadth of
applicability of marked and unmarked
forms in 'more' comparatives [p. 502]."
Yet there is an obvious set of counter-
examples. Note that "The desk is shorter
than the door is wide" is acceptable, but
"The door is wider than the desk is short"
is not. In sentences like these, the second
adjective must be unmarked whenever it is
on a ratio scale. Here, just as with adjec-
tives from "ordinal" scales (cf. Clark,
1969a, p. 391), marked and unmarked
adjectives do differ in their "breadth of
applicability in more comparatives," and
the unmarked form is always broader.

Second, Huttenlocher and Higgins alleged
that "Clark's first principle [the primacy of
functional relations] asserts that compara-
tives with 'good' and 'bad' have different
underlying 'meanings,' and no specification
is given of any relation between these mean-
ings [p. 501]." But the discussion of
lexical marking in Clark (1969a) proposed
coding nominal good as [+Evaluative
[Polar]], contrastive good as [+Evalua-
tive [+Polar]], and contrastive bad as
[+Evaluative [-Polar]]; these are quite
explicit specifications of the relations be-
tween the meanings of "good" and "bad."
This point is expanded in Clark (1970a):
Although sentences (a) [John is better tha” 
LM ] and (b) [Peter is worse than Joh". 
translate rom or euPasitions, one C 
set of presu ppositions 
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another. Since both good and bad lie on the 
superordinate goodness scale, it is possible 
to say that worse means ‘less good than’ and 
better means ‘less bad than’ [p. 298]." 

To summarize, Huttenlocher and Hig- 
gins’ criticism of the principle of functional 
relations is based on misrepresentations of 
the analysis of comparatives in Clark 
(1969a) and elsewhere. Furthermore, their 
rejection of this principle fails empirically, 
for the data supporting congruence also 
fully support this principle. 


The “Compression” Strategy 


It was tentatively suggested in Clark 
(1969a) that subjects might be using the 
strategy of “compressing” premises—for 
example, shortening "A is better than B" 
implicitly to “A is the better one’’—in solv- 
ing the problems. This suggestion may 
well be incorrect, yet not for the reasons 
Huttenlocher and Higgins noted. Each 
piece of evidence they cited is inconclusive. 
Consider problems with the premise combi- 
nation (a) "A is better than B, and C is 
worse than B” and its inverse, and (b) “B is 
worse than A, and B is better than C," and 
its inverse. As Huttenlocher and Higgins 
pointed out, compression predicts that a 
will be harder than b, but the principle of 
congruence predicts the reverse. Since a 
was easier than b in Clark (1969a) and 
elsewhere, they rejected compression. But 
this conclusion does not follow, for com- 
pression and congruence could be operating 
against each other in these premise com-
binations, with congruence having the
larger effect. Huttenlocher and Higgins 
also argued against compression by pitting 
two problems favored by compression 
against two problems disfavored by it. The 
latencies they cited from Clark (1969a) 
appeared to show the favored problems to 
be harder than the disfavored, but the 
more precise error percentages from Clark
(1969), which they did not cite, show just
the reverse, both for comparative problems
(26% to 36% errors) and for negative-
equative problems (31% to 40% errors).
Also, they said nothing about the "indeter- 
minate” problems in Clark (1969b) where 
it was found that compression was sup- 


ported in all eight pairs of premise 
combinations. 

Yet there are deviations from compres- 
sion in the data of DeSoto, London, and 
Handel (1965) and Hunter (1957). The 
counterevidence is found in premise com- 
binations like (a) “A is better than B, B 
is better than C” as against (b) "B is 
better than C, A is better than B." Al- 
though compression predicts b to be easier, 
DeSoto et al. and Hunter both found a to 
be easier in certain cases. However, this 
evidence is also in full contradiction to the 
imagery theory, which also predicts that b 
should be easier. Therefore, the notion of 
compression, which could hardly be said to 
"bear the major burden in predicting 
differences between alternative forms of the 
three-term series problem [Huttenlocher 
& Higgins p. 502]," remains, but with less 
than overwhelming support. 


CONCLUDING REMARKS 


To sum up, Huttenlocher and Higgins’ 
new theory of reasoning is defective, for 
they have failed to establish either that the 
linguistic theory is in need of revision or, 
more seriously from their point of view, that 
the imagery theory has any basis in fact. 
But, as they have correctly pointed out, the 
linguistic theory is not complete. For one 
thing, it does not fully specify how informa-
tion from the two premises is combined.
It is laudable that they have tried to fill this
gap, particularly by appealing to the visual
imagery many people report using in these
problems. Yet to repeat Clark (1969a),
"the only firm conclusion we can draw at
this time is that it has not been demon-
strated that the use of spatial imagery
differentially affects the solution of three-
term series problems [p. 402]." A recent
paper¹ proposes quite a different approach
to this incompleteness in the linguistic
theory. With new evidence, this paper
presents an extension of the theory which is
compatible both with the present linguistic
theory and with the fact that some
people report imagery in solving three-term
series problems.

¹ H. H. Clark. More on linguistic processes in
deductive reasoning. In preparation.


REFERENCES


CHASE, W. G., & CLARK, H. H. Mental operations
in the comparison of sentences and pictures. In
L. Gregg (Ed.), Cognition in learning and memory.
New York: Wiley, in press.

CHOMSKY, N. Aspects of the theory of syntax. Cam-
bridge: M.I.T. Press, 1965.

CLARK, H. H. Linguistic processes in deductive
reasoning. Psychological Review, 1969, 76, 387-
404. (a)

CLARK, H. H. The influence of language in solving
three-term series problems. Journal of Experi-
mental Psychology, 1969, 82, 205-215. (b)

CLARK, H. H. Comprehending comparatives. In
G. Flores d'Arcais & W. J. M. Levelt (Eds.),
Advances in psycholinguistics. Amsterdam: North
Holland Press, 1970. (a)

CLARK, H. H. How we understand negation.
Paper presented at the Committee on Basic
Research in Education Workshop on Cognitive
Organization and Psychological Processes, Hunt-
ington Beach, California, August 19, 1970. (b)

CLARK, H. H. Semantics and comprehension. In T.
A. Sebeok (Ed.), Current trends in linguistics.
Vol. 12. Linguistics and adjacent arts and sciences.
The Hague: Mouton, in press.

DESOTO, C., LONDON, M., & HANDEL, S. Social
reasoning and spatial paralogic. Journal of Per-
sonality and Social Psychology, 1965, 2, 513-521.

DOHERTY, P., & SCHWARTZ, A. The syntax of the
compared adjective in English. Language, 1967,
43, 903-936.

GREENBERG, J. H. Language universals. The
Hague: Mouton, 1966.

HUDDLESTON, R. D. More on the English com-
parative. Journal of Linguistics, 1967, 3, 91-102.

HUNTER, I. M. L. The solving of three-term series
problems. British Journal of Psychology, 1957, 48,
286-298.

HUTTENLOCHER, J. Constructing spatial images: A
strategy in reasoning. Psychological Review, 1968,
75, 550-560.

HUTTENLOCHER, J., & HIGGINS, E. T. Adjectives,
comparatives, and syllogisms. Psychological Re-
view, 1971, 78, 487-504.

HUTTENLOCHER, J., HIGGINS, E. T., MILLIGAN, C.,
& KAUFFMAN, B. The mystery of the "negative
equative" construction. Journal of Verbal Learn-
ing and Verbal Behavior, 1970, 9, 334-341.

JONES, S. Visual and verbal processes in problem-
solving. Cognitive Psychology, 1970, 1, 201-214.

JUST, M., & CARPENTER, P. Comprehension of
negation with quantification. Journal of Verbal
Learning and Verbal Behavior, 1971, 10, 244-253.

LEES, R. B. Grammatical analysis of the English
comparative constructions. Word, 1961, 17,
171-185.

LYONS, J. Introduction to theoretical linguistics.
Cambridge: Cambridge University Press, 1968.

SMITH, C. W. A class of complex modifiers. Lan-
guage, 1961, 37, 342-365.

TRABASSO, T., ROLLINS, H., & SHAUGHNESSY, E.
Storage and verification stages in processing con-
cepts. Cognitive Psychology, 1971, 2, 239-289.


(Received August 4, 197 


Psychological Review 
1971, Vol. 78, No. 6, 515-527 


JUDGMENT AND RESPONSE IN MAGNITUDE ESTIMATION ¹

Context is shown to influence not only judgment but, independently, response
in the task of magnitude estimation. Pure response effects are isolated and
characterized in several studies employing a common shift design and are
contrasted with judgmental effects. Response scales in magnitude estimation
are presented as interval scales constructed from a direct comparison of
stimulus differences and shown then to account for context effects. Implica-
tions for psychophysical theory are considered.


In an earlier report (Ross & Di Lollo, 
1968a), we presented experimental evi- 
dence to show that judgments of heaviness 
are formed on a complex basis, and that 
the basis changes with context. One inter- 
pretation was that context causes a change 
in the way heaviness is defined, by changing 
the emphasis placed on relevant stimulus 
properties such as weight, density, and 
size. The general conclusion we drew was 
that the immediate problem for psycho- 
physics is to try to discover not the form of 
the psychophysical function for a given 
attribute, but the laws that determine the 
basis on which judgments are made. 
Anderson (1970) reaches a similar con-
clusion on other grounds.

Our primary concern in what follows is 
to show that context has powerful effects 
not only on judgment but also on response 
scaling in the task of magnitude estimation. 
The conclusion we reach, partly from new 
experimental evidence and partly from evi- 
dence in previous studies, is that magnitude 
estimates are made on an interval scale, 
with a unit and an origin that change with 
context. What this means is that judg-
ments that themselves are dependent on
context will be represented on a response
scale which is itself susceptible to context.
The double effect of context, first on the
judgmental process and second on the
response scale, makes the search for a
simple psychophysical law for magnitude
estimation seem premature until each
effect is clearly understood.


ISOLATING RESPONSE SCALE EFFECTS


The distinction between judgmental
effects and response scale effects, to which
we have already alluded, may be stated as
follows: The judgment of stimulus mag-
nitude is the psychological fact that a judge
must strive to represent by a number or,
generally, a response. Judgments, in our
view, depend on the combination of several
relevant stimulus properties. The judg-
mental effect of context is to change the
combination of properties on which judg-
ment depends (see Ross & Di Lollo, 1968a).
The response scale effect of context is a
change or modification in the scale that a
judge constructs in order to represent his
judgments. Where context affects re-
sponse scale, but not judgment, the psycho-
logical fact remains unchanged, but the
way in which it is represented is altered.
The subject will use a different number, a
different category, or somehow modify the
response he would otherwise have made.

We first make a necessary working
assumption to enable us to minimize judg-
mental effects so as to observe response
scale effects in a relatively pure way. The
assumption is that multidimensional
scaling of estimates of difference between
stimuli can be used in order to sort classes
of stimuli into those for ...

... structures that are not unidimen-
sional, which supports the suggestion (Ross
& Di Lollo, 1968a) that lifted weights are
judged as complex stimuli and not in terms
of the single stimulus property, weight.
Other studies in our laboratory have shown 
that there are, on the other hand, classes 
of stimuli for which multidimensional scal- 
ing of difference estimates does produce 
unidimensional structures. Where it does 
so, we take it to mean that a single stimulus
property underlies the judgments of differ-
ence. Since we argue that the judgmental
effect of context is to vary the emphasis
placed on different stimulus properties, it 
follows that where only one property is ever
considered, context will have no effect.

The conditions under which estimates of
difference with respect to a property (like
heaviness, length, pattern density, etc.)
will produce structures that are not uni-
dimensional are different from those under
which estimates of overall dissimilarity for
the same stimulus objects will do so. In
the former case, the instruction is to ignore
everything except the relevant property,
whereas, in the latter, all perceptible differ-
ences are grist for the judgmental mill.
Estimates of differences with respect to a
property will produce nonunidimensional
structures only if, so to speak, instructions
are disobeyed.
Looking at the point more technically,
the distinction may be made by considering
how difference estimates are formed in each
case. Each stimulus object may be con-
sidered as a vector of attributes. The in-
struction to judge the overall difference
between two objects x and y is an instruc-
tion to estimate a distance value $d_{xy}$, where
$d_{xy} = f(x, y)$, that is, some function of the
two vectors of attribute values. Analysis
of a matrix of $d_{xy}$ values for a set of objects
locates them as vectors in a multidimen-
sional space, according to standard multi-
dimensional scaling theory if the require-
ments of that theory are met.

Estimates of difference with respect to a
property should be one of two things. The
first possibility is that they are estimates
of difference considering only one attribute,
from vectors x and y. That possibility
may be represented by saying that the esti-
mates represent distances $d^{a}_{xy}$, where
$d^{a}_{xy} = f(x_a, y_a)$, $x_a$ and $y_a$ being the
values from the vectors x and y for attribute a.
Analysis of a matrix of $d^{a}_{xy}$ values should
produce a unidimensional structure, since
the $x_a$ and $y_a$ are scalars.

The second possibility is that the prop-
erty in question, like heaviness, may de-
pend on two or more stimulus attributes,
like weight and density. In this case there
should be a two-step estimation of differ-
ence with respect to the property, Step 1
being judgment of the property, p, and
Step 2 a consequent derivation of a differ-
ence estimate. The whole process may be
represented as:


Step 1: $p_x = h(x)$, $p_y = h(y)$
Step 2: $d^{p}_{xy} = f(p_x, p_y)$


A matrix of $d^{p}_{xy}$ values should produce a
unidimensional structure even where the
property p is based on the vectors x and y,
since the values $p_x$ and $p_y$ (or $h(x)$ and
$h(y)$) are again scalars.

Failure to find unidimensionality means
either that the function h or the function
f varies with the pair of objects, x and y,
judged. Variation in f may be excluded
both as unlikely and tolerable under reason-
able assumptions of monotonicity if non-
metric scaling procedures are employed.
Finding a structure that is not unidimen-
sional may be taken to mean that the
function h varies with the pair of objects
judged.
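
The two-step argument can be illustrated with a small sketch. It rests on assumptions not in the original: the distance function is taken to be the absolute difference of the two property values, the attribute vectors are invented (weight, density) pairs, and unidimensionality is checked metrically (every triple of objects must fit on a line) rather than by a nonmetric scaling procedure. All function names are hypothetical.

```python
from itertools import combinations

# Hypothetical stimulus objects as (weight, density) attribute vectors.
OBJECTS = [(1, 5), (2, 1), (4, 3)]


def fixed_basis_distances(objects):
    """Step 1: p = h(x) with one fixed h; Step 2: d = |p_x - p_y|."""
    def h(x):
        return 2 * x[0] + x[1]  # assumed fixed weighting of the attributes
    return {(i, j): abs(h(objects[i]) - h(objects[j]))
            for i, j in combinations(range(len(objects)), 2)}


def shifting_basis_distances(objects):
    """h varies with the pair: each pair is judged only on the
    attribute on which its two members differ most."""
    return {(i, j): max(abs(a - b) for a, b in zip(objects[i], objects[j]))
            for i, j in combinations(range(len(objects)), 2)}


def fits_on_a_line(n, d):
    """True if, for every triple of objects, the largest of the three
    distances equals the sum of the other two."""
    for i, j, k in combinations(range(n), 3):
        small, mid, large = sorted([d[(i, j)], d[(i, k)], d[(j, k)]])
        if large != small + mid:
            return False
    return True
```

With a fixed h the three objects receive scalar values and their distances are additive along a line; when the basis shifts with the pair, the same objects yield distances that no single line can reproduce.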

Our assumption is that where several
attributes form the basis of judgment of
a property, the basis shifts with the pair
of objects judged so as to emphasize the
attributes in which they differ most and
so facilitate discrimination between them.
If that is true, h will vary ... way in which
variation might ... and Di Lollo ... ranges of
weights. It ... noted that although $d^{p}_{xy}$ ...
will produce multidimensional ... if the h
function ... not a direct recovery
of the attribute vectors for the stimulus
objects. This fact severely complicates the
problem of describing the relationship be-
tween the multidimensional structures de-
rived from estimates of difference with
respect to a property and scales derived
from direct estimates of the property itself.

We rely on multidimensional scaling to
determine whether judgments for a class
of stimuli depend on more than one prop-
erty or only on one single property. In
the first case, we classify the stimulus class
as one where the judgmental effects of con-
text may be observed. In the second case,
we classify the stimulus class as one where
the judgmental effects of context will be
weak or absent. Any response scale effects
of context in the second case will be uncon-
founded with judgmental effects.

Multidimensional scaling, then, enables
us to isolate classes of stimuli not at all or
only weakly susceptible to judgmental
effects, if our working assumption is correct.
The assumption could be justified by an
argument based on the findings of Donovan
and Ross (1969) and our earlier report
(Ross & Di Lollo, 1968a), but we prefer
to justify it by showing that context effects
for the two classes of stimuli defined by
multidimensional scaling are clearly dis-
tinguishable and easily explicable by a
model for response scale effects where
judgmental effects should be weak or
absent.


AN EXAMPLE OF RESPONSE
SCALE EFFECTS

Random checkerboard patterns, like
those used by Di Lollo and Kirkham (1969),
fall within the class of stimuli that are
judged in terms of one property, in this
case, pattern density. We report the re-
sults of a study using checkerboard pat-
terns to show pure response scale effects.
In this study, as in all the studies that we
consider, a common experimental design,
called the shift design, was employed.

The shift design employs two comparison
stimulus series, Series A and Series B, and
a single standard presented once only be-
fore judgments of either series begin.
The standard is set to a numerical value,
normally 100, and the subject makes
magnitude estimates of the stimuli in 
Series A for a fixed number of trials. In 
the studies to be reported, each series con- 
sists of five Separate stimuli, and a trial 
consists of the presentation of all five 
stimuli in the series in random order. The 
order is independently randomized for each
trial, but the same order is used for all
subjects in all groups.
On a predetermined trial, Series B is sub-
stituted for Series A, abruptly and with
no warning to the subject. If Series B is
in a higher region than Series A, the shift
is upward; if it is in a lower region, the
shift is downward. Identical or other
series may be used for purposes of control. 
Suitable controls permit the design to be 
used to study contrast effects produced by 
shift. The design yields information about 
preshift and postshift estimation functions, 
about the immediate effects of shift, and 
about the stability of estimates over time. 
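
The trial structure of the shift design can be sketched in a few lines. This is only an illustration of the procedure as described: the function name `shift_schedule`, the use of a seeded generator to keep one order for all subjects, and the seed value are assumptions, and the series values and trial counts are those of the checkerboard study reported next.

```python
import random


def shift_schedule(series_a, series_b, n_preshift, n_postshift, seed=0):
    """Build the trial list for a shift design.

    Each trial presents every stimulus of the current series once, in an
    order drawn afresh for that trial. A single seeded generator is used, so
    the same schedule can be replayed for every subject (an assumption about
    how "the same order for all subjects" might be realized).
    """
    rng = random.Random(seed)
    trials = []
    for phase, series, n_trials in (
        ("preshift", series_a, n_preshift),
        ("postshift", series_b, n_postshift),
    ):
        for _ in range(n_trials):
            order = list(series)
            rng.shuffle(order)  # independent random order on every trial
            trials.append((phase, order))
    return trials


# Percent-black values of the light and medium checkerboard series.
LIGHT = [20, 25, 30, 35, 40]
MEDIUM = [40, 45, 50, 55, 60]

# Light-medium group: eight preshift trials, then an unannounced upward shift.
schedule = shift_schedule(LIGHT, MEDIUM, n_preshift=8, n_postshift=8)
```

Passing the same series twice, as in `shift_schedule(MEDIUM, MEDIUM, 8, 8)`, gives the schedule for a control group.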

Figure 1 shows the results of a study in 
which each of three independent groups of 
23 subjects (student volunteers) judged one 
of three different series of five random 
checkerboard patterns each with respect to 
proportion of black area. The stimulus 
objects have been described by Di Lollo 
and Kirkham (1969). Briefly, they con- 
sisted of 50 x 50 matrices of square- 
element positions having random structure 
and varying in percentage of black area 
from 20% to 80% in steps of 5%. Each 
matrix was projected on a screen as a 
square having 4-foot sides. The percentage 
of black squares in the patterns of each 
series was as follows: light series—20%, 
25%, 30%, 35%, 40%; medium series— 
40%, 45%, 50%, 55%, 60%; dark series— 
60%, 65%, 70%, 75%, 80%. The experiment
consisted of eight preshift and eight
postshift trials. The 50% pattern was used
as a common standard stimulus for all
groups and was called 100. The standard
was presented once only before estimation
trials began, and never again. During preshift
each group judged a separate series;
during postshift all groups judged the
medium series. The three groups can thus
be described as light-medium, medium-medium,
and dark-medium, respectively.
The points shown in Figure 1 represent
the mean judgment for each pattern
averaged over the last four preshift trials
for each group.

Fig. 1. Mean judgment values for each pattern over the last four preshift
trials. (The intersecting horizontal line has been drawn at the level of a
magnitude estimate of 100.)

The groups differ systematically. First,
they differ in the stimulus value that they
would call zero, as can be seen by extending
linear fits to the estimation functions
down to the abscissa. Second, they differ
in the slopes of their estimation functions,
which increase with mean stimulus level.
Third, they differ in the stimulus value
that would be estimated as 100, that is,
in their use of the standard, but the
difference is less than for the zero-valued
stimuli.


AN EXPLANATION FOR RESPONSE 
SCALE EFFECTS 


The most obvious effect of context is to
create an overlap in the range of numbers
assigned by subjects to stimuli in adjacent,
but not overlapping, stimulus series. The
light series group judges a 40% black
pattern as 115. The same 40% black
pattern is judged as 75 by the medium
series group, which would place it, on the
light series scale, as a pattern between 25%
and 30% black. The two groups are
clearly working on different response scales
despite the fact that both might be expected
to have adopted the same ratio scale, its
modulus set by the assignment of the number
100 to a pattern of 50% black squares
in both cases.

Similarly, the medium series group
judges a 60% black pattern as 146, whereas
the dark series group judges it as 114, a
value which, on their scale, the medium
series group would apply to a pattern about
53% black. The response scale changes
as the context moves upward, and the
effect is much too large and much too
systematic to be dismissed as a mere source
of perturbation. As we show later, there are
cases where the effect of context is small,
but here it is not (cf. Stevens, 1966). It
is something to be explained, and we
propose an informal model to do so.

The two most important clues in a search
for an explanation are provided by the
results themselves. As Figure 1 shows,
the three groups differ markedly in their
description of the standard and in the
stimulus that they would describe with the
number 100. For the middle group, the
medium series group, that stimulus
corresponds very closely to the standard. The
medium series group does in fact judge the
standard, since their stimulus series
includes a 50% black pattern. Even after
many trials, they judge it as about 100.
They can be said to remember the standard
accurately. The lower group, the light
series group, does not judge the standard,
since their stimulus series does not include
it, but by extrapolation it can be concluded
that the standard would be judged as much
higher than 100. When the 50% black
standard is encountered on the first postshift
trial, it is in fact judged as 144, as can
be seen in Figure 2. The number 100 is
used to describe a stimulus with a black
area much lower than 50%, somewhere
about 37%. The light series group can
be said to have modified the remembered
standard downward, in the direction of the
stimulus series within the context of which
they make their judgments.

Fig. 2. Mean magnitude estimates for terminal preshift and initial postshift
stages for the light-medium and dark-medium groups. (The broken line was
fitted through five data points (not shown) representing the mean performance
of the medium-medium group over Preshift Trials 5-8.)

The dark series group is forced in the
opposite direction. Subjects would describe
the standard as 60, rather than 100, and
use 100 to describe a stimulus of some 58%
black area. They modify their memory
of the standard upward, rather than downward.
The first thing the results show,
then, is a systematic modification by context
of the standard as it is remembered
by groups working within a low or a high
context, and no modification for the group
working within a context to which the
standard is central.


The second clue is that the groups differ 
in the stimulus values which would fall 
at the zero points on their response scales. 
The three stimulus values are: light series, 
7% black; medium series, 24% black; and 
dark series 38% black. These values are 
calculated by extrapolation downward of 
the linear functions fitted to the data 
points in Figure 1, and are to that extent 
suspect. However, any reasonable func- 
tions fitted to the three sets of results 
would show wide differences between the 
groups in the stimulus values at which the 
functions meet the abscissa. 
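
The arithmetic behind the two clues can be sketched roughly. As an assumption made only for illustration, each group's estimation function is taken to be the straight line through its reported zero point and one reported judgment (115 for the 40% pattern, 146 and 114 for the 60% pattern); these are not the authors' fitted functions, and the class name `LinearScale` is hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class LinearScale:
    """Straight-line estimation function: estimate = slope * (x - zero_point)."""

    zero_point: float  # percent black that would be called zero
    slope: float       # magnitude-estimate units per percent black

    @classmethod
    def through(cls, zero_point, stimulus, judgment):
        """Line through (zero_point, 0) and one reported (stimulus, judgment)."""
        return cls(zero_point, judgment / (stimulus - zero_point))

    def estimate(self, stimulus):
        return self.slope * (stimulus - self.zero_point)

    def stimulus_for(self, judgment):
        return self.zero_point + judgment / self.slope


# Zero points and judgments as reported in the text; the lines are assumed.
light = LinearScale.through(zero_point=7, stimulus=40, judgment=115)
medium = LinearScale.through(zero_point=24, stimulus=60, judgment=146)
dark = LinearScale.through(zero_point=38, stimulus=60, judgment=114)

for name, scale in (("light", light), ("medium", medium), ("dark", dark)):
    # slope, stimulus that would be called 100, estimate for the 50% standard
    print(name, round(scale.slope, 2), round(scale.stimulus_for(100), 1),
          round(scale.estimate(50), 1))
```

Under this simplification the slopes rise from the light to the dark series, a judgment of 75 falls between 25% and 30% black on the light scale, and the dark group's 100-valued stimulus lies near 58% black, in rough agreement with the values quoted in the text.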


Correlations were calculated between
log judgments and log stimulus values for
individual subjects using data from Trials
5-8. A ratio was computed, for each
subject, of the log-log correlation to the
correlation for the untransformed values. The
average ratios for the light, medium, and
dark groups were [...], .953, and .947,
respectively, indicating that the linear
regression is marginally more [...] than
the log-log regression. Linear functions,
that is, fit the data as well as power
functions [...] other exponent. A further
justification for linear fits is the fact that
the upward linear extrapolation of the light
series [...] collects estimates by
that group for the medium series stimuli
on the first postshift trial (Figure 2).
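
The ratio described here is straightforward to compute. The sketch below is an assumption about the calculation, using plain Pearson correlations; the helper names and the single hypothetical subject (judgments lying exactly on a line with a zero point of 7% black) are illustrative only and are not data from the study.

```python
import math


def pearson_r(xs, ys):
    """Pearson product-moment correlation of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)


def loglog_to_linear_ratio(stimuli, judgments):
    """Ratio of the log-log correlation to the untransformed correlation."""
    r_linear = pearson_r(stimuli, judgments)
    r_loglog = pearson_r([math.log(x) for x in stimuli],
                         [math.log(y) for y in judgments])
    return r_loglog / r_linear


# Hypothetical subject whose judgments are exactly linear in percent black.
stimuli = [20, 25, 30, 35, 40]
judgments = [3.5 * (x - 7) for x in stimuli]
ratio = loglog_to_linear_ratio(stimuli, judgments)
```

For data that are linear with a nonzero intercept, as here, the ratio falls slightly below 1, which is the direction of the group averages reported above.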
The two clues suggest that context in- 
fluences both the memory of the standard 
and the stimulus value that the subject 
supplies for himself as a lower bound im- 
plicit reference stimulus, the stimulus that 
he would call zero. As context moves up 
and down the stimulus range, the re- 
membered standard and the lower bound 
stimulus are forced up or down, with results 
that can be observed in Figure 2. 

We believe that the subject in the task
of magnitude estimation constructs his
estimates for stimuli indirectly, from
estimates of stimulus difference. Subjects,
when asked, say that they compare each
stimulus with others, especially the last
one or two, and adjust their previous
estimates upward or downward by an
appropriate amount. We are inclined to believe
them, and to take them to mean that they
estimate a numerical value for a stimulus
difference, then construct their estimate for
the stimulus from the stimulus difference
value and previous estimates. What subjects
describe is interval scale behavior,
that is, the direct representation of intervals
as numbers by a process of comparison
of intervals, and the construction of numbers
to represent stimuli from the numbers
for stimulus differences.

The informal model we propose assumes
that subjects are sensitive to stimulus 
differences, and estimate directly, without 
the use of numbers, the ratio of one differ- 
ence to another. They can, so to speak, 
estimate how many times a larger difference 
would contain a smaller. If we grant that 
capability, and if we also grant a memory 
adequate to enable the comparison of a 
present with a previous stimulus, and of 
two previous stimuli, then we have enough 
to explain how interval scale representa- 
tions can be constructed. 

The subject starts with a standard stimulus,
s, and the instruction to represent it
by a numerical value, n. Suppose that he
imagines a lower bound stimulus, z, the
stimulus he would call zero. If he can sense
the difference between s and z, d_sz, then
he can also use n to represent the difference.
We now present him with a second
stimulus, r, and he senses the difference
value between r and s, d_rs. Granting him
the ability to estimate the ratio of stimulus
differences directly, he can calculate a
numerical value for the difference d_rs by
applying the ratio of d_rs to d_sz, which is
obtained directly, to the number, n, that
represents d_sz. Let us say that the result
is the value a. The adjustment of n, the
value that represents both the difference
d_sz and the stimulus value s, upward or
downward by a, yields a value m that
represents the new stimulus, r. A direct
comparison of differences thus allows the
direct representation of a difference, and
the indirect representation of a stimulus.
The subject is fortunate enough to have a
check. He can compare d_rz with d_sz, and
repeat the process to see if he arrives at the
same result.

As new stimuli are introduced, the 
process can proceed in the same fashion, 
but with more options and with more 
opportunities for checks. A fresh stimulus 
can be compared with the previous one or 
two, with the standard, and with the lower 
bound; whatever is done, the result should 
be the same. 

What context does, we believe, is
introduce a systematic bias into the stimulus
value the subject remembers as the standard,
s, and the stimulus value he imagines
for the lower bound stimulus, z. The
effect is to produce interval scales with
different intercepts and different slopes.
To illustrate, it is sufficient to consider
the medium and dark series groups judging
the stimulus they have in common, the 60%
black square. The medium series group
remembers the standard veridically, but
imagines the lower bound stimulus as
about 24% black. For them, the number
100 means both a 50% black pattern and
the difference between a 50% and a 24%
black pattern. They perceive the difference
between the 60% black stimulus and
the standard as being contained a little
more than twice within the difference
between the standard, s, and the lower bound,
z. In the notation used above, d_sz is a little
more than twice as big as d_60,s. Since d_sz
is represented by 100, the appropriate
number for d_60,s is a little under 50, that is,
the number that bears the same ratio to
100 as d_60,s does to d_sz. The ratio is the
result of a direct comparison of differences;
the appropriate number for the new difference
is the result of a numerical calculation.
The stimulus pattern of 60% black is
judged as 100, the number that represents
the standard, plus a little less than 50, the
numerical value that represents the difference
d_60,s. It is judged by the medium
series group as 146.

The dark series group remembers the 
standard as a pattern of about 58% black 
squares, and places the lower bound at 
about 38% black. The 60% black square 
is only a little different from the standard
as remembered. The difference, d_60,s, is
represented by a number just above 10,
and the 60% black square is called 114. 
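
The construction just illustrated can be written out as a short sketch. One loud assumption: perceived differences are replaced here by differences in physical percent black, which the text does not claim, so the numbers only approximate the reported judgments. The function names are hypothetical; s, z, r, n, a, and the differences follow the notation of the text.

```python
def estimate_from_standard(r, s, z, n=100.0):
    """Estimate for stimulus r built from its difference from the standard s."""
    d_sz = s - z          # difference that the number n stands for
    d_rs = r - s          # signed difference of the new stimulus from the standard
    a = n * d_rs / d_sz   # number for d_rs: the ratio d_rs / d_sz applied to n
    return n + a          # adjust n upward or downward by a


def estimate_from_lower_bound(r, s, z, n=100.0):
    """The check: the same estimate built from the difference between r and z."""
    return n * (r - z) / (s - z)


# Medium series group: standard remembered as 50%, lower bound about 24%.
medium = estimate_from_standard(r=60, s=50, z=24)
# Dark series group: standard remembered as about 58%, lower bound about 38%.
dark = estimate_from_standard(r=60, s=58, z=38)
```

With physical differences standing in for perceived ones, the 60% pattern comes out at about 138 for the medium group and 110 for the dark group, against the reported 146 and 114. The ordering and the approximate size of the gap are reproduced; the remainder reflects the fact that the perceived ratios described in the text need not equal the physical ones.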

It is not our intention to develop the
sketch of the representation process into
anything so ambitious as a formal measurement
theory, but an attempt to do so might
proceed along the lines of a psychophysical
relation theory constructed by Krantz
(1970). Krantz assumes that the fundamental
object judged is "the perceived relation
between a pair of stimuli," and that
the ratio between stimuli can be apprehended
by a direct mental matching operation, like
the physical operation carried out in
comparing lengths. He goes on to construct a
set of axioms from which it can be proved
that representation of stimuli by numbers
is guaranteed, and that the permissible
representations are related by power
transformations. What we have assumed about
stimulus differences is very close to what
Krantz assumes about stimuli themselves,
though we have probably assumed sufficiently
more to require multiplicative rather
than power transformations.
Krantz (1970) proposes a mechanism by
which a sensation "ratio," a relationship
between members of a pair, is arrived at
directly, with no numerical calculations.
His assumption is that pairs of properties,
like heaviness or loudness, are matched
with length pairs, and then one member of
the pair "fitted into" the other as many
times as it will go. The process of mentally
"fitting in," like the physical process of
laying one length off against another, yields
a ratio directly. Numerical calculation
enters in only when a person assigns a
number to a stimulus y_i: The number he
assigns to y_i is n = pr if the number he
has assigned to x_i is p and if the directly
obtained ratio of y_i to x_i is r.

Our assumption is that it is pairs of 
differences, not pairs of properties, that are 
compared, and one difference that is "fitted 
into" another, not one object or its length 
equivalent. If it is reasonable to assume a
mechanism that matches object pairs with 
length pair equivalents, it is as reasonable, 
and perhaps more so, to accept a matching 
of difference pairs and length pairs. Length 
itself may be a direct perception of differ- 
ence in location between points assigned 
positions within a spatial system, and 
rather more naturally coordinated to other 
perceived differences. The construction of 
numbers for stimuli from numbers that 
represent differences entails a further step, 
the addition or subtraction of a number for 
a difference to or from a number previously 
assigned to another stimulus. 


RESPONSE SCALES AND THE
EFFECTS OF SHIFT

An overlap in the range of numbers 
assigned by independent groups to adjacent 
stimulus series can be taken as a primary 
sign of response scale effects in cases where 
stimuli fall within the unidimensional class. 
Overlap is ambiguous for multidimensional 
stimuli, like lifted weights, since it could be 
caused by differences in the basis of judg- 
ment for stimuli within different contexts. 

When independent groups working in dif- 
ferent stimulus ranges are shifted abruptly 
to a common postshift series, their initial 
estimates should depend on the response
scale established in the preshift series.
With repeated trials, as the postshift series
establishes new reference points, the standard
as remembered, and the lower bound
stimulus, all groups should tend to establish
similar response scales, and the initial
effects of shift should begin to disappear.

Returning to the study described in the 
foregoing, Figure 2 shows the initial effects 
of shift when groups judging random 
checkerboards are shifted abruptly to the 
medium series. Straight lines have been
fitted through the preshift data of both
shifted groups and have been extended over
the range of the medium series. On the
first postshift trial, the upward shifted
group follows the preshift estimation function
almost exactly in judging the postshift
series. The downward group does not and
could not unless some stimuli were assigned
unrealistically small numbers. The downward
shifted group uses a new scale defined
at the upper end by the lowest preshift
stimulus, and at the lower end by a new
lower bound stimulus, forced abruptly
upon it by the change in context. The
lower bound is roughly equal to that of the
medium series group on its last four preshift
trials.

Fig. 3. Mean judgment value for each group on each trial.

With repeated postshift trials, both
shifted groups moved toward the control
function, as the postshift series established
a new context.

The five responses made by each subject
on each trial were averaged to yield the
mean response by each subject for the series
on each trial. Mean magnitude estimates
for each group on each trial, before and
after shift, are shown in Figure 3. The
point that emerges from Figure 3 is the
striking similarity in preshift group differences,
in the initial effects of shift, and in
convergence toward control levels, to results
typically found in similar studies
using category estimation (Di Lollo, 1964;
Ross & Di Lollo, 1968b). There is also an
important difference: The effects of upward
and downward shift are fairly symmetrical
when category estimation is used.
Here they are not. Upward shift produces
a larger initial deflection from the control
level than downward shift. Further, the
upward shifted group moves toward but
does not reach the control level with repeated
postshift trials, whereas the downward
shifted group not only reaches but
moves past the control level. We show
later that the asymmetry of upward and
downward shift effects is not an invariable
feature of magnitude estimation but depends
on the range of numbers available
as permissible responses.

So far it has been shown that where overlap
between preshift estimates is large,
shift effects are also large and consistent
with our explanation for response scale
effects. Results for two further cases,
length of lines and pitch of tones, are now
reported. Both fall within the unidimensional
class, though there is some evidence
for minor judgmental complications in the
case of tones.

The procedural and methodological details
of the two experiments were as follows:

Length of line. Three independent groups
of 20 subjects each (student volunteers)
made magnitude estimations of the length
of one of three different series of five
straight lines each. The lines were projected
as horizontal strips of light having
2 centimeters breadth and variable length
about the center of a flat screen 7 meters
wide and 2 meters high, located in a semidark
room. The length of each line in
each series was as follows (in centimeters).
Short series: 50, 60, 70, 80, 90; medium
series: 90, 100, 110, 120, 130; long series:
130, 140, 150, 160, 170. The subjects viewed
the lines, presented one at a time, from a
distance of approximately 8 meters. The
110-centimeter line was used as a common
standard for all groups and was called 100.
All other procedural details were the same
as for the experiment with random checkerboards,
including a shift to the medium
series after eight preshift trials.

Fig. 4. Mean magnitude estimates for judgment of length of line and pitch of tone.

Pitch of tone. Three independent groups
of 40 subjects each (student volunteers)
made magnitude estimations of the pitch
of one of three different series of five tones
each. The tones had been recorded on
magnetic tape and were presented through
headphones at an intensity of 72 decibels
sound pressure level. The frequency of
each tone in each series was as follows (in
cycles per second). Low series: 262, 285,
311, 339, 370; medium series: 370, 404,
440, 480, 523; high series: 523, 571, 622,
679, 740. Since the equal-loudness contour
does not vary more than 3 decibels
over the range of frequencies at the sound
pressure level used in this experiment, no
correction for loudness was made. The
440-cycles-per-second tone was used as a common
standard for all groups and was called
100. The subjects were instructed that a
tone having pitch twice as high as the
standard should be called 200, a tone having
pitch half as high as the standard should
be called 50, and so on. All other procedural
details were the same as for the
experiment on length of line, including a
shift to the medium series after eight preshift
trials.
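
A quick check of the listed frequencies suggests how the tone series were spaced. The observation that successive tones stand in a nearly constant ratio is drawn from the numbers themselves and is not stated in the text; the variable names are illustrative.

```python
import math

# All thirteen distinct tone frequencies, low through high series (Hz).
TONES_HZ = [262, 285, 311, 339, 370, 404, 440, 480, 523, 571, 622, 679, 740]

# Ratio of each tone to the one below it, and the same step in semitones.
ratios = [upper / lower for lower, upper in zip(TONES_HZ, TONES_HZ[1:])]
steps_in_semitones = [12 * math.log2(ratio) for ratio in ratios]

for lower, step in zip(TONES_HZ, steps_in_semitones):
    print(lower, round(step, 2))
```

Every step is close to one and a half semitones, so each series spans half an octave and the three series together cover roughly an octave and a half. The series are thus equally spaced in log frequency rather than in cycles per second, unlike the line and checkerboard series, which are equally spaced in physical units.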
Figure 4 shows the results of both studies.
The overlap in the preshift phase between
adjacent stimulus series is small in the case
of judgment of length, and large in the case
of judgment of pitch. Points for individual
stimuli in the preshift phase are shown in
the left panels of Figure 4 only for the top
and bottom series, but overlap between
adjacent series can be estimated from the
straight lines fitted to both sets of top and
bottom series and drawn without individual
points being shown for the middle series.
The left panels of Figure 4 also show the
performance of both shifted groups on the
first postshift trial. In judgment of length,
both the upward-shifted group (short-medium)
and the downward-shifted group
(long-medium) move to the control level
immediately. Both sets of points fall almost
exactly on the control curve. In
judgment of pitch, the upward-shifted
group (low-medium) follows the preshift
curve on the first postshift trial, as also
happened in judgment of black area with
random checkerboard patterns. The downward-shifted
group (high-medium) also
behaves like their counterparts judging
checkerboard patterns. The top end of the
curve is tied to the preshift scale, but there
is an abrupt readjustment in slope to
accommodate the lower stimuli.

The right panels of Figure 4 show the
effect of continued exposure to the postshift
context. Judgment of length remains
stable over the whole postshift phase,
showing neither an initial deflection from
control levels in response to shift nor any
drift in level with repeated postshift trials.
Judgment of pitch shows a pronounced
deflection upward with upward shift and
downward with downward shift. With repeated
trials, both shifted groups drift
toward the control level. The asymmetry
between upward and downward shift that
was observed with checkerboard patterns
can also be seen clearly with pitch.

The results of the two experiments show
that where overlap between numbers assigned
to adjacent stimulus series is small,
the effects of shift are small, and that where
overlap is large, the effects of shift are large.
This correlation of effects is to be expected
if both preshift and postshift effects depend,
as we say they do, on sensitivity of
reference stimuli to context.
What it is that determines the sensitivity
of reference stimuli to context is another
question. We do not offer any firm opinion,
but we venture to suggest that the primary
determinant is whether or not people have
had training in the mensuration techniques
for the stimulus property concerned.
Everyone has been taught the principles
of mensuration for length and weight.
The concepts of zero length and zero weight
are well established, the operations to increase
or reduce length and weight are
known, and elaborate reference systems for
both length and weight are part of the
intellectual equipment of subjects like those
who served in our experiments. Pitch is
another matter. The concept of zero
pitch is not a clear one, the operations to
increase or reduce pitch are not familiar,
and there is no well-established reference
system for pure tones as there is for
stimulus domains such as length and
weight. While the concept of zero pattern
density may be clear enough, the operations
to increase or reduce the pattern density
of a random checkerboard are, once again,
obscure to most people, and there is no
well-established and highly overlearned
stock of exemplars to remind a judge of the
look of a stimulus having a given absolute
density under the conditions of the
experiment.


Our suggestion is that thorough familiarity
with concepts, operations, and exemplars,
all of which are part of training in
mensuration, minimizes context effects by
providing a general context within which
the experimental context falls into place.
Where familiarity is lacking, the experimental
situation is prone to distort the
general context, producing the context
effects that are observed.


RESTRICTION OF 


There is an asymmetry in the examples
considered so far between the effects of
shift on the first postshift trial following
upward and downward shift. Estimates
of upward-shifted groups follow the preshift
estimation function, whereas estimates
of downward-shifted groups do not. The
explanation given is that if the preshift
function were to be maintained following
downward shift, some stimuli would be
assigned unrealistically low numbers and
judges are therefore forced to make an
abrupt readjustment. If the explanation
is correct, symmetry between upward and
downward effects should be found when
there is a maximum as well as a minimum
valued stimulus within a domain, and where
both have a fixed value, like 0 and 100.
Di Lollo and Kirkham (1969) report such
a case in a study of pattern density where
all black and all white checkerboards were
maximal and minimal stimuli, respectively,
one defined as 100, the other as 0. Only
a portion of the experiment need be considered
here. Three groups (light-medium,
medium-medium, dark-medium) of 13 subjects
each were required to estimate the
percentage of black area in each of five
patterns under conditions that were otherwise
identical to those of the study with
checkerboard patterns described in the
foregoing. Further analysis of the data
reported by Di Lollo and Kirkham (1969)
reveals symmetrical shift effects on the
first postshift trial as shown in Figure 5.

Fig. 5. Mean judgment values for terminal preshift and initial postshift
stages for the light-medium and dark-medium groups. (The broken line was
fitted through five data points (not shown) representing the mean performance
of the medium-medium group over Preshift Trials 7-10.)


Downward-shift effects are exactly like 
those reported for the previous checker- 
board study (Figure 2) and for the experi- 
ment with tones (Figure 4, lower left panel). 
Upward-shift effects are different from 
those reported above and symmetrical 
with downward-shift effects in that the 
estimate for the lower stimulus of the 
postshift series is fixed by the preshift 
estimation function, but the first postshift 
estimation function has a much shallower 
slope, as it must have if responding with
unrealistically high numbers is to be
avoided.

The estimation of black proportion for
checkerboard patterns is a task where the
stimuli are unidimensional and where familiarity
with mensuration techniques is low.
People have typically not been taught to
estimate black proportion as they have been
to estimate length and weight. Where subjects
are free to use any positive number
to represent their judgments, there is an
asymmetry between the effects of upward
and downward shift, as was shown earlier.
The effects become symmetrical when the
task is changed to introduce an upper limit
as well as a lower limit on the numerical
responses available for the representation
of judgments. That a task modification
which affects only the response range available
to the subject alters the effects of shift
so decisively strongly suggests that response
effects are the predominant ones in the
experimental situation.


JUDGMENT AND RESPONSE EFFECTS


We now show that the effects described
above can be clearly distinguished from
effects found with lifted weights, stimuli
that do not fall in the unidimensional class
(Donovan & Ross, 1969). The results
shown in Figure 6 represent further
analyses of data reported in a previous
paper (Ross & Di Lollo, 1968a, Experiment
1) to which the reader is referred for procedural
and methodological details. Briefly,
three independent groups of 20 subjects
each (student volunteers) made magnitude
estimations of the heaviness of one of three
different series of five weights each. The
weights consisted of 4-ounce polythene
containers, 5.3 centimeters in diameter and
7.3 centimeters high, filled with lead shot
and paraffin wax to the weight required.
The weight of each stimulus object in each
series was as follows (in grams). Light
series: 100, 150, 200, 250, 300; medium
series: 400, 450, 500, 550, 600; heavy
series: 700, 750, 800, 850, 900. The 500-gram
weight was used as a common standard
for all groups and was called 100.
There were 10 preshift and 10 postshift
trials. All other procedural details were
the same as for the experiment with random
checkerboards, reported in the foregoing,
including a shift to the medium series.
Although the zero-valued stimulus on the
three preshift estimation functions shown
in Figure 6 increases systematically with
series level, the slopes decrease rather than
increase correspondingly. Moreover, the
differences in the 100-valued stimuli are
greater than, not less than, the differences
in the 0-valued stimuli.

Fig. 6. Mean magnitude estimates for terminal preshift and initial postshift
stages for the light-medium and heavy-medium groups. (The broken line
was fitted through five data points (not shown) representing the mean
performance of the medium-medium group over Preshift Trials 6-10.)
(Legend: preshift, Trials 6-10; first postshift trial. Abscissa: weight in grams.)

Postshift effects are still more discrepant.
In the first place, the estimations on the
first postshift trial do not follow the preshift
estimation function either in upward
or downward shift (Figure 6); and, in the
second place, the mean estimate for the
series continues to rise over postshift trials
whether the shift was upward or downward
(Ross & Di Lollo, 1968a, Figure 5).
Ross and Di Lollo (1968a) showed that
both preshift context effects and postshift
effects, including the almost invariable upward
drift in level over preshift and postshift
trials, could be explained by a model
for the complex determination of judgments
of heaviness, ignoring response-scale effects.
While it is almost certainly true that re- 
sponse effects are also present in estimates 
of heaviness, it does seem that it is judg- 
mental effects that are predominant, and 
it does seem that they can be distinguished 
from response effects. 

The classification of stimulus domains 
into those susceptible to judgmental effects 
and those not susceptible or only weakly 
susceptible was based on a working assump- 
tion that multidimensional scaling of judg- 
ments of stimulus differences would reveal 
more than one dimension in the former case 
and only one dimension in the latter. The 
validity of the classification scheme is sub- 
stantiated first by the sharp differences be- 
tween preshift and postshift effects found 
for the two classes, for example, between 
lifted weights and checkerboard patterns; 
second by the fact that a variety of effects 
can be shown to be correlated within the 
class of stimuli not susceptible to judg- 
mental effects. Where the overlap in the 
range of numbers assigned by independent 
groups to adjacent stimulus series is large, 
the effects of shift, immediately and with continued postshift trials, are
also large. Where the overlap is small, as it is with straight lines, shift
effects are also small. Finally, the validity of the classification and the
effectiveness of the working assumption are attested by the fact that the
effects found within both classes of stimuli can be explained by
appropriate models.


METACONTRAST AND LATERAL INHIBITION


BRUCE BRIDGEMAN


Stanford University


A Hartline-Ratliff lateral inhibitory network, modified for application to
mammals, is stimulated with patterns having all of the parameters of
stimuli used in metacontrast experiments. Network activity during the
application of stimuli follows a monotonic masking function, but a U-shaped
function results when activity both during and after stimulation is
examined. The longer processing time needed for obtaining U-shaped
functions reflects the higher response criterion necessary for obtaining
this function psychophysically. Forward masking, simultaneous presentation,
and backward masking conditions are illustrated. In addition, the model
qualitatively explains mutual masking, decreasing effectiveness of masking
with increasing interstimulus separation, and change to monotonic functions
with stronger masking stimuli, and it makes some predictions. Metacontrast
is interpreted as a handicap which the organism accepts in order to gain the
advantages of stimulus processing by lateral inhibition. Psychophysical and
physiological evidence is cited to support the model.


Backward masking, the obscuring of a brief visual stimulus by a subsequent
visual stimulus, has long been of interest to psychologists because it
apparently works backward in time. Backward masking is called metacontrast
when a patterned masking stimulus obscures a target pattern even though the
two have no spatial overlap. The extensive literature on metacontrast has
been recently and ably reviewed by Kahneman (1968), Raab (1963), and
Weisstein (1968), while Alpern (1952) has reviewed the earlier literature.
Some generalizations from this work can guide attempts to account for
metacontrast effects.

The two types of masking function commonly found are the monotonic, in
which masking is greatest when the two stimuli are presented
simultaneously, and the U-shaped, in which the most effective masking
occurs when the masking stimulus begins about 50-80 milliseconds after
onset of the target. The latter condition, in which both target and mask
are seen if presented simultaneously but the target is


obscured if the mask follows it, is the most paradoxical condition and has
been the most difficult to explain theoretically.

Many of the conditions necessary for obtaining U-shaped metacontrast
functions have been established, though there is disagreement on some
points. The function is obtained only when the masking stimulus surrounds
the target stimulus but shares no common borders with it; an annulus
surrounding a disc or two parallel bars flanking a target bar are common
stimulus situations. Metacontrast is a weak effect, usually measured as an
increase in the number of errors made in discriminating some characteristic
of the target, performance remaining above chance even at the peak of
masking. The U-shaped function is best obtained with equal-energy stimuli,
and it is observed when a subject must do more than simply detect the
presence or absence of the target; he has a high criterion (Kahneman,
1968). Unlike other forms of visual masking, metacontrast can be obtained
dichoptically (Schiller & Smith, 1968), so that the effect cannot be due
entirely to retinal or lateral geniculate interactions. Yet it is still
crucially dependent on spatial parameters because it disappears when the
separation between target and mask increases much beyond [value illegible]
of visual angle, and it is best obtained in the peripheral retina. Masking
is mutual, that is, the perception of both target and mask is interfered
with to some degree, the target stimulus being more strongly inhibited. A
satisfactory theory of metacontrast must account for all of these
characteristics.


THEORIES 


Weisstein (1968) has divided metacontrast theories into three general
groups: confusion, overtake, and inhibition. Confusion theories argue that

the more similar the mask is to the target stimulus, or the more noise
generated by the interaction of the two stimuli, or the more difficult the
spatial or temporal resolution, the harder it will be to discriminate the
target from the overall stimulus situation [Weisstein, 1968, p. 496].

This hypothesis only restates the problem
in more obscure terms and has difficulty 
in accounting for both monotonic and 
U-shaped functions at the same time. One 
explanation of the U-shaped function 
(Kahneman, 1968) notes that the geometry 
of the metacontrast design implies “im- 
possible" apparent motion and the target 
is therefore suppressed, but there is no 
reason why the mask should not be sup- 
pressed instead. Further, apparent motion 
can be seen over a larger visual angle than 
metacontrast suppression. 

The overtake hypothesis, stating that 
the target stimulus is held in a short-term 
buffer store but is replaced by the masking 
stimulus before readout, encounters similar 
problems as well as difficulties in explaining
forward masking. Both of these theories 
generally ignore the uniquely spatial nature 
of metacontrast, and have difficulty in pre- 
dicting both monotonic and U-shaped func- 
tions with the same model parameters. 

A third set of theories is neurologically based and uses lateral inhibition
to account for the reduced responses to the target stimulus. This kind of
inhibition operates in a group of neurons with parallel inputs and outputs
(Figure 1), in which each neuron sends out collateral axons to inhibit its
neighbors in the same layer, either directly or through inhibitory
interneurons.

FIG. 1. Diagrammatic representation of lateral inhibition in a neuronal
network. (Information flows from left to right. Only a few inhibitory
collaterals for a single cell are shown. Inhibition is applied recurrently,
so that each cell in the network can be influenced by the firing of its
nearby neighbors.)

Inhibitory networks are common in sensory systems, providing an anatomical
substrate which is lacking in other explanations of metacontrast. Lateral
inhibition has a role in many other visual phenomena, such as light and
dark adaptation (Ratliff, Knight, & Milkman, 1970), contour enhancement
(Ratliff, Hartline, & Miller, 1963), figural aftereffects (Ganz, 1966),
successive contrast (Singer & Creutzfeldt, 1970), and Mach bands (Mach, in
Ratliff, 1965). Thus an interpretation of metacontrast in terms of lateral
inhibition is parsimonious, invoking only classical neuronal mechanisms.
Furthermore, there is physiological evidence for masking effects in lateral
inhibitory networks. Ratliff et al. (1963) illuminated a group of Limulus
ommatidia and showed that the response of a neighboring ommatidium to light
was decreased after a time delay. Grüsser, Petersen, and Sasowski (1965)
have found metacontrastlike effects due to lateral inhibition in the cat
retina, but the effects were too brief (5-40 milliseconds) to be those
responsible for metacontrast in humans. Schiller (1968) obtained a
monotonic masking function from single cells of the cat lateral geniculate.
While these experiments cannot account for human metacontrast because of
its dichoptic nature, they do show that lateral inhibition can cause some
kinds of backward masking under physiological conditions.


The most complete simulation of meta- 
contrast in the literature is Weisstein’s 
(1968) model, which actually uses a form 
of forward inhibition with a lateral com- 
ponent. Her two-receptor model cannot 
simulate the nonadjacency of stimuli re- 
quired for true U-shaped metacontrast 
functions (two adjacent stimuli will appear 
as a single larger pattern if their onset and 
offset are simultaneous). Though the receptors in her model are identical,
the second-layer cells are not, so that the simulation will not work if
target and mask are interchanged.


THE MODEL 


A stimulus impinging on a network incorporating lateral inhibition causes a
series of disturbances to spread away from the stimulus. Activity in both
the stimulated locus and in nearby areas of the net becomes modified by the
spread of inhibition, and even after stimulation ceases, these oscillations
continue to influence the net until they damp out. Such effects have been
studied in detail by Hartline, by Ratliff, and by others in the Limulus eye
(Dodge, Shapley, & Knight, 1970; Ratliff, 1965). On the basis of his
observations, Ratliff has developed a mathematical model of the system in
which each neuron inhibits neighboring neurons by an amount proportional to
its own activity. If $r_p$ is the ongoing firing rate of neuron $p$, and
$r_j$ is the rate of a nearby neuron $j$, the effect of $r_j$ on $r_p$ is
described as

$$r_p = e_p - k_{p,j}\,(r_j - r^0_{p,j}) \qquad [1]$$

where $e_p$ is the excitatory input to neuron $p$, $k_{p,j}$ is an
inhibitory coefficient with value $0 \le k < 1$ expressing the effect of
$j$ on $p$, and $r^0_{p,j}$ is the corresponding threshold of inhibition.
Interactions also travel in the opposite direction, of course, with $p$
inhibiting $j$. Similar effects connect each neuron with many others, so
that the steady-state firing rate $r_p$ is more completely described by

$$r_p = e_p - \sum_{j=1}^{n} k_{p,j}\,(r_j - r^0_{p,j}), \quad j \neq p \qquad [2]$$

for interactions with $n$ neurons in a one-dimensional net.
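
As an illustration of Equation 1 applied in both directions, the following minimal sketch solves the two-neuron case in closed form. It assumes zero thresholds and the same coefficient `k` in both directions; the function name and the input levels 20 and 10 are hypothetical and are not taken from the article.

```python
def mutual_steady_state(e_p, e_j, k):
    """Steady state of two mutually inhibiting neurons (Equation 1 both ways).

    Assumptions for illustration: zero inhibitory thresholds and one shared
    coefficient k, so r_p = e_p - k * r_j and r_j = e_j - k * r_p.
    """
    if not 0.0 <= k < 1.0:
        raise ValueError("k must satisfy 0 <= k < 1")
    denom = 1.0 - k * k
    r_p = (e_p - k * e_j) / denom
    r_j = (e_j - k * e_p) / denom
    return r_p, r_j


# Hypothetical inputs: a brightly lit cell next to a dimly lit one.
r_p, r_j = mutual_steady_state(20.0, 10.0, 0.3)
print(r_p, r_j, r_p - r_j)
```

Under these assumptions the difference between the two outputs equals $(e_p - e_j)/(1 - k)$, which is larger than the difference between the inputs; this is the contrast-enhancing property of mutual inhibition.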

A similar equation can be written for each neuron in the network, yielding
a matrix of linear equations describing the entire network's interactions.
But the inhibition described in each equation is not truly recurrent, that
is, the action of $j$ on $p$ is found without taking into account the
action of $p$ on $j$. In the model, this nonrecurrence represents the
passage of time; at any time $t$ a neuron is being inhibited by signals
which originated from its immediate neighbors at $t - 1$, from twice as far
away at $t - 2$, etc., so that the basic equation becomes

$$r_p(t) = e_p(t) - \sum_{j=1}^{n} k_{p,j}\,\bigl[\,r_j(t - |p - j|) - r^0_{p,j}\,\bigr], \quad j \neq p. \qquad [3]$$

Applying this equation to each neuron in succession yields a new set of
activities in the network, to which the same equation can be applied again.
After an infinite number of such iterations, the state of the system
converges to that obtained in a recurrent one (Figure 2). A further
discussion of this and other properties of the model is given by Ratliff
(1965, pp. 130-142).

FIG. 2. The effect of repeated iterations of lateral inhibition on a
continuously applied stimulus. (The stimulus is shown on the left; after
one iteration of inhibition, the network's activity is shown at center.
After a large number of iterations, activity converges to that shown at
right. The same result is obtained by solving simultaneous linear equations
for each neuron in the network.)

Equation 3 is a special case of the Limulus model with a simple time lag of
inhibition as presented by Ratliff, Hartline, and Lange (1966), which
according to those authors gives qualitatively correct predictions of
Limulus eye responses in most cases. The present model restricts inhibitory
effects to a constant apparent conduction rate, which may be an
oversimplification in the Limulus, where inhibitory latency is related
partly to (a) the time needed to reach the inhibitory threshold (Lange,
Hartline, & Ratliff, 1966; Ratliff et al., 1966) and partly to (b) the
activity of the ommatidia exerting the inhibition (Ratliff et al., 1966).
In this application of the model to a mammalian system, inhibitory
thresholds are set to zero, and the average activity of cells exerting
inhibition is similar in all simulated conditions, minimizing the
influences of threshold and activity on apparent conduction time under the
simulated conditions, so a constant conduction speed is assumed. Wuttke and
Grüsser (1968), in fact, find a constant conduction rate of lateral
inhibition in cat retina.

Because this simulation involves assumptions about central structures in
the nervous system, no thresholds for lateral inhibition are used. A
background level of firing is maintained so that excitation and inhibition
can modify the final firing level by means of a dendritic microstructure of
graded potentials which can register even very small changes (Pribram,
1971, Ch. 1).

Inhibitory coefficients $k_{p,j}$ were chosen to follow the results of
Barlow (1969), who found a slight strengthening of inhibition at short
interommatidial distances in Limulus followed by a decline in effects for
more distant elements. The coefficients used here closely follow the shape
of Barlow's "vertical" curve (extrapolated to 0), condensing its lateral
spread; the first two ks of .3 bracket the peak of inhibition, while
further ks of .1 and 0.0 follow the decline. The coefficients are larger
than those found in Limulus (Ratliff et al., 1966) to compensate for the
lack of effects from the second dimension in a one-dimensional network.
Inhibition is symmetrical on each side of a neuron.
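
The time-lagged rule of Equation 3 and its convergence to the steady state of Equation 2 can be sketched as follows. This is a minimal illustration, not the program used in the article: the coefficients .3, .3, .1 come from the text, while the function names, the silent starting state, the input levels (background 10, step 20), and the wraparound indexing of a 24-cell ring are assumptions made for the example.

```python
# Inhibitory coefficients indexed by distance |p - j|, as given in the text:
# .3 for the two nearest cells on each side, .1 for the third, 0.0 beyond.
K_BY_DISTANCE = {1: 0.3, 2: 0.3, 3: 0.1}


def step(history, e):
    """Apply Equation 3 once, with all inhibitory thresholds set to zero.

    history[d - 1] holds the network state d iterations ago, so a cell at
    distance d contributes the activity it had d iterations earlier.
    """
    n = len(e)
    new_state = []
    for p in range(n):
        inhibition = 0.0
        for d, k in K_BY_DISTANCE.items():
            past = history[d - 1]
            # Symmetrical inhibition from both sides; indices wrap around
            # (assumption: the ends of the net are joined into a ring).
            inhibition += k * (past[(p - d) % n] + past[(p + d) % n])
        new_state.append(e[p] - inhibition)
    return new_state


def run(e, iterations):
    """Iterate Equation 3 under a constant input e; return the final state."""
    n = len(e)
    # Assumption: the net starts from a silent state.
    history = [[0.0] * n for _ in K_BY_DISTANCE]
    for _ in range(iterations):
        state = step(history, e)
        history = [state] + history[:-1]
    return history[0]


def equation2_residual(r, e):
    """Largest violation of the steady-state Equation 2 (zero thresholds)."""
    n = len(r)
    worst = 0.0
    for p in range(n):
        total = sum(k * (r[(p - d) % n] + r[(p + d) % n])
                    for d, k in K_BY_DISTANCE.items())
        worst = max(worst, abs(r[p] - (e[p] - total)))
    return worst


# Hypothetical input: background of 10 with a step to 20 on four center cells.
e = [10.0] * 24
for i in range(10, 14):
    e[i] = 20.0

r = run(e, 300)
print(equation2_residual(r, e))  # close to zero once the iteration has settled
```

With a uniform input the settled rate is the input divided by one plus the summed coefficients (here 1 + 2 × 0.7 = 2.4), which gives a quick check on the arithmetic.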

Lange et al. (1966) present a more 
developed version of the same additive 
model of spatial and temporal interactions 
in Limulus, where the state of the network 
is defined for each millisecond of time so 
that influences can be traced impulse by 
impulse, and durations of postsynaptic 
potentials, etc., can be simulated precisely. 
Some of their assumptions, specifically the 
generation of spikes and their direct addi- 
tion to a lateral inhibitory pool, do not 
apply to a vertebrate system in which 
lateral inhibition is mediated by inter- 
neurons which themselves can be stimulated 
to change the characteristics of inhibition 
(Hull, 1968; Kalil & Chase, 1970). Later 
refinements of the additive system (Knight, 
Toyoda, & Dodge, 1970; Ratliff, Knight, & 
Graham, 1969; Ratliff et al., 1970) have 
modeled temporal effects more generally, 
in formats appropriate for analyzing re- 
sponses to sinusoidal stimulation. 


APPLICATION TO METACONTRAST 


The following simulations use a revision of a PDP-8 computer program
written by R. W. Phelps and D. N. Spinelli (unpublished). Using Equation 3
to calculate lateral inhibitory effects, it prints the network's state
after each iteration. Stimuli can be added or removed after each iteration.
To avoid instabilities, the ends of the network are tied together to form a
circle, and simulations were stopped when effects from stimuli at the
center spread to the ends, preventing effects from intersecting across the
"seam." The program solves Equation 3 for each neuron in the network
successively; the response $r_p(t)$ is printed out for each neuron and
stored. For the next time iteration, the $r_p(t)$ values are moved to
$r_p(t - 1)$ and used to evaluate $r_j(t - 1)$ for effects on adjacent
cells; in like manner, earlier matrices $r_p(t - n)$ are stored for use in
calculating interactions with more distant cells.

With this program, the effects of lateral 
inhibition on a brief stimulus can be ob- 
served directly by a three-stage process. 
First, the network is "adapted" to a steady state by giving each of the
neurons a "background" stimulation level $e_p$ for several iterations. (It
was discovered early that one must simulate all stimulus conditions, not
just the interesting ones!)
After adaptation, a stimulus is applied by 
increasing the input values for some neurons 
for a few iterations, and finally the stimulus 
is "turned off" by reassigning the back- 
ground stimulation level to all neurons, the 
network again going through successive 
iterations of inhibition. 

Stimulus parameters for the metacontrast simulation were chosen to follow
as closely as possible those used in behavioral masking experiments. If the
one-dimensional network is considered as a cross section through the center
of a stimulus array, a disc would appear as a step increase in the
excitation of some elements (Figure 2, left). In the 24-cell array used
here, the center four cells represent the target stimulus of metacontrast
studies. The masking stimulus, a cross section of an annulus, consists of
two groups of two neurons each separated by one neuron from an edge of the
disc. The target and mask have equal area, intensity, and contrast, so that
all of the parameters of metacontrast stimuli are simulated. The two
stimuli can also be interpreted as a cross section of a single bar (target)
and two flanking parallel bars (mask).
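
The stimulus layout and the three-stage sequence (adaptation, stimulation, return to background) can be written down as an input schedule, one input vector per iteration. The cell positions follow the description above; the background and stimulus levels, the number of adaptation and settling iterations, and all names are hypothetical choices for this sketch. The figure of 30 milliseconds per iteration is the one the article adopts.

```python
N_CELLS = 24
TARGET = [10, 11, 12, 13]   # center four cells: cross section of the disc
MASK = [7, 8, 15, 16]       # two pairs, each one cell away from a disc edge

BACKGROUND = 10.0           # hypothetical background input level
STIMULUS = 20.0             # hypothetical input level while a stimulus is on
MS_PER_ITERATION = 30       # real time represented by one iteration


def frame(active_cells):
    """Input vector with the listed cells raised above background."""
    return [STIMULUS if i in active_cells else BACKGROUND
            for i in range(N_CELLS)]


def schedule(soa, adapt=10, duration=2, settle=8):
    """Inputs for every iteration of one masking condition.

    soa is the mask onset minus the target onset, in iterations:
    positive for backward masking, 0 for simultaneous presentation,
    negative for forward masking.
    """
    target_on = adapt + max(0, -soa)
    mask_on = adapt + max(0, soa)
    total = max(target_on, mask_on) + duration + settle
    frames = []
    for t in range(total):
        active = set()
        if target_on <= t < target_on + duration:
            active.update(TARGET)
        if mask_on <= t < mask_on + duration:
            active.update(MASK)
        frames.append(frame(active))
    return frames


def soa_ms(soa):
    """Stimulus onset difference in milliseconds of real time."""
    return soa * MS_PER_ITERATION


frames = schedule(soa=2)    # disc for two iterations, then annulus for two
print(len(frames), soa_ms(2))
```

Calling `schedule(0)` or `schedule(-2)` gives the simultaneous and forward masking sequences with the same stimuli.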

Each iteration of the lateral inhibitory 
equations represents 30 milliseconds of 
time, which is approximately the latency 
of reciprocal lateral inhibition of on-center 
and off-center cells in the cat lateral 
geniculate (Singer & Creutzfeldt, 1970). 
This is not to imply that metacontrast 
occurs in the geniculate by this mechanism, 
but only that 30 milliseconds is a physio- 
logically reasonable latency for lateral in- 
hibitory effects in mammals. Metacontrast 
should be obtained at two iterations, or 60 
milliseconds between the target and mask- 
ing stimulus onsets. 

The effect of the disc stimulus on the network is shown in Figure 3 (top),
where the stimulus is applied for two iterations of lateral inhibition and
is turned off at the dashed vertical line. The curves introduced in the
second iteration of stimulation represent Mach bands. The oscillations
occurring after stimulation ceases are a function only of the stimulus and
the network characteristics; they are just as specific to the disc as the
iterations with the stimulus on. Here a distinction made by Stigler (1910),
in the paper where he coined the term "metacontrast," is useful. He called
the biological results of a physical stimulus during its appearance the
"homophotic" stimulus, and those after its disappearance the "metaphotic"
stimulus. The network activity before the dashed line represents
homophotic stimulation while that after the line represents metaphotic
stimulation. Any learning or recognition mechanism associated with this or
subsequent cell layers would encounter both homophotic and metaphotic
stimuli, and both would be equally correlated with disc stimulation. Though
the metaphotic oscillations do not look like the disc, they are specific to
it, an example of Shepard and Chipman's (1970) "second-order isomorphism."
Because they represent the stimulus after its physical disappearance, they
suggest a mechanism for short-term "iconic" storage (Neisser, 1967, Ch. 2).
Both the iconic store and the oscillations are modality and stimulus
specific, with significant decay in less than one second.

FIG. 3. Simulation of metacontrast. (Top: result of stimulation by the
target (disc) stimulus for two iterations. At the dashed vertical line,
input is reset to background and the system allowed to oscillate. Middle:
metacontrast. The disc is exposed for two iterations. As it is removed, the
annulus is applied and also exposed for two iterations. At the second
dashed vertical line, all stimulation is reset to background. Stimulus
onset difference represents 60 milliseconds of real time. Bottom:
stimulation by the mask (annulus) for two iterations. Stimuli are aligned
so that direct comparison of effects at corresponding times after stimulus
onset is possible. Oscillations after removal of all stimuli in the
metacontrast condition are more similar to the annulus than to the disc
patterns. Ordinate: firing rate. Abscissa: time.)

A subject encounters a dilemma when asked to discriminate some feature of a
disc presented for a brief time. If the stimulus is present for a shorter
time than that needed for perception, the subject must use both the
homophotic and the metaphotic stimuli in order to be stimulated long enough
for perception to take place. If the metaphotic activity pattern is
altered, perception is impaired even while simple detection, taking a
shorter time, might still be possible. The curves shown below could be
transformed inputs to either a template-matching or a feature-detection
perceptual system; no assumptions are required about the perceptual process
itself except that it takes time. These ideas can be used to account for
both monotonic and U-shaped metacontrast functions with identical stimuli.

A complete metacontrast simulation is presented in Figure 3 (middle), where
the disc is first exposed for two iterations and then turned off. At this
time the masking annulus is also exposed for two iterations, after which
the network is allowed to oscillate with the input set at the background
level. For comparison, Figure 3 (top and bottom) also shows the same
network stimulated with the disc only and the annulus only at the
corresponding times. It is clear that the metaphotic afteroscillations in
the metacontrast situation are similar to those generated by the annulus
alone, but are very different from the activity resulting from stimulation
by the disc alone, so that information specific to the annulus is available
in the metacontrast condition, while information specific to the disc is
present for too short a time to allow perception to occur.

The situation is quite different when the 
target and masking stimuli are presented 
simultaneously as shown in Figure 4, again 
with disc-only and annulus-only patterns 
for comparison. The metaphotic stimulus 
in this case has similarities to both the disc 
and the annulus conditions, but is distinct 
from both. One can speculate that the 
distinctive oscillations give this figure a 
uniqueness in perception, and therefore in 
language; it is called not “disc and ring," 
but "bull's-eye." 

Another necessary control condition is the forward masking paradigm,
presenting first the masking stimulus and then the target (Figure 5). In
this condition, the afteroscillations look at first similar to those
generated by the disc alone at the corresponding times after stimulation,
but later the annulus effects reassert themselves so that both stimuli
should be perceived.

FIG. 4. Simultaneous presentation of target and mask. (Top: disc
stimulation only, ending at dashed vertical line. Middle: stimulation with
both stimuli simultaneously for two iterations, ending at dashed vertical
line. Bottom: annulus stimulation only. Oscillations after stimulus offset
for the middle condition are intermediate between those for presentation of
either stimulus alone. Ordinate: firing rate. Abscissa: time.)

The relationships shown in these results were quantified with the technique
of cross-correlation, which was used because it is a way of comparing two
curves with a minimum of assumptions (Hays, 1963, p. 510). Each point in
Figure 6 represents a cross-correlation of the network's response to a
masking condition with the response due to one stimulus alone. The
cross-correlation is repeated for each iteration of lateral inhibition,
yielding a line which represents the similarity of a masking condition to
the disc-only or the annulus-only response over time. Any stimulus array
which creates a high correlation with the activity created by the disc, for
example, will be recognized by the system as disclike.
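
The comparison described above can be sketched as one correlation coefficient per iteration, computed across the cells of the network. The sketch assumes that the cross-correlation is the ordinary product-moment (Pearson) coefficient between two activity patterns at the same iteration; that reading, the function names, and the handling of a completely flat pattern are assumptions, not details given in the article.

```python
from math import sqrt


def pearson(x, y):
    """Product-moment correlation between two equally long activity patterns."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0.0 or syy == 0.0:
        # Assumption: a flat pattern carries no shape information,
        # so its similarity to anything is reported as zero.
        return 0.0
    return sxy / sqrt(sxx * syy)


def similarity_over_time(masking_run, single_run):
    """One coefficient per iteration.

    Each argument is a list of network states (one list of firing rates
    per iteration), aligned so that equal indices mean equal times.
    """
    return [pearson(m, s) for m, s in zip(masking_run, single_run)]


# Toy patterns only, to show the behavior of the measure.
print(pearson([1.0, 2.0, 3.0], [2.0, 4.0, 6.0]))   # same shape
print(pearson([1.0, 2.0, 3.0], [3.0, 2.0, 1.0]))   # mirror-image shape
```

Because the coefficient ignores overall level and scale, it compares only the shapes of the two activity patterns, which is what the similarity argument requires.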

Figure 6 shows cross-correlations for all of the curves illustrated in
Figures 3-5. The left graph shows that the network's response to the
control or forward masking condition (Figure 5) correlates fairly well with
the response to either of the stimuli alone. The correlation is not perfect
for either, of course; this reflects the slight mutual interference seen in
all masking experiments, where thresholds for discrimination of both
stimuli are raised slightly. Such mutual effects are not explainable by
"overtake" theories. The middle graph, for simultaneous disc and annulus
presentation (Figure 4), shows somewhat depressed but still highly positive
correlations with both single stimuli. Perception can still be quite good
in this situation because of the safety factor built into all biological
systems; correlations need not be perfect for recognition to occur. The
third graph, however, shows a very different result. For this metacontrast
condition (Figure 3), the mask's appearance reduces the correlation of the
masking condition with the disc to near zero and holds it there. The disc's
metaphotic phase is effectively erased from the network. Thus, if
perception requires a fairly high correlation during the time of analysis,
perceptual errors will be low for the first two conditions but high for the
third. At longer intervals, of course, perception can be completed before
the masking stimulus occurs. These effects reflect the U-shaped masking
function.

FIG. 6. Cross-correlations. (Each point represents the cross-correlation of
the activity generated in the disc-only or the annulus-only condition with
that generated in a masking condition. The time axis in each graph is that
of Figures 3-5, yielding lines specifying the degree of similarity of
single and combined stimuli over time for each condition. Curves: annulus
with combined stimuli; disc with combined stimuli. Solid lines represent
the visibility of the disc. Left: forward masking. Center: simultaneous
presentation. Right: backward masking. For the metaphotic period (after
stimulus offset), visibility of the disc is minimal in the backward masking
condition. For the homophotic period (during stimulation), visibility is
minimal for simultaneous target and mask presentation. Ordinate:
correlation coefficient. Abscissa: time.)

For a detection task, with a lower criterion, the result is different. If
detection can take place quickly, while the homophotic stimulus prevails, a
low-criterion masking can be obtained by examining only the first two
iterations of the disc's cross-correlation with the masking condition in
each graph (solid lines). These cross-correlations are at a minimum for
simultaneous disc and annulus presentation, meaning that detection errors
will be maximal here, reflecting the monotonic masking function. Thus both
the U-shaped and the monotonic masking functions can be obtained from the
same patterns of activity, depending on the subject's criterion level.


DISCUSSION 


This simulation began with a modified Hartline-Ratliff neural net excited
with spatial and temporal patterns like those used in visual masking
experiments. Under the appropriate conditions, both U-shaped and monotonic
metacontrast functions were obtained, and some mutual masking was also
observed. A complete theory of metacontrast, however, must also account for
several other important properties of the effect. One of these is the
decreasing effectiveness of masking as the angular distance between target
and mask increases. In the model simulated here, where interaction effects
travel one neuron laterally for each time iteration, effects from two
stimuli begin to interact one additional iteration after stimulus onset for
each cell interposed between them. Because inhibitory effects are weaker at
larger distances, each metaphotic stimulus is less distorted by the other
and less masking results. With greater separations, each stimulus has its
characteristic oscillations in a different part of the network. In fact the
cross-correlation in the brain may be carried out over relatively
restricted areas of the visual array. Presence of neural
noise would tend to reduce all of the
correlations, especially at larger | 
and distances. 

Many masking studies find the relative energy of the target and masking
stimuli to be of crucial importance. If the mask is very much stronger than
the target, monotonic functions are obtained (Fehrer & Smith, 1962; Kolers,
1962). A strong mask reduces cross-correlations of the masking condition
with the disc-only condition simply because the high energy of the mask
creates a very large disturbance which reduces the target's relative
contribution to the total activity. The relative energy of the stimuli,
rather than their spatial characteristics, becomes the crucial parameter
and the situation becomes similar to that of masking by light flashes.

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Though this network can be transposed 
only with great care from the Limulus eye 
to the human cortex, there is evidence that 
such a network comprises at least part of 
the functional anatomy of the cortex 
(Creutzfeldt, Baumgartner, & Schón, 1956; 
Krnjevic, Randic, & Straughan, 1964). An 
additive model of excitation from a recep- 
tive field center and inhibition (or off 
response) from the periphery can account 
for much of the experimental data on the 
temporal responses of cat retinal ganglion 
cells to sinusoidal stimulation (Maffei & 
Cervetto, 1968; Maffei, Cervetto, & Fioren- 
tini, 1970). Lateral inhibition would create 
circular receptive fields with an antagonistic 
surround; even in monkey visual cortex, 
these comprise about one-quarter of all receptive fields (Spinelli,
Pribram, & Bridgeman, 1970). This figure suggests that though a functional
lateral inhibitory network may be operating in visual cortex, other modes
of neuronal organization are also present. The same authors found that
about one-third of the cells they examined could not be influenced by the
same simple visual stimuli which could effectively elicit receptive fields
in nearly all cells of cat visual cortex and in other monkey cortex cells.
These "uncommitted" cells do not fit into the lateral inhibitory scheme and
become more common near the foveal projection area. A different sort of
processing may be occurring there, more powerful but requiring many more
neuronal components. The uncommitted cells may explain why metacontrast is
not obtained in the fovea.

Another anatomical alternative for the model is that either lateral or
forward inhibition could be applied to incoming cells at several stages of
the visual pathway. This is mathematically equivalent to lateral inhibition
at the final stage, implying that lateral inhibition in the cortex is not
necessary for this scheme to be effective but only an addition of the
inputs from the two eyes. Lateral inhibition is well documented at both
retinal (Grüsser et al., 1965; Wuttke & Grüsser, 1968) and lateral
geniculate (Singer & Creutzfeldt, 1970) levels.

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experimental data on metacontrast, this 
theory makes some predictions. At present 
there is a lack of good psychophysical data 
on the nature of the change from monotonic 
to U-shaped metacontrast functions as re- 
sponse criterion changes. If response 
criterion is varied systematically in terms 
of modern decision-theory concepts, the 
lateral inhibitory model presented here 
predicts that subjects should switch gradually
from monotonic to U-shaped masking
functions as their response criterion is
increased, with no change in stimulus
parameters.
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Under normal stimulus conditions, the 
oscillations of neuronal activity induced by 
a stimulus damp out to ready the system 
for new input, while in metacontrast the 
oscillations of two stimuli interact so that 
the metaphotic phase of the target stimulus 
is distorted. The interaction of oscillations 
can be pushed further, as illustrated in 
Figure 7. By overdriving a lateral inhibi- 
tory network capable of contour enhance- 
ment, continual and abnormally large 
oscillations could be set up. The resulting 
instability might cause the striking move- 
ment effects and aftereffects created by 


rned stimulation. (Movement effects 
rk in the sensory system.) 
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viewing Figure 7, which hints at the ise 
of the transformations mediated by lateral
inhibition. The mathematical requirements
for amplification by lateral inhibition
have been developed by Ratliff et al.
(1969). After processing by the lateral
inhibitory network, visual information has 
some rather special properties; it is no 
longer isomorphic with the retinal stimulus, 
and it has been distributed in space and 
time. Figures 3 through 5 show that the 
distribution of effects across the network 
is much wider than the extent of direct 
anatomical connections. Enhanced spatial 
distribution increases the reliability of the 
system because information about the 
stimulus is now coded in many more ele- 
ments than the original input neurons, 
and information readout to another level 
can be preserved despite failure of readout 
from any few neurons. While remaining 
homotopically organized, visual informa- 
tion is spread in such a way that any retinal 
locus is represented by perturbations over 
a wide cortical area, and in like manner one 
cortical locus encodes information about a 
wide area of the visual world. The extent 
of this spread is shown by McIlwain's
(1964) finding that many single visual cells
even in the retina and lateral geniculate of
the cat can be influenced by some stimuli as
much as 90° from the centers of their
receptive fields. This sort of spatial re- 
dundancy is suggestive of Pribram's (1969) 
holographic, or interference-pattern, theory 
of perceptual and memory coding. On a 
more microscopic physiological level, out- 
put from the lateral inhibitory system 
would be an ideal input to Spinelli's (1970) 
OCCAM memory model. 

The model proposed here is a mathematical-neural
model of a substrate whose
quantitative properties have not yet been 
defined in vertebrates, though each param- 
eter has a physiological meaning. The 
metacontrast effects described above, how- 
ever, are robust; the entire simulation has 
been repeated with a single inhibitory 
coefficient of .4, yielding narrower oscillations
but similar masking results. Recent
studies by von Békésy (1971) show that
metacontrastlike inhibitory phenomena also 
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occur in the auditory system in the same 
time range as visual metacontrast, about 
60 milliseconds between stimuli. He refers 
to the effect as backward inhibition, a term 
which is defined behaviorally and should 
not be confused with lateral inhibition, 
which is defined anatomically. The present 
paper supports von Békésy's suggestion 
that backward inhibition may indeed be 
mediated by a lateral inhibitory network, 
and metacontrastlike effects may be found 
wherever lateral inhibition is used in
sensory systems.


The theory applied here to metacontrast 
was not developed for the purpose of 
accounting for masking data. The lateral 
inhibitory equations were originated to 
explain spatial and temporal interactions 
observed in Limulus and other eyes, and 
even this very rough model when combined 
with some psychological data can account 
for metacontrast phenomena better than 
existing theories without the postulation
of any other neuronal processing. The
approach invokes no mechanisms which are 
not known to be present in the nervous 
System; metacontrast falls out as an arti- 
fact of the lateral inhibitory w 
cessing information, a price 
organism pays for gaining 
the network to perform 
such as intensity 
enhancement. 


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"TEMPORAL AUDITORY ACUITY ! 
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GREEN? 
Temporal auditory acuity, the shortest time interval within which the ear
can discriminate the order of auditory events, is reviewed. Experiments are
reported on transient signals in which the task is to discriminate two brief
signals that differ only in their phase spectra. Periodic stimuli are next
reviewed and it is argued that the same temporal acuity may account for these
results. Finally the paper concludes with a discussion of the relation of this
work to the research on residue pitch. The review indicates that the value
for temporal auditory acuity is about 1 to 2 milliseconds.


The ear's ability to extract information 
from an acoustic waveform depends on its 
ability to correctly discriminate among the 
spectral contents of different waveforms 
and to correctly discriminate the order in 
which these waveforms occur. Considerable 
effort has been directed at studying the 
ear's capacity to discriminate changes in the 
spectrum. There are numerous studies of 
the just discriminable change in frequency 
Δf or amplitude ΔI of a sinusoid. There
are also numerous studies of discrimination
of a slight change in a broadband spectrum 
such as noise. The entire area of masking, 
studies of how one sound obscures an- 
other, can be viewed as investigations of 
the limits of such spectral discrimination. 
Very few studies, however, have been 
directed at determining the ear's capacity 
to discriminate changes in temporal order, 
that is, what I will call temporal auditory 
acuity. Speech and music are but some 
of the many acoustic waveforms where the 
temporal order of the acoustic elements is 
vital in the appreciation of the information 
content. This paper is devoted to a review 
of some of the recent efforts to explore 
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Foundation. The paper was initially prepared as an 
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whom various aspects of this work have been pur- 
sued in our laboratory for the past two years. 
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temporal auditory acuity and to sum- 
marize some findings. Essentially we want 
to know the limits of temporal resolution. 
What is the shortest time interval within 
which the ear can discriminate the order of 
auditory events? 

One of the major technical problems
encountered in studies of auditory acuity
is that it is difficult to generate two
auditory stimuli which differ only in
temporal order. Often alteration in the
temporal waveform produces a number of
other unwanted cues, for example, slight
differences in the energy spectrum. These
spectral alterations may then become the
basis for discrimination between the waveforms
rather than the temporal differences.
In the work described, the energy spectrum
of the stimulus was always held
constant; thus it can never serve as the
basis for discrimination. The waveform is
changed by manipulating the phase spectrum.
One, therefore, could describe this
work as a study of phase perception. While
this is an accurate description, I have not
found it particularly useful in understanding
the results. In thinking about
mechanisms responsible for discriminating
two waveforms that differ only in their
phase spectra, I find it simpler and more
concrete to think of differences in the order
of temporal events within the waveform
rather than differences in phase spectra.

The basic procedure used to study
auditory acuity is analogous to that used
to study visual acuity, that is, simply
establish some difference in temporal order
between two stimuli and then progressively
reduce the total duration of the stimuli
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until the different orders can no longer be 
perceived. For a concrete example, think 
of two clicks, one about half the amplitude 
of the other. If we play them in one order, 
we hear loud-soft; in the opposite order 
soft-loud. No matter which order was 
used, the energy spectra of the pair of 
clicks are the same; the pairs differ only 
in phase, that is, the order of events in 
time. Now let us gradually reduce the 
time between the clicks until both clicks 
occur within some very short interval. 
It was surprising to find that subjects 
could reliably make such discriminations 
even when the duration of the two clicks 
was only 1 or 2 milliseconds (msec.).
The author and his collaborators have performed
a number of similar experiments to
investigate this ability under a variety of 
circumstances. The limit of temporal reso- 
lution was always 1 or 2 msec. 
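
The claim that the two click orders share one energy spectrum and differ only in phase can be checked numerically. The following is a minimal sketch, assuming idealized one-sample clicks and an arbitrary spacing of eight samples; the function name `click_pair` and all parameter values are illustrative and do not come from the experiments described.

```python
import numpy as np

def click_pair(first_amp, second_amp, gap_samples, n_samples=64):
    """Two one-sample clicks separated by gap_samples, zero elsewhere."""
    x = np.zeros(n_samples)
    x[0] = first_amp
    x[gap_samples] = second_amp
    return x

# Assumed amplitudes: one click about half the amplitude of the other.
loud_soft = click_pair(1.0, 0.5, gap_samples=8)
soft_loud = click_pair(0.5, 1.0, gap_samples=8)

spec_ls = np.fft.rfft(loud_soft)
spec_sl = np.fft.rfft(soft_loud)

# The energy (magnitude-squared) spectra of the two orders coincide ...
same_energy = np.allclose(np.abs(spec_ls) ** 2, np.abs(spec_sl) ** 2)
# ... while the phase spectra do not.
same_phase = np.allclose(np.angle(spec_ls), np.angle(spec_sl))

print(same_energy, same_phase)
```

Shrinking `gap_samples` corresponds to moving the clicks closer together in time; the equality of the energy spectra holds for any spacing, so spectral energy can never be the cue.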

A general question is whether the same 
temporal acuity is possible with steady- 
state as well as transient signals. There 
are several ways to tackle that question 
and some preliminary results are presented. 
In the limit, one can present periodic 
stimuli and ask if the subject is able to 
discriminate phase changes among the 
periodic components. This is a topic 
closely related to pitch perception or 
"residue" pitch and is rather remote from 
the original starting point; nevertheless, 
it now seems that the mechanism is
basically the same and that these different 
experiments merely explore different as- 
pects of the same basic process. 


TEMPORAL INTEGRATION 


The majority of the traditional work on
temporal parameters of the auditory system
concerns integration time. In these
experiments, the subject attempts to detect
some weak signal either in a background
noise or in quiet. The threshold of this
weak signal is measured as a function of
its duration. Typically one finds that the
signal's detectability is the same if the
product of signal duration and intensity
is held constant (energy constant) at
least over some limited range of durations.
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Generally, this equal energy rule holds for 
signal durations of up to 100 to 200 msec. 
Zwislocki (1960), in his theory of temporal 
integration, has written and summarized 
a great deal of the information in this area. 
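
The equal energy rule can be written out as a small calculation. This is only a sketch: the 200-msec. critical duration is taken from the upper end of the range quoted above, the decibel reference is arbitrary, and the function name `relative_threshold_db` is an illustrative assumption, not a model from Zwislocki (1960).

```python
import math

def relative_threshold_db(duration_ms, critical_duration_ms=200.0):
    """Threshold intensity in dB relative to the long-duration threshold.

    Under the equal-energy rule, intensity * duration is constant up to the
    critical duration; beyond it, lengthening the signal gives no further gain.
    """
    effective = min(duration_ms, critical_duration_ms)
    return 10.0 * math.log10(critical_duration_ms / effective)

for d in (10, 20, 50, 100, 200, 400):
    print(f"{d:4d} msec.  {relative_threshold_db(d):5.1f} dB")
```

Under these assumptions each halving of duration raises the threshold by about 3 dB, and each tenfold reduction raises it by 10 dB.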
It would be parsimonious, but untrue,
to believe that since the ear can add up
information over a 100-msec. interval,
it cannot distinguish how the information
arrives within that interval. Certain 
simple mechanical analogies might work 
that way, but the ear is quite different. 
Hirsh, for example, has established that 
one needs only about 25 msec. to distin- 
guish differences in starting times between 
a high- and low-frequency sinusoid. Pat- 
terson and Green (1970), using a different 
psychophysical procedure and a much 
shorter signal, found that discrimination 
is possible when the total duration of the 
transients is only 2 msec. The latter 
experiments were carefully controlled to 
rule out spectral differences as being the 
basis for the discrimination. The dis- 
crimination is based on some sort of 
qualitative information. One of the tran- 
sients sounded like “tick,” the other 
“tock.” Before describing this experiment 
in more detail, let us consider the general 
problem of discriminating between short 
transient signals and how this ability is 
related to integration processes.


TEMPORAL ACUITY AND THE EAR'S
MINIMUM TIME-CONSTANT


The response of any realistic physical
system is a function of the present input
and some combination of past inputs. If
the variation in the input is too rapid, the
inertia in the system smooths that input
variation and produces an output that is
some weighted combination of the recent
past inputs. If the system is linear, the
output is simply a weighted average of
past inputs. One of the simplest linear
systems has an output that is a simple
exponential weight of past inputs. Recent
values of the input are given larger weight,
whereas more remote values are given less
weight. The parameter of this exponential
decay is called the time constant of the
device and represents the memory of the
system. More complicated systems may not
have simple exponential weighting functions
but if they are linear, one can still
represent the present output as a weighted
average of past inputs. The weighting
function is simply more complicated. In
all systems, both linear and nonlinear, it is
important to know how much of the past
affects the present output.
Now consider how these systems will 
behave when confronted with two clicks. 
Figure 1 illustrates the output of the 
system to the soft-loud and loud-soft
click pairs. The output is illustrated assuming
two quite different time constants.
Because of the sluggish response of the 
long time-constant device, the order in 
which the clicks arrive would probably
not create a noticeable difference in the
overall response. On the other hand, the
form of the output for the short time-constant
device is obviously heavily dependent
on the order in which the clicks
occur. Incidentally, Ronken (1970) has
shown that observers can distinguish the
order of two such pairs of clicks down to
about 2 msec.

FIG. 1. Response of two simple linear systems to a pair of inputs; only the order of the clicks differs.

Figure 1 illustrates the general experi- 
mental procedure, at least in its conceptual 
form. By determining the limits of the 
discrimination of temporal order for brief 
acoustic sounds, we can determine the
minimum time-constant of the auditory
system. For inputs shorter than this
time, the response of the system is apparently
some combination of events of
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the recent past in which order information 
is lost. 
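
The contrast between a sluggish and a fast linear system can be sketched with a simple exponential weighting of past inputs. Everything numerical here is an assumption made for illustration: the sampling rate, the 2-msec. click spacing, the two time constants (0.2 and 20 msec.), and the relative RMS difference used to summarize how much the output depends on click order. It is not a model of the ear.

```python
import numpy as np

FS = 20_000  # samples per second (assumed)

def click_pair(first_amp, second_amp, gap_ms, total_ms=200.0):
    """Two one-sample clicks gap_ms apart inside a silent window."""
    x = np.zeros(int(FS * total_ms / 1000))
    x[0] = first_amp
    x[int(FS * gap_ms / 1000)] = second_amp
    return x

def exponential_system(x, time_constant_ms):
    """Output is an exponentially weighted sum of present and past inputs."""
    alpha = np.exp(-1000.0 / (FS * time_constant_ms))  # decay per sample
    weights = (1 - alpha) * alpha ** np.arange(len(x))
    return np.convolve(x, weights)[: len(x)]

def order_effect(time_constant_ms, gap_ms=2.0):
    """Relative RMS difference between responses to the two click orders."""
    y_ls = exponential_system(click_pair(1.0, 0.5, gap_ms), time_constant_ms)
    y_sl = exponential_system(click_pair(0.5, 1.0, gap_ms), time_constant_ms)
    return np.linalg.norm(y_ls - y_sl) / np.linalg.norm(y_ls)

short = order_effect(time_constant_ms=0.2)   # fast device
long_ = order_effect(time_constant_ms=20.0)  # sluggish device
print(short, long_)
```

With these assumed values the short time-constant device gives a relative difference of roughly 0.63 and the long time-constant device roughly 0.15, which mirrors the qualitative point of Figure 1: order information survives only when the time constant is short compared with the click spacing.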

We must be very careful, however, about 
details of this experiment when we apply it 
to the sense of hearing. It is well known 
that the ear performs a limited spectral 
analysis and hence we must be very careful 
to insure that order of arrival is the only 
cue available to the subject. If the clicks 
differed in energy spectrum, then the 
discrimination might be based on spectral 
differences and the experimental results 
would then reflect the quality of spectral 
analysis rather than parameters of tem- 
poral processes. To illustrate the im- 
portance of this problem, suppose we ask 
what is the minimum gap between two 
10-microsecond clicks that the ear can 
reliably detect. Leshowitz (1971) has 
shown that the minimum gap is 6 micro- 
seconds and that the basis for the dis- 
crimination is undoubtedly slight differ- 
ences in the energy spectra in the region 
of 10,000 to 15,000 hertz (Hz). The 
detection of the gap is, therefore, not a 
temporal discrimination but rather a dis- 
crimination of difference in intensity at 
certain regions of the spectrum. 
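
The size of the spectral cue in such a gap experiment can be estimated from the Fourier transform of the click pair. The sketch below assumes ideal rectangular clicks of equal amplitude, which is an idealization and not a description of the stimuli Leshowitz used; the function names are illustrative.

```python
import numpy as np

def single_click_spectrum(freq_hz, width_s):
    """Magnitude spectrum of one rectangular click of the given width."""
    return width_s * np.abs(np.sinc(freq_hz * width_s))

def double_click_spectrum(freq_hz, width_s, gap_s):
    """Magnitude spectrum of two identical clicks separated by a silent gap."""
    onset_separation = width_s + gap_s
    return (2 * width_s * np.abs(np.sinc(freq_hz * width_s))
            * np.abs(np.cos(np.pi * freq_hz * onset_separation)))

def gap_effect_db(freq_hz, width_s=10e-6, gap_s=6e-6):
    """Level change (dB) produced by opening the gap between two clicks."""
    no_gap = single_click_spectrum(freq_hz, 2 * width_s)  # the clicks abut
    with_gap = double_click_spectrum(freq_hz, width_s, gap_s)
    return 20 * np.log10(with_gap / no_gap)

for f in (1_000, 10_000, 12_500, 15_000):
    print(f, round(float(gap_effect_db(f)), 2))
```

Under the rectangular-click assumption, a 6-microsecond gap changes the level by well under a tenth of a decibel at 1000 Hz but by roughly 1 dB or more between 10,000 and 15,000 Hz, which is consistent with the argument that the listener is detecting an intensity difference in that region and not the gap itself.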

The two-click procedure illustrated in 
Figure 1 avoids this problem, since the 
energy spectrum is the same for each pair; 
the only difference between the pairs is 
their phase spectrum. What we would 
like is a large set of stimuli, all with equal 
energy spectra, that differ only in their 
phase spectra. In particular, we would like 
to be able to program how the energy is 
ordered over the entire frequency range
so that we could delay one or another part
at will. Huffman (1962) has provided a
general means of accomplishing this requirement.
He has provided a method
for constructing a large set of waveforms
with the following important properties:
(a) the waveforms are zero outside an
interval T; (b) the energy spectra of all
waveforms are the same, only the phase
spectra differ. Thus all the waveforms in
a class generate the same energy at any
frequency; it is only the order in which
this energy occurs at various frequencies
that differs over the class. We call a
single waveform of this class a Huffman
sequence.
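
Property (b) can be illustrated with a stand-in that is explicitly not Huffman's (1962) construction: the impulse response of a second-order all-pass filter. Such a response has a flat energy spectrum whatever frequency its phase delay is centered on, although it is only approximately zero outside a finite interval, so property (a) holds only approximately. The sampling rate, pole radius, and function name below are assumptions made for this sketch.

```python
import numpy as np

FS = 10_000  # samples per second (assumed)

def allpass_impulse_response(delay_hz, radius=0.95, n_samples=1024):
    """Impulse response of a second-order all-pass section whose phase
    delay is concentrated near delay_hz (a stand-in, not Huffman's method)."""
    theta = 2 * np.pi * delay_hz / FS
    a1, a2 = -2 * radius * np.cos(theta), radius ** 2
    b0, b1, b2 = a2, a1, 1.0          # numerator = reversed denominator
    x = np.zeros(n_samples)
    x[0] = 1.0                        # unit impulse
    y = np.zeros(n_samples)
    for n in range(n_samples):
        x1 = x[n - 1] if n >= 1 else 0.0
        x2 = x[n - 2] if n >= 2 else 0.0
        y1 = y[n - 1] if n >= 1 else 0.0
        y2 = y[n - 2] if n >= 2 else 0.0
        y[n] = b0 * x[n] + b1 * x1 + b2 * x2 - a1 * y1 - a2 * y2
    return y

delayed_at_200 = allpass_impulse_response(200.0)
delayed_at_1000 = allpass_impulse_response(1000.0)

mag_200 = np.abs(np.fft.rfft(delayed_at_200))
mag_1000 = np.abs(np.fft.rfft(delayed_at_1000))
print(np.allclose(mag_200, mag_1000, atol=1e-6))
```

The two waveforms differ sample by sample, yet their magnitude spectra agree to within rounding, so any discrimination between them would have to rest on when the energy at each frequency arrives.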


EXPERIMENTS WITH HUFFMAN 
SEQUENCES 


Figure 2 shows two 10-msec. signals, 
both Huffman sequences and thus having 
the same energy spectra. The upper trace 
is the output of the earphone as measured 
into a 6-cubic-centimeter coupler; the 
lower trace is the electrical voltage applied 
to the earphones. The phase spectra of 
the two signals differ. The left waveform 
has a phase delay at 200 Hz.; the right 
waveform has a phase delay at 1000 Hz. 

The relative energy and phase spectra 
of these signals are shown in Figure 3. The 
energy spectra are identical for both signals 
and are marked “magnitude” in the figure. 
The falloff in magnitude is caused by quan- 
tizing the waveform in time. For all of the 
experiments, the signal was filtered to ex- 
clude this high frequency energy (marked 
in grey on the figure); thus all the signals 
will have an approximately rectangular 
spectrum. The phase spectra are shown 
below in the bottom portion of the figure. 
The difference in the phase spectra means 
that the energy associated with different 
frequencies arrives at different times. The 
ability to distinguish among these wave- 
forms means the ear is sensitive to these 
differences in the arrival time of energies
at different frequencies.
FIG. 3. Energy and phase spectrum of two Huffman sequences. (The curve,
labeled Magnitude, gives the energy spectrum in decibels (arbitrary reference)
of Huffman sequences generated at 3200 samples per second. The gray area
indicates that analog filtering was used to suppress all frequencies above half
the sample rate. Thus, the magnitude spectrum is essentially flat. The phase
curves of two different sequences are shown below, one with its singularity at
200 Hz., the other with its singularity at 1000 Hz. The two phase curves
correspond to waveforms a and b of the previous figure.)


Consider a short time-power spectrum 
of these signals such as that provided by a 
speech spectrogram. In the conventional
speech spectrogram, energy is coded by 
the darkness of the marking, and the co- 
ordinates of the graph are frequency 
(along the y axis) and time (along the x 
axis). Thus, a single sinusoid is represented 
by a horizontal line at the appropriate 
frequency and a brief burst of white noise 
by a dark vertical line. If a typical 
speech spectrogram were used to analyze 
the signals, then they would all be nearly 
identical, namely, thick dark vertical bars. 
This is true because the different spectra 
are nearly white and the signals are brief 
compared with the time constant of the 
speech analyzers, which is about 20-30 
msec. If the analyzer is modified by
changing the time constant,
say from 20 to 1 msec., then differences
in the signals would become more evident.
With these brief time constants, it would
be clearly seen that the signals differ in
the arrival time of energy at the different
frequencies. For example, Figure 4 shows
a short time-spectral analysis of the two
waveforms displayed in Figure 2. The
large spike at the beginning of the time
waveform corresponds to the large vertical
line at the left of the spectrogram. The
near simultaneous arrival of energy over a
wide range of frequencies causes the large
spike in the time domain. The energy at
one frequency, about 200 Hz. in Figure
4a and about 1000 Hz. in Figure 4b, is
delayed. If we listened to the two waveforms,
both would sound like a click, but
the waveform with the energy delayed
at 1000 Hz. would sound higher in pitch,
“tick,” than the waveform with delayed
energy at 200 Hz., “tock.”

The first experiments consisted of simply
shortening the total duration of these
signals until they could no longer be discriminated
in a paired-comparison test.
Subjects can achieve about 100% discrimination
on such waveforms if they are 10 msec.
in duration, they achieve chance
at 1 msec., and they are correct
about 75% of the time when the duration
is about 2 or 3 msec. The ability to distinguish
between pairs of Huffman sequences
also depends on other variables
such as the frequency separation between
the phase delays and the width, in frequency,
of delayed energy. The value of
2 msec. is an upper bound on the true
temporal acuity value because that number
is the total duration of the signal.
Distinctions among the waveforms must
be based on differences that occur within
this total duration.

FIG. 4. Time-frequency distribution of energy for
two waveforms. (The figure shows the locus of
energy as a function of time and frequency for two
Huffman sequences. The (a) graph is the distribution
for a Huffman sequence generated with a single
phase delay at 200 Hz., e.g., the waveform marked
a in Figure 2. The (b) graph corresponds to waveform
b of Figure 2. It has a phase delay at 1000 Hz.)

In addition to experiments on dis- 
criminating between pairs of such signals, 
Patterson and Green (1970) have also in- 
vestigated other properties of these signals. 
Since these transient signals have broad- 
band, noiselike spectra, the question of how 
well they mask sinusoidal signals was considered.
In these masking experiments, the
shape of the envelope of the waveforms
within the groups was systematically
varied. At one extreme, the envelope was
nearly uniform over the 10-msec. duration.
At the other extreme, the envelope peaked
near the onset and decayed rapidly. Thus,
in the latter case, most of the masker 
energy was concentrated in the first few 
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milliseconds of the observation interval. 
Three groups of maskers with envelopes 
intermediate between these extremes were 
also used. It should be kept in mind that 
all the waveforms in all groups had identical 
energy spectra. Thus, if masking is 
strongly influenced by the energy spectrum 
of the masker, we might expect all groups 
of maskers to mask sinusoidal signals 
equally well. This is clearly not the case. 
Figure 5 shows the threshold of a 1500-Hz., 
10-msec. signal for each of the five groups 
of maskers. The solid line connects the 
averages of the four observers. The 
abscissa of Figure 5 is marked "the number
of phase delays"; this scale is an indication
of the peak factor in the waveform. If
only a few delays are used, the waveform
has a high peak factor, such as Figure 2
which has only one phase delay. As the
number of phase delays increases, the
waveform is nearly uniform over the interval;
a waveform with 25 delays out of 50 possible delays
has a very uniform envelope pattern. As
can be seen, the maskers with the more 
uniform envelope produce considerably 
more masking than the waveform with 
the sharp spikes. Plotted on the edges of 
Figure 5 are the logical extremes of this 
continuum. To the left is the masking 
produced by a very brief click at the be- 
ginning of the 10-msec. interval. Such a 
masker produces little masking, although
its power spectrum is similar to that produced
by the Huffman signals. At the
extreme right is the masking produced by
random noise having the same average
power spectrum as the Huffman sequences.
The fact that a brief click and a random
noise sample fall in a natural continuation
of the results obtained with the Huffman
sequences indicates that the crucial masking
variable is the ability of the masking to
cover the interval of the signal nearly
evenly with energy. Signals that obscure
only part of the entire signal duration are
ineffective maskers. This result also
indicates that the ear is very quick and 
take advantage of these brief time 
hen the total signal duration 
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was equal that of the Huffm, 


While these results suggest significant differences
among Huffman sequences, there
is one situation in which the various
waveforms within the same class behave
quite similarly. This is the threshold for
detecting the presence of one of these
transient signals in broadband noise. All
the Huffman sequences within the same
class, that is, having identical energy
spectra and duration, have very similar
masked thresholds. Because they all have
similar detection thresholds, one can also
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detections as a function of signal-to-noise ratio. 
(The squares are percent correct discriminations of a
pair of waveforms with phase delays either at 400
Hz. or 600 Hz. The circles are percent correct
detections for the waveform with a 400-Hz. singularity.
The triangles are percent correct detections
for the waveform with singularity at 600 Hz.)


threshold value, it is possible to distinguish 
one sequence from another. Thus, the 
ability to distinguish Huffman sequences, 
although based on slight qualitative dif- 
ferences in the sounds of the two transients, 
is relatively robust and not easily masked. 

Furthermore, Ronken (1970) has shown 
that even when phase changes are an 
irrelevant cue in detecting a change in 
amplitude of two brief clicks, the ran- 
domization of this cue interferes with the 
discrimination of the intensity change,
indicating that this phase cue is difficult to
ignore.

Thus, to briefly summarize the results 
to this point, it has been established that 
(a) the ear can distinguish temporal events 
differing in time by only a few milliseconds,
(b) this phase discrimination is not
easily masked by noise, (c) differences in
the phase spectrum can influence the detection
of a sinusoid by as much as 30
decibels, and (d) phase differences are of
sufficient saliency to disrupt amplitude
discrimination when phase is a random but
irrelevant cue.


But what of the generality of these 
findings? Perhaps the ear shows this re- 
markable temporal acuity only when listen- 
ing to isolated, transient stimuli. How 
does this ability stand up when the acoustic 
events are not transient clicks but ongoing
sustained sounds?


TEMPORAL ACUITY WITH 
LONGER SIGNALS 


Work in this area is not complete, but 
all the preliminary results indicated that 
the ear’s temporal acuity is the same for 
sustained sound as it is for transients, and 
indeed this generalization holds for steady- 
state signals such as periodic tones. 

Leshowitz has determined that this re- 
markable temporal acuity is true for longer 
duration stimuli such as speechlike sounds. 
He used synthetic fricatives which are 
essentially noise bursts having energy 
throughout most of the audio spectrum. In 
normal speech, these bursts are generally 
about 20-50 msec. in duration. For his 
experiment, he selected stimuli of 50-msec. 
duration. Using a digital filter, he sub- 
jected these noise bursts to a pair of filter- 
ing operations. The first filtering was 
similar to that imposed by the acoustic 
resonance of the vocal system. The re- 
sulting waveform will be called a normal 
fricative. The second filtering was similar 
to the first except the impulse response was 
reversed in time. This produced another 
waveform, which will be called an abnormal 
fricative. The abnormal and normal 
fricatives are similar in many ways but
differ because of local time reversals. It is
something like taking a tape recording and
reversing small segments of the tape. The
intensity patterns within both sounds remain
the same and the distribution of
energy over frequency is the same for both
sounds; however, locally, time is reversed.
Technically, the operation produces the
same overall energy spectrum for the two
noise bursts and thus discrimination on the
basis of long-term energy cues is impossible.
Discrimination must be based on differences
in the temporal order within the stimuli.
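
The effect of reversing the impulse response can be sketched as follows. The resonance used here (a damped 1500-Hz. sinusoid lasting 5 msec.), the sampling rate, and the random noise burst are hypothetical stand-ins; the actual filters Leshowitz used are not specified above.

```python
import numpy as np

rng = np.random.default_rng(0)
FS = 10_000                              # samples per second (assumed)
noise = rng.standard_normal(FS // 20)    # a 50-msec. noise burst

# Hypothetical resonance: a damped 1500-Hz. sinusoid lasting 5 msec.
t = np.arange(FS // 200) / FS
impulse_response = np.exp(-t / 0.001) * np.sin(2 * np.pi * 1500 * t)

normal = np.convolve(noise, impulse_response)
abnormal = np.convolve(noise, impulse_response[::-1])  # time-reversed filter

energy_normal = np.abs(np.fft.rfft(normal)) ** 2
energy_abnormal = np.abs(np.fft.rfft(abnormal)) ** 2

print(np.allclose(energy_normal, energy_abnormal))  # same energy spectrum
print(np.allclose(normal, abnormal))                # different waveforms
```

Reversing a real impulse response leaves its magnitude response unchanged and alters only its phase, which is why the two filtered bursts share one long-term energy spectrum while differing in local temporal order.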


Using noise bursts of up to 50 msec. in 
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duration, Leshowitz’s su 


discriminate among them when the local 


time reversals are only 5 
5-msec. interval, 

parently discrimin 
and reliably 


forms that differ only in 
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is to return for 
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the signal on th 
s a phase delay 
at frequency occurs ii 
aveform. If one listen: 
waveform, one hears ; 


; the pitch is that of < 
pulse train at 100 Hz., and in addition,
another high-frequency pitch. The frequency
of this high-frequency pitch corresponds
to the out-of-phase component,
in this case the 1000-Hz. component.

msec. Within a
this manner,
nt to Huffman
play a 10-msec,

The first to observe this general phenomenon
was Manfred Schroeder, who
utilized this effect on a special organ at the
Bell Telephone Laboratories about 10
years ago. Schroeder rotated the phase of
the higher components of a complex
periodic wave and was thereby able to
play simple tunes. His work is mentioned
briefly in Pierce (1960).
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deduced by the following thought experi- 
ment. Suppose we arrange a set of cosine 
components, a fundamental, and all its 
harmonics, all having equal amplitude. 
If all the components are in cosine phase, 
then we produce a waveform such as 
shown in Figure 7 (top). The spectrum 
corresponding to that waveform is illus- 
trated in the right of the figure. If we 
look at the waveform in time, we find the
coherence of all the cosine waves produces
the large spike every T seconds. Between
the spikes, the various components nearly
cancel each other and little energy is
present. The distance between the spikes,
T, is the reciprocal of the fundamental of 
the spike train. 

Now consider the same set of cosine 
components, except one of the components 
has been phase shifted to a sine rather than 
cosine wave. In Figure 7 (bottom right), 
we illustrate the third component as ro- 
tated 90°. The waveform corresponding 
to this case is shown at the bottom left in 
Figure 7. The waveform is little changed 
from the previous, all-cosine wave. Since 
one cosine wave is not present, the peak
of the spike is reduced by one over the
total number of components, and the
phase shifted component (3f0) now appears
as a sine wave in the time between spike
trains. It is easy to see that at some
fundamental, f0, one must be able to hear
the phase shifted component. In the limit
as the fundamental is moved to lower and
lower frequencies, the time between spikes
increases and eventually the presence of
the out-of-phase component should be
evident. In effect, we are studying
the audibility of a sinusoidal component
masked by two bracketing clicks. If the
clicks are separated by sufficient time, one
second for example, then any masking
produced by the clicks should decrease to
such an extent that the phase-shifted
sinusoid would be evident.
ti ‘hus, the central question is a rcr 
P one: Under what conditons, h ift 
à damental frequency, what pe 
tom what frequency of the iin "s 
xe D ment, will the odd componer as 

*"Dtible? Again it is obvious that we 


can view this experiment as a measure of 
temporal acuity. How quickly can the 
ear recover from the effects of the clicks 
and discriminate details of the acoustic 
waveform between the clicks? Systematic 
measurements have recently been reported 
on this question in a brilliant paper by 
Duifhuis (1970). His analysis of these 
experiments is consistent with the pre- 
ceding arguments and he emphasizes how 
these measurements can be used to esti- 
mate the quality of the frequency analysis 
that precedes the temporal analysis. His 
later data (Duifhuis, 1971) suggest that 
the temporal resolution at low frequencies 
(500 Hz.) may not be as acute as that 
possible at high frequencies (2000 Hz.). 
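
The thought experiment above can be checked numerically. The following is a minimal sketch, assuming nine equal-amplitude harmonics of a 100-Hz fundamental; the function name, the number of components, and the fundamental are illustrative choices, not values taken from the experiments.

```python
import math


def harmonic_wave(t, f0, n_components, shifted=None):
    """Sum equal-amplitude harmonics of f0 at time t (seconds).

    Every harmonic is in cosine phase except harmonic number `shifted`
    (if given), which is rotated 90 degrees into sine phase.
    """
    total = 0.0
    for k in range(1, n_components + 1):
        angle = 2.0 * math.pi * k * f0 * t
        total += math.sin(angle) if k == shifted else math.cos(angle)
    return total


# At the spike (t = 0) the all-cosine wave reaches the full count of
# components; rotating one component removes its contribution to the peak.
peak_all_cosine = harmonic_wave(0.0, 100.0, 9)
peak_one_shifted = harmonic_wave(0.0, 100.0, 9, shifted=3)
```

Under these assumptions the peak falls from 9 to 8, a reduction of one over the total number of components, while the rotated third harmonic remains as a sine wave in the interval between spikes.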

It is also of interest to consider an 
amusing variation of this experiment. 
Suppose all components are exactly in 
cosine phase and the waveform is simply 
a series of spikes at the reciprocal of the 
fundamental frequency as shown at the 
top of Figure 7. If we decrease the ampli- 
tude of one of the components, then we are, 
in effect, adding that component to the 
pulse train exactly out of phase. Even- 
tually, as we decrease the component 
sufficiently, the sinusoidal oscillation be- 
tween the pulses reaches a level that we 
can hear. We are turning down a com- 
ponent so we can hear it. Thus, one can 
study both the threshold for presence of a 
component and the threshold for the 
absence of a component within the same 
experimental context.
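
The equivalence between turning a component down and adding it out of phase can be written out in one line. As a sketch, assume N equal-amplitude cosine components at harmonics of f_0, and let the amplitude of the k-th component be reduced from 1 to 1 - a (the symbols N, k, and a are introduced here only for illustration):

```latex
\begin{align*}
x(t) &= \sum_{\substack{n=1 \\ n \neq k}}^{N} \cos(2\pi n f_0 t)
        + (1 - a)\cos(2\pi k f_0 t) \\
     &= \sum_{n=1}^{N} \cos(2\pi n f_0 t)
        + a\cos(2\pi k f_0 t + \pi)
\end{align*}
```

The second line follows because -a cos(x) = a cos(x + pi): the attenuated wave is the original pulse train plus a sinusoid of amplitude a at the k-th harmonic, 180 degrees out of phase.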

Many parts of the preceding analysis 
have been rather superficial. The author 
has been referring to pictures of sinusoidal 
oscillations between two clicks and claiming 
that if you can see the oscillations then you 
must be able to hear them. Obviously the 
interest is in the exact parameters required 
to hear the phase-shifted components, and 
such measurements are presently being 
made. It is fairly easy to show that the ear
is more complicated than the simple arguments
concerning these pictures would indicate.
As we know, a limited frequency
analysis of the waveform is performed
mechanically in the cochlea. Thus, it is
possible to hear phase-shifted components
when their presence is far from obvious in
a wide-band picture of the waveform.
Figure 8 shows such a case.

The waveform shown in Figure 8 is a
periodic wave in which the phases of many
of the components have been scrambled,
yet over a certain range (1.4 to 2.8 kilohertz),
all of the [...]
except for one (2.0 kilohertz). [...]
of this one odd component [...]
if one listens to the waveform, but its
presence is not clear in a wideband picture
of the waveform. By studying the range
of components that must be coherent in
order to make the odd component still
detectable, we will gain further information
on the degree of frequency analysis that
precedes the temporal processes that have
been the main topic of this paper.

TEMPORAL ACUITY AND THE RESIDUE 


Throughout our studies of temporal 
acuity, I have been impressed that the 
limits on temporal discrimination and the 
limits on the perception of the residue are 
very similar. For those not in audition, 
I might simply explain that the residue is 
a pitch sensation presumably based on some 
temporal features of the waveform. It is
often illustrated by playing a complex
pulse train such as that [...]
Figure 7. One can then filter out all the
lower frequency components of the complex
wave and still the [...]
"pitch" remains. The existence of this
residue pitch is frequently cited as evidence
against a purely resonance or place theory
of pitch perception. The range of the
residue is limited; it becomes more and
more difficult to hear as the frequency is
increased. The upper frequency limit is
usually claimed to be about 1000 Hz.

The most direct estimates of this upper
frequency limit have been carried out using
an experimental procedure first employed
by Miller and Taylor (1948). They
interrupted white noise at various rates and
determined if subjects could set an oscillator
at the frequency of the interruption. Since
the long-term spectrum of the noise is flat
whether interrupted or not, the basis of
the subject judgments must be the temporal
rather than spectral characteristics of the
stimulus. Miller and Taylor found that
subjects would make reliable judgments of
the repetition rate up to 500-1000 Hz.
Several replications of this experiment have
been carried out with the same basic
result (Harris, 1963; Pollack, 1969),
although some subjects seemed to make
accurate pitch matches up to interruption
rates of 2000 Hz.
It seems likely that the studies of temporal
acuity are, in effect, studies of how
quickly the peripheral nervous system can
respond to changes in the auditory waveforms.
These temporal limitations place an
upper bound on the frequency that can
be utilized by the temporal pitch system.
As long as the time T between epochs of
the periodic wave is larger than 1 or 2
msec. (500 to 1000 Hz.), then the ear can
follow details of the waveform and a residue
pitch can be heard. For waveforms with
smaller periods, that is, higher frequencies,
the ear cannot follow details of the waveform
and is forced to utilize spectral
information to determine pitch.
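
The correspondence between the temporal limit and the frequency limit is simply a reciprocal. A minimal sketch of the arithmetic follows; the function name is illustrative.

```python
def upper_frequency_limit_hz(resolution_msec):
    """Repetition rate (Hz) whose period T equals the temporal resolution.

    A periodic wave repeating faster than this has a period shorter than
    the stated resolution, so its temporal detail could not be followed.
    """
    return 1000.0 / resolution_msec


limit_for_1_msec = upper_frequency_limit_hz(1.0)
limit_for_2_msec = upper_frequency_limit_hz(2.0)
```

A resolution of 1 msec. corresponds to 1000 Hz. and a resolution of 2 msec. to 500 Hz., which is the range quoted for the upper limit of the residue.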

A series of related experiments dealing
with auditory temporal acuity has been
described. This series of studies deals with
the shortest times during which the ear
can distinguish the order of temporal
events. In all these experiments, the
spectral energy distribution, computed over
some longer period of time, cannot be the
basis for the discrimination. The summary
is quite simple. The ear can resolve events
within 1 or 2 msec. This resolution is true
for transients, for brief 50-msec. noise
bursts, and for continuous periodic stimuli.
This temporal parameter apparently imposes
an upper limit on the frequency that
can be utilized by the temporal pitch
mechanism.

RANDOMIZATION


A REPLY TO WIESEN


D. M. WARBURTON ¹
Reading University, England

A reader of the Wiesen correction of an error in a paper by Warburton
and Greeno is likely to form a gross misconception of the factors motivating
the shape function analysis and of the problem of randomization in the
construction of small groups.


Wiesen (1971) is certainly correct in pointing
out an error in the calculation of the
probabilities in our example illustrating the dangers
of blind randomization (Warburton & Greeno,
1970), but he himself is wrong in his
combinatorial arithmetic, in his views on the
justification of the shape function model, and in his
ideas on randomization in general.

The correct answer to our example is p = .01
(actually .00994), the probability of obtaining
7 of one kind from a pool of 16 subjects,
consisting of equal numbers of "bright" and "dull"
rats. This value can be derived from the general
formula of the hypergeometric distribution
which is appropriate for situations where the
sampling is done without replacement. Wiesen's
calculations were based on a sampling with
replacement model which just shows how easy
it is to misapply elementary probability theory,
to use Wiesen's words.
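
The hypergeometric value can be reproduced with a few lines of arithmetic. The sketch below assumes, since the text does not state it, that the group drawn contains 8 of the 16 rats and that "7 of one kind" means exactly 7 of either kind; the function and parameter names are illustrative.

```python
from math import comb


def prob_seven_of_one_kind(per_kind=8, group_size=8, majority=7):
    """Probability that a group drawn without replacement from a pool of
    `per_kind` bright and `per_kind` dull rats contains exactly
    `majority` animals of either kind (hypergeometric, both tails)."""
    all_groups = comb(2 * per_kind, group_size)
    one_kind = comb(per_kind, majority) * comb(per_kind, group_size - majority)
    return 2 * one_kind / all_groups


p = prob_seven_of_one_kind()
```

Under these assumptions the count is 2 x 8 x 8 = 128 favorable groups out of 12,870, or about .0099, in agreement with the value given above.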

Wiesen also creates a false impression that 
the shape function model was justified by this 
erroneous example alone, whereas the problem 
of randomization in small group designs was a 
fairly minor consideration motivating the shape 
function analysis. The major justification was 
provided by the comparison of the traditional 
statistics used to describe acquisition perform- 
ance and the more precise shape function 
analysis. The increased precision in descrip- 
tion prevented the occurrence of both Type I 


and Type II errors and the resulting erroneous
interpretation of treatment effects.

Finally, he is mistaken in inferring that we
argue against using randomization. Randomization
is useful for increasing accuracy by
removing unsuspected biases and, of course, for
making one's data reasonably convincing to
other people as well as oneself (Cox, 1962).
However, these objectives can only be achieved
if the experimental arrangements produced by
randomization are unsystematic. On this point,
Cox (1962) stated "randomization is very [...]
in large and moderate-sized experiments, [but]
not much good in very small single experiments
[p. 90]." However, the shape function [model]
can help to guard against any unusual aspects of
the experimental arrangements even in small
experiments and also provides a means of
correcting for the subject's initial response level.
Obviously, if there were marked unexplained
differences in the initial response level of one
group of subjects, the inferences drawn from
the corrected data would have to be interpreted
cautiously.

¹ Grateful acknowledgment is given to Bernard
Singer and Michel Treisman for their important
suggestions during the preparation of this paper.

Requests for reprints should be sent to D. M.
Warburton, Department of Psychology, University
of Reading, Building 3, Earley Gate, Whiteknights,
Reading RG6 2AL, England.

EBBINGHAUS' DERIVED-LIST EXPERIMENTS RECONSIDERED ¹


WAYNE SHEBILSKE AND SHELDON M. EBENHOLTZ ²


University of Wisconsin, Madison


The results obtained by H. Ebbinghaus in his derived-list paradigm are at
odds with inferences from interference theory and are contrary to results
obtained in modern derived-list experiments. This paper challenged the
contention of D. T. Hakes and R. K. Young that Ebbinghaus used an
inappropriate control condition as well as their conclusion that his results
might represent various degrees of negative transfer. N. T. Slamecka's
contention that Ebbinghaus' results were due to his knowledge of the
transformations also was challenged. Possible causes of this unresolved discrepant
finding were discussed in the context of the unique features of Ebbinghaus'
procedure.

Ebbinghaus (1913) devised the derived-list
paradigm to test two alternative explanations
of serial learning. Both hypotheses explained
serial learning in terms of interitem associations,
but, according to one, associations are
formed only between adjacent items with the
possible connection between nonadjacent items
[prevented] by the presence of intermediate
items and, according to the other, all the items
are bound together, but the strength of the
association is a decreasing function of the
number of intervening items. In his experiments,
Ebbinghaus learned a first list and 24
hours later a second, derived list. The latter
was formed so [that adjacent items had been]
separated in the first list by [1, 2, 3, or]
7 intervening members. Ebbinghaus
reasoned that if no associative strength was
built up between nonadjacent syllables, then
no savings would result in the derived list, and
if such associations were present, then savings
would result. He did not entertain a conception
of associative interference, that is, that having
the old association might inhibit the learning
of a new association in the derived list. Given
our contemporary view of interference, as
Hakes and Young (1966) have noted, the
first-list associations would be a source of
powerful interference in learning the derived
list, and therefore both the theory of direct
associations and the theory of remote associations
would predict negative transfer for the
derived-list paradigm. Although more recent
studies using derived lists have found negative
transfer (Hakes, James, & Young, 1964;
Young, Hakes, & Hicks, 1965), the fact is that
Ebbinghaus obtained positive transfer. Using
the difference in time to learn ³ the original and
derived lists, he reported savings, averaged
over two experiments, of 137, 89, 73, and 42
seconds for derived lists [with 1, 2, 3, and 7]
intermediate syllables skipped, and a savings
of only 6 seconds for a specially chosen control
list designed to determine the amount of savings
attributable to increased item familiarity.

¹ This paper was prepared while the first author
held a National Science Foundation Traineeship
under Grant GZ-1754.

² Requests for reprints should be sent to Sheldon M.
Ebenholtz, Department of Psychology, University of
Wisconsin, Madison, Wisconsin 53706.

³ In the study Ebbinghaus used total time to learn
rather than the number of list repetitions as his
dependent measure. In his various experiments, he
used both kinds of measures and found they were
related enough so that for most purposes it was
immaterial which was used. Precautions were taken
to increase the accuracy of the time-to-learn measure.
For example, he read at a constant rate paced
by the ticking of a watch, and he allowed very little
delay between study and test. On this latter point,
Ebbinghaus (1913) wrote, "There was a perfectly
free interchange between the reading and the
occasionally necessary tests of the capacity to
reproduce by heart. For the latter there was an
important rule to the effect that upon hesitation the
rest of the series was to be read through to the end
before beginning it again [p. 24]."

Hakes and Young (1966) attempted to
explain this apparent inconsistency as follows:

Since Irion (1946) has shown that the random-order
condition (Ebbinghaus' control condition) yields
negative transfer, Ebbinghaus' findings cannot be
considered to show positive transfer: The savings
in the derived-list conditions were greater than for
a negative-transfer condition and might well represent
decreased negative transfer rather than positive
transfer [p. 249].

The authors argued that whether positive
transfer was obtained by Ebbinghaus is
indeterminate since Ebbinghaus did not use the
appropriate control. The implicit assumption
here is that if Ebbinghaus had compared his 
derived-list conditions to a neutral C-D control 
list, that is a second list composed of new items 
instead of the original list, he would have 
found negative transfer as Young et al. (1965) 
did when they used the C-D control. The C-D 
list is necessary to control for warm-up and 
learning-to-learn effects when naive subjects 
are used. However, one cannot assume that 
Ebbinghaus, who learned many lists per day, 
benefited from either of these nonspecific 
transfer effects (Postman, 1968). For Ebbing- 
haus, a new C-D control list and the initial list 
learned 24 hours apart probably would have 
been learned in the same amount of time, on 
the average. Therefore, Ebbinghaus’ measure 
of savings was appropriate and his report of 
positive transfer in derived lists stands as valid. 
The discrepancy between Ebbinghaus’ and 
Irion's results which was referred to by Hakes 
and Young (1966) might easily be explained
by noting that Irion's random-order condition
was not the same as Ebbinghaus' control.
Irion's random list was a random rearrangement
of the members of the first list. Ebbinghaus'
random control, on the other hand, was
not a reshuffling of any one list, but each 
randomly derived list was made up of items 
which had previously been members of different 
original lists. Ebbinghaus learned six 16-item 
lists for each type of derived list. To make up 
his control condition, he maintained the same 
first and last items and sampled randomly 
from the remaining pool of 84 items to con- 
struct six 16-item control series. This pro- 
cedure minimized the effects of remote 
associations while, at the same time, it served 
the purpose of equating control and experi- 
mental series for item familiarity as well as for 
the repetition of the initial and final syllables 
in the original and derived lists. Since Ebbing- 
haus’ control list was not the same as Irion’s 
and since Ebbinghaus' measure of savings was 
appropriate, the inconsistency between the 
savings found by Ebbinghaus and the negative 
transfer found by Hakes et al. (1964) and by
Young et al. (1965) remains in need of
explanation.
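
The derived-list construction discussed above can be made concrete with a short sketch. It implements one simple reading of the rule (take every item that lies `skip` items further on, then start again from the next unused item) and is offered as an illustration only; it is not claimed to reproduce Ebbinghaus' actual series, and the function name and the use of the numbers 1 to 16 in place of syllables are assumptions.

```python
def derived_list(items, skip):
    """Reorder `items` so that, within each pass through the list,
    neighbouring items were separated by `skip` items originally."""
    step = skip + 1
    return [
        items[i]
        for start in range(step)
        for i in range(start, len(items), step)
    ]


original = list(range(1, 17))          # positions 1..16 stand in for syllables
first_order = derived_list(original, 1)
seventh_order = derived_list(original, 7)
```

With one item skipped the list runs through the odd positions and then the even positions; with seven skipped it begins 1, 9, 2, 10, and so on.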
The theoretical significance of this inconsistency
has not yet been determined, but
because the inference of negative transfer is 
clearly required by interference theory, it seems 
worth considering possible causes of the dis- 
crepant results. Among these are the unique 
features of Ebbinghaus' procedures, some of
which have been pointed out by Hakes et al.
(1964) and Slamecka (1964): (a) Ebbinghaus
learned by the method of whole presentation
and may have formed some remote associations
directly by inadvertently attending to
nonadjacent items. This explanation seems quite
unlikely in view of the great care that Ebbinghaus
took in controlling his reading rate which
closely approximated .4 second per syllable.
This rapid presentation rate, incidentally, has
not been duplicated in contemporary studies of
derived lists. (b) Ebbinghaus read the list in
a regular rhythmical manner. To some extent
this rhythmical organization may have
contributed to the overall savings, but an examination
of the pattern of stressed syllables that he
reported revealed no differential effects that
would favor lower over higher order derived
lists. (c) Ebbinghaus served as the sole
subject and therefore knew the types of derived
lists in the experiment. Slamecka (1964)
contended that this is the most likely
explanation of Ebbinghaus' results. To support
his contention, Slamecka (1964) argued that
knowledge of the derived lists is sufficient to
produce savings. He found savings for a group
who knew the nature of the derived list and no
savings for a group that did not. However,
besides telling the subjects the nature of the
derived list, he also numbered the items
successively in the original list and retained the same
numbers in the derived list. The presence of
these numbers may have allowed subjects to
form direct associations between the numerals
and the syllables. Knowing the pattern of
the numerals, subjects may have provided
their own stimuli and transformed the derived
list into a paired-associate task. Therefore,
Slamecka's (1964) conclusions must be
tempered by this fact and also by the fact that
Goldstein (1950) told her subjects the nature
of a first-order derived list immediately before
it was presented and yet the derived list did
not show significantly more savings than a
scrambled list. Even more importantly, it
should be noted that Ebbinghaus himself was
aware of this criticism and ran a second experiment
in which he prepared the list far enough
in advance of running so that he felt successful
in eliminating the knowledge of the kind of
transformations for any particular list. The
savings in this second experiment were only
slightly reduced and were comparable to the
first. In view of Ebbinghaus' record as a
careful experimenter, it seems unlikely that his
results can be attributed to this artifact.

There are three additional major differences
LETTERS TO THE EDITOR


INTERDEPENDENCE OF STEVENS' EXPONENTS AND [...]

Teghtsoonian (1971) presented data which
[indicate that the subjective] counterpart to the
Weber fraction is a constant independent of
sensory modality. He suggested, on the basis
of this, that all physical dimensions are mapped
into a common continuum of sensory intensity.
He also pointed out (see p. 78) that his data
are consistent with Fechner's original idea
about scaling, namely, that stimuli equally
discriminable from matched stimuli will also
be matched, the statement of which need not
require any assumption that equal just noticeable
difference (jnd) units constitute subjectively
equal intervals. Fechner's assumption,
stated in terms of a cross-modality matching
experiment, reads this way: If stimuli $I_i'$ and
$I_j'$, drawn from modalities $i$ and $j$, are equally
discriminable from matched stimuli $I_i$ and $I_j$,
also drawn from modalities $i$ and $j$, then
stimuli $I_i'$ and $I_j'$ will be matched also. The
relationship between Stevens' exponents and
discriminability measures which Teghtsoonian
discovered may be derived from this statement
in the following formal way:

Suppose $I_i'$ and $I_j'$, equally discriminable
from $I_i$ and $I_j$, are $n$ jnd's from $I_i$ and $I_j$. It
follows that

$$I_i' = I_i (1 + W_i)^n, \qquad I_j' = I_j (1 + W_j)^n \tag{1}$$

where $W_i$ and $W_j$ are the Weber fractions for
modalities $i$ and $j$.

Equation 1 implies that

$$\frac{\log(I_i'/I_i)}{\log(I_j'/I_j)} = \frac{\log(1 + W_i)}{\log(1 + W_j)}. \tag{2}$$

Equation 2 enables us to derive the
interrelationship between Stevens' law exponents
and Weber fractions, for if

$$s(I) = A I^{B} \tag{3}$$

where $s(I)$ is the sensation magnitude of a
stimulus of intensity $I$, then $I_i'$ and $I_j'$ will
be matched whenever

$$(I_i'/I_i)^{B_i} = (I_j'/I_j)^{B_j}. \tag{4}$$

Comparing Equation 4 with Equation 2 reveals
the interdependence between Stevens' law
exponents and Weber fractions, namely, that

$$B_i \log(1 + W_i) = K \tag{5}$$

where $B_i$ and $W_i$ are the Stevens' law exponent
and Weber fraction for modality $i$, and $K$ is a
constant which depends on the reference
modality used to determine the Stevens' law
exponent.

REFERENCE

TEGHTSOONIAN, R. On the exponents in Stevens'
law and the constant in Ekman's law. Psychological
Review, 1971, 78, 71-80.

(Received June 1971)

Carl Auerbach
Department of Psychology
Yeshiva University
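
The relation in Equation 5 can be checked numerically. The sketch below uses made-up Weber fractions and an arbitrary constant K; none of these numbers come from Teghtsoonian's data, and the function names are illustrative. It assigns each modality the exponent required by Equation 5 and confirms that stimuli n jnd's above matched stimuli produce equal sensation ratios, as Equation 4 requires.

```python
import math


def exponent_from_weber(weber_fraction, k):
    """Stevens exponent B satisfying B * log(1 + W) = K (Equation 5)."""
    return k / math.log(1.0 + weber_fraction)


def sensation_ratio(weber_fraction, exponent, n_jnds):
    """(I'/I)**B for a stimulus n jnd's above I, using I'/I = (1 + W)**n."""
    return ((1.0 + weber_fraction) ** n_jnds) ** exponent


K = 0.05                    # hypothetical constant
W_I, W_J = 0.02, 0.30       # hypothetical Weber fractions for two modalities
B_I = exponent_from_weber(W_I, K)
B_J = exponent_from_weber(W_J, K)

ratio_i = sensation_ratio(W_I, B_I, n_jnds=10)
ratio_j = sensation_ratio(W_J, B_J, n_jnds=10)
```

Both ratios equal exp(nK), so the modality with the smaller Weber fraction carries the larger exponent.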
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Three structural models are described 
which have been applied to communication, recall,
and attribution of responsibility, respectively.
The models are stated in terms of graph theory
and are used to represent cognitive
structures which may be of two types,
abstract and perceived. The abstract
structures involv[e] [...]
and are conceived as summaries or abstractions
[...] [in]ternally consistent theories about what
ought to be the case. [Per]ceived structures,
however, may be balanced or unbal[anced] [...]
[co]ordinated to the immediate situation. [...]
perceived structure and the corresponding
bala[nced] [...] as a condition for cognitive
adjustment, [...] [pos]sible ways of expanding
the three models are discussed. [...]
theoretical issues are then considered
particularly in re[lation to] [...] of change and
organization in abstract structures. [...]
[ap]proaches suggesting the emergence of a
general para[digm] [...]ed in the three [...]


[...] are in psychology to discover paradigms
that span seemingly diverse areas.
Much of the thrust of model building in recent
years has involved a turning away from
general principles applicable to a wide range
of problems to more specialized miniature
systems where the limits of application are
clearly defined. This strategy of narrowing
the focus is certainly not to be condemned.
It is probably necessary to concentrate on
well-defined subareas of a field if mathematical
models are to achieve any degree of
sophistication and if they are to enable precise
prediction. But one can sometimes be
blinded to the existence of general principles
by working within a narrow confine. The
gains from intricate formalization may be
attended by a premature insulation of the
worker from other areas of inquiry which he
sees as unrelated to his own. Every so often
one needs to take the wide view, for surely
one of the goals of science is to search for
principles that are sufficiently general that
they illuminate and knit together a number
of separate areas.

The author has taken the wide view on a 
previous occasion (Feather, 1959) where 
models from quite diverse areas of psychol- 
ogy such as mathematical decision theory, 
social learning, risk taking, performance, and 
level of aspiration were shown to involve 
basically similar concepts (expectations and 
values). This expectancy-value paradigm 
has since been applied to other areas including
further aspects of achievement motivation
(Atkinson & Feather, 1966) and
information-seeking behavior (Feather,
1967a). It is a way of approach that has
wide relevance when the analysis of overt
goal-directed behavior is under scrutiny.

The present paper is an attempt to take 
the wide view again. In this case, however, 
the general paradigm to emerge is one that 
concerns cognitive as well as behavioral ad- 
justments. Three models are discussed that 
have been developed to account for the ef- 
fects of communication, the process of re- 
call, and the nature of attribution of responsi- 
bility for success or failure at a task. On the 
surface, these three areas appear quite dis- 
tinct, but it is shown that models involving 
the same general principles may be applied 
to them all and that the paradigm that 
emerges is one that deals with adjustment to 
discrepancy between structures and the need 
for a person to achieve a relatively stable 
view of his world under conditions of vary- 
ing information input. 


ABSTRACT AND PERCEIVED STRUCTURES 


The three models to be described have 
been developed by the author over the past 
few years. Their development has been 
guided by a basic conviction that an adequate 
treatment of cognition must come to grips 
with the problem of structure. Two types 
of structure may be distinguished, namely, 
abstract structure and perceived structure. 
The distinction between these two types of 
structure becomes apparent in subsequent 
discussion in the context of each of the three
models, but in general the abstract structure 
refers to the residue of experience, to an 
abstract or summary of past encounters, 
whereas the perceived structure relates to the 
immediate situation. It is assumed that ab- 
stract cognitive structures develop as the 
person copes with the influx of information 
from the external world, that these abstract 
structures must be organized and relatively 
stable to provide continuity and meaning un- 
der changing circumstances, but that they are 
susceptible to change as the person matures 
and as new and discrepant information is re- 
ceived that cannot readily be interpreted in 
terms of existing abstract structures. These
ideas are not new. They are basic to the
work of eminent theorists such as Piaget,
Bartlett, Tolman, and Bruner. Abstract
structures are general schemata that the person
carries with him from situation to situation.
In contrast, the perceived structure
refers to the way information provided by the
immediate situation is organized. This type 
of structure, therefore, is dependent on the 
immediate situational context and does not 
have the stable, transituational character of 
an abstract structure. 

The models to be described assume that 
abstract structures and perceived structures 
may be defined in terms of the elements they 
contain and the relations between these ele- 
ments. All three models involve both per- 
sonal and impersonal elements; that is, they 
apply to the person and his relation to
information of a personal [or impersonal]
kind. [...] any two [...]
a structure, one of which must [...]
the [unit] relation refers to
association between two elements of a structure.
These elements may refer to either
personal or impersonal entities or both. A
positive unit relation is taken to imply positive
association or linking together of elements,
a negative unit relation is taken to
imply dissociation (not just a neutral lack
of association, but an active pushing apart or
opposition to unit formation between elements).
The negative unit relation, therefore,
is not the same as a null relation which
would imply simple lack or absence of association.
The negative unit relation carries
with it the definite implication of a resistance
to association. These two types of relation
(attitudinal and unit) are derived from
Heider (1958). The attitudinal relation
corresponds to Heider's sentiment relation,
involving feeling and evaluation. In the
Heiderian analysis, the positive unit relation
has been applied to dyads where entities are
related through similarity, causality, ownership,
or other unit-forming characteristics,
but Heider has little to say about negative
unit relations. Indeed he does not distin- 
guish clearly between a negative unit relation 
and the absence of a relation. 

The three models to be described there- 
fore involve abstract structures and per- 
ceived structures defined in terms of unit 
and/or attitudinal relations between ele- 
ments. It is evident that the processes of 
association and evaluation are held to be 
basic in the formation of these structures. 
The person is seen as an active processor of 
information, associating separate units of in- 
formation on the basis of similarity, causality, 
contiguity, ownership, etc., and evaluating 
information along a good-bad dimension. It 
is further assumed in each of the three 
models that the relatively stable abstract 
structure tends to be organized according to 
Heider's principle of balance but that the 
perceived structure may be balanced or un- 
balanced depending on the information given. 
Although Heider's balance principle is gen- 
erally familiar, it will be explained briefly in 
the following section since it is basic to the 
understanding of the general theory involved
in the three models to be discussed.


STRUCTURAL BALANCE 


Heider defines his principle of balance in 
terms of a tendency for unit (U or not U) 
relations and sentiment (L or DL) relations 
to fit together harmoniously. The conditions 
for balance are defined as follows: 


A dyad is balanced if the relations between the
two entities are all positive (L and U) or all negative
(DL and not U). Disharmony results when
relations of different sign character exist. A triad
is balanced when all three of the relations are positive
or when two of the relations are negative and
one is positive. Imbalance occurs when two of the
relations are positive and one is negative. The
case of three negative relations is somewhat ambiguous
. . . [Heider, 1958, pp. 202-203].


If one considers as entities person (P), 
other person (O), and impersonal object 
(X), the following are examples of balanced 
states: P likes something he made (pLx,
pUx) and P likes what his friend likes (pLo,
pLx). Some examples of unbalanced
states are as follows: P's friend dislikes him
(pLo, oDLp) and P likes what his friend
dislikes (pLx, pLo, oDLx). Heider assumes
that states of balance are preferred
over disharmony, that new relations may be 
induced by existing relations so as to pro- 
duce a balanced state, that where relations 
are not in balance there will be a tendency 
for them to change so that a balanced con- 
figuration is achieved, and that if such a 
change is not possible, the state of imbalance 
will produce tension. Thus the Heiderian 
view is of structures containing relations that 
are dynamically interdependent, moving to- 
ward harmonious organization. 
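
Heider's conditions for dyads and triads, as quoted above, can be written out as a small classification rule. The following is only an illustrative sketch: the function names and the coding of positive and negative relations as +1 and -1 are assumptions introduced here, not notation from Heider or from the present paper.

```python
def triad_state(signs):
    """Classify a triad from the signs (+1 or -1) of its three relations."""
    signs = tuple(signs)
    if len(signs) != 3 or any(s not in (1, -1) for s in signs):
        raise ValueError("expected three signs, each +1 or -1")
    negatives = sum(1 for s in signs if s == -1)
    if negatives == 3:
        # Heider treats the case of three negative relations as ambiguous.
        return "ambiguous"
    # Balanced: all positive, or two negative and one positive.
    return "balanced" if negatives in (0, 2) else "unbalanced"


def dyad_state(signs):
    """A dyad is balanced only when all of its relations have the same sign."""
    signs = tuple(signs)
    if not signs or any(s not in (1, -1) for s in signs):
        raise ValueError("expected one or more signs, each +1 or -1")
    return "balanced" if len(set(signs)) == 1 else "unbalanced"


# P likes what his friend dislikes: two positive relations, one negative.
print(triad_state((1, 1, -1)))
# P likes something he made: a positive sentiment and a positive unit relation.
print(dyad_state((1, 1)))
```

Keeping the three-negative case as a separate outcome follows the quoted passage; a pure product-of-signs rule would instead count that case as unbalanced.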


DISCREPANCY BETWEEN ABSTRACT 
STRUCTURE AND PERCEIVED STRUCTURE 


In the three models to be discussed, it is 
assumed that the abstract structure based on 
previous experience is balanced. The per- 
ceived structure may be balanced or unbal- 
anced according to the information pre- 
sented. When both the perceived structure 
and the corresponding abstract structure are 
balanced there is no pressure to change, that 
is, no cognitive adjustment is necessary. 
But when the perceived structure is un- 
balanced, then a structural discrepancy exists 
between the perceived structure and the 
underlying abstract structure and there will 
be pressure to resolve the discrepancy. This 
resolution may involve a change in the per- 
ceived structure in the direction of the cor- 
responding abstract structure. The present 
paper focuses on such cognitive adjustments. 
In other cases, however, direct behavioral 
adjustments may occur in an attempt either 
to obtain more information or to escape from 
the discrepant situation. And in still other 
cases, where the structural discrepancy per- 
sists, changes in the underlying abstract 
structure may occur. It is assumed that the 
resolution of structural discrepancy between 
perceived and abstract structures is neces- 
sary if the individual is to achieve a rela- 
tively stable view of his physical and social 
world. It is also assumed that the perceived 
structure relating to the immediate situation 
is more amenable to change than the abstract 
structure which is grounded in the indi- 
vidual's past experience, though the abstract 
structure may itself change as new and dis-
crepant information is continually experi-
enced. Finally, since it is probably easier to 
distort social reality than to distort physical 
reality, changes in perceived and abstract 
structures keyed to social reality may be 
more likely to occur than changes in these 
structures when they are keyed to physical 
reality. 


GRAPH THEORY 


In the models to be described, abstract 
structures and perceived structures are rep- 
resented in terms of the mathematics of 
linear graphs (Cartwright & Harary, 1956; 
Harary, Norman, & Cartwright, 1965). 
One can map elements and relations into 
points and lines of a linear graph and define 
the overall balance within the system. Fig- 
ure 1 presents a number of structures, half 
of which are balanced and half of which are 
unbalanced according to Heider's principle. 
In these structures, a positive attitudinal rela-
tion is represented by a solid line and a
negative attitudinal relation by a dashed line;
a positive unit relation is represented by a
solid bracket and a negative unit relation by
a dashed bracket. An arrow on a line indi-
cates the direction of an attitudinal relation.
Thus Figure 1 (a) represents a situation in
which person (P) likes other person (O),
and both like impersonal object (X). This 
is the familiar balanced triad in which a per- 
son likes what his friend likes. 

The structures in Figure 1 are called
signed digraphs (s digraphs), since lines in
a structure may have both sign (positive or
negative) and direction. In Figure 1,
Structures a, b, c, d, e, and f are balanced,
and Structures g, h, i, j, k, and l are unbal-
anced. Also in Figure 1, s digraphs a, c, f, g,
h, and l involve only the one type of relation,
in each case an attitudinal relation. They 
are called s digraphs of Type 1. The re- 
maining s digraphs involve both attitudinal 
and unit relations, that is, two types of rela- 
tion, and are called s digraphs of Type 2. 
Each of the s digraphs in Figure 1 contains a 
semicycle which is defined by Cartwright and 
Harary (1956) as “a collection of lines ob- 
tained by taking exactly one from each pair 
AB or BA, BC or CB, ..., DE or ED, and 
EA or AE [p. 283].” Each semicycle has a 
sign which is defined as the product of the 
signs of its lines. For example, the semicycle
PO, OX, XP is positive in Figure 1 (a) but
negative in Figure 1 (g). Cartwright and
Harary have generalized Heider's principle
by defining an s digraph as completely bal-
anced when the signs of all the semicycles it
contains are positive (see also Harary et al.,
1965). It can be observed in Figure 1 that
each of the balanced s digraphs involves one
positive semicycle and each of the unbalanced
s digraphs involves one negative semicycle.
In more complex structures, there could be a
number of semicycles all of which must be
positive if the entire structure is balanced.

Fig. 2. Signed digraphs for the communication model representing eight completely balanced structures.

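
The criterion of complete balance (every semicycle has a positive sign) can be checked mechanically. The sketch below does not enumerate semicycles; it uses the equivalent two-set test from Harary's structure theorem, under which a signed graph is balanced exactly when its points can be split into two sets with positive lines inside each set and negative lines between them. The function name and the encoding of a structure as (point, point, sign) triples are assumptions made for illustration, and the direction of each line is ignored, as it is in the definition of a semicycle.

```python
from collections import defaultdict, deque


def is_completely_balanced(relations):
    """relations: iterable of (a, b, sign) with sign +1 or -1; direction is ignored."""
    adjacent = defaultdict(list)
    for a, b, sign in relations:
        adjacent[a].append((b, sign))
        adjacent[b].append((a, sign))

    side = {}  # each point is placed in set +1 or set -1
    for start in adjacent:
        if start in side:
            continue
        side[start] = 1
        queue = deque([start])
        while queue:
            point = queue.popleft()
            for other, sign in adjacent[point]:
                expected = side[point] * sign
                if other not in side:
                    side[other] = expected
                    queue.append(other)
                elif side[other] != expected:
                    # Two routes to the same point disagree, so some
                    # semicycle has a negative product of signs.
                    return False
    return True


# P likes O, and both like X (the situation described for Figure 1 (a)).
print(is_completely_balanced([("P", "O", 1), ("O", "X", 1), ("P", "X", 1)]))
# The same triad with one relation made negative.
print(is_completely_balanced([("P", "O", 1), ("O", "X", -1), ("P", "X", 1)]))
```

Because several lines may join the same pair of points (for instance a unit and an attitudinal relation), the test also requires such parallel lines to agree in sign, which matches the condition for a balanced dyad.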
The three models will now be presented. 
Since each has been described in detail be- 
fore, along with the research that it has 
generated (Feather, 1964, 1967b, 1969a, 
1969b), the present statement will be re- 
stricted to a short description of each model 
followed by a general comparison of the
models.


THE THREE MODELS


The Structural Balance Model 
of Communication Effects 


The structural balance model of communi- 
cation effects is presented in Figure 2. Each 
s digraph in Figure 2 is coordinated to a
balanced structure in which the elements
refer to a source or communicator (S) who
is presenting a communication (C) about an
issue (I) to a receiver or communicatee (R). 
Relations within the structures are unit and 
attitudinal and can be positive or negative. 
The elements of the structure are coordinated 
to points of the s digraph and the unit and 
attitudinal relations are represented by using 
positive and negative brackets and lines as 
described in the previous section. The ar- 
rows on the lines describe the direction of 
each attitudinal relation. It will be observed 
that all semicycles are positive, that is, in 
balance. 

The s digraphs in Figure 2 may be co- 
ordinated both to balanced abstract struc- 
tures and balanced perceived structures. An 
example of a balanced perceived structure is 
one corresponding to an objective situation 
in which a person presents a communication 
about some issue to his friend, and both his 
attitude and the friend's attitude toward the 
issue are in the same direction as the stand 
taken in the communication. Under these 
circumstances there will be no pressure to 
change, since the perceived structure cor-
responds to an underlying balanced abstract
structure (either s digraph a or s digraph h
in Figure 2). An example of an unbalanced
perceived structure is one corresponding to
an objective situation in which a person pre-
sents a communication about some issue to
his friend and both his attitude and his
friend's attitude toward the issue are in the
opposite direction to the stand taken in the
communication. Two such unbalanced per-
ceived structures are presented in Figure 3. 
Under these conditions, a discrepancy exists 
between the perceived structure (either s 
digraph a or s digraph b in Figure 3) and 
the corresponding balanced abstract structure 
(either s digraph a or s digraph h in Figure 
2). There will be pressure to resolve the 
discrepancy and this may involve some form 
of cognitive adjustment (e.g., the source's
message might be misinterpreted) or overt 
response (e.g., the source might be asked to 
clarify his position) or both. 

It is important to note that in any com- 
munication situation the perceived structure 
of the source of the communication may be 
quite different from the perceived structure 
of the receiver of the communication. Typ- 
ically the source has more information about 
why he is performing the act. In situations 
where he is involved in attitude-discrepant 
communication, for example, the source may 
know that he has been coerced in some way 
to deliver the message, whereas the receiver 
may be quite ignorant of this fact. In this 
case the receiver may perceive the source as 
acting of his own volition and responsible for 
his act, whereas the source would know
better; he may from the start deny responsi-
bility for his attitude-discrepant action.

Fig. 3. Signed digraphs representing two unbalanced perceived structures corresponding to communication situations where the communication is discrepant with attitudes of both source and receiver who are mutual friends.

A proper test of the communication model 
depends on how successfully one can co- 
ordinate the theoretical concepts to em- 
pirical operations. Positive or negative at- 
titudes between source and receiver, positive 
or negative attitudes of source or receiver 
toward the issue, positive or negative stands 
taken by the communication toward the is- 
sue, agreement or disagreement with the 
communication by source or receiver have 
all been taken as instances of attitudinal re- 
lations. Unit relations have been somewhat 
harder to specify, particularly when they are
assumed to be negative. Membership of a
source and receiver in the same tightly knit 
group has been represented by a positive unit 
relation linking S and R. A positive unit 
relation between source and communication 
has been taken to represent the situation in 
which the source sees himself as responsible 
for the communication (source's perceived 
structure) or the receiver sees the source as 
responsible for the communication (receiver's 
perceived structure). The negative unit re- 
lation between source and communication has 
been taken to represent the case where the 
source sees himself as disowning the com- 
munication (source's perceived structure) or 
is perceived by the receiver as not re-
sponsible for the communication (receiver's
perceived structure).

The model presented in Figure 2 has con-
siderable predictive power. Provided one
knows any three relations that involve the
four elements (S, C, I, R) of a structure,
one can use the principle of structural bal-
ance to derive the remaining relations. This
type of strategy can be employed in studies
designed to map into the assumed balanced
abstract structures. For example, one can
present a hypothetical communication situa-
tion in which the source's communication
favors some issue (positive CI attitudinal
relation), the receiver likes the source (posi-
tive RS attitudinal relation), and the receiver
sees the source as responsible for the com-
munication (positive SC unit relation).
One can ask subjects to predict whether
the receiver's attitude toward the issue will
be favorable (positive RI attitudinal rela-
tion), whether the receiver will agree with
the communication (positive RC attitudinal
relation), etc. Such a study would seek to
discover if subjects were using the balanced
abstract structure represented in Figure 2a
in making their judgments. In other in-
vestigations, one can design situations in
which the perceived structures are unbal-
anced from the start and study the cognitive
adjustments that occur. In fact the struc-
tural balance model of communication effects
has initiated an extensive program of re-
search (Feather, 1965a, 1965b, 1966, 1967c;
Feather & Armstrong, 1967; Feather & Jef-
fries, 1967). It has also been applied to the
literature on attitude change and has proved
to be a powerful and general model in its
ability to illuminate this highly complex area
(see Feather, 1964, 1967b).

Fig. 4. Signed digraphs for the recall model representing four balanced structures and four unbalanced structures.
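
The derivation of unknown relations from known ones can be illustrated with a short sketch. It assumes that the known relations link the elements into one connected structure and are themselves consistent with balance; under that assumption the sign of any remaining relation is the product of the signs along a path of known relations joining the two elements. The function name and the (element, element, sign) encoding are hypothetical conveniences, not part of the model's published notation.

```python
from collections import defaultdict, deque


def implied_sign(known, a, b):
    """Sign (+1 or -1) that balance implies for the relation between a and b.

    known: iterable of (x, y, sign) relations assumed to be mutually balanced.
    Returns None when no chain of known relations joins a and b.
    """
    adjacent = defaultdict(list)
    for x, y, sign in known:
        adjacent[x].append((y, sign))
        adjacent[y].append((x, sign))

    sign_from_a = {a: 1}
    queue = deque([a])
    while queue:
        point = queue.popleft()
        for other, sign in adjacent[point]:
            if other not in sign_from_a:
                # Multiply signs along the path leading away from a.
                sign_from_a[other] = sign_from_a[point] * sign
                queue.append(other)
    return sign_from_a.get(b)


# The example in the text: positive CI, positive RS, positive SC.
known = [("C", "I", 1), ("R", "S", 1), ("S", "C", 1)]
print(implied_sign(known, "R", "I"))  # receiver's attitude toward the issue
print(implied_sign(known, "R", "C"))  # receiver's agreement with the communication
```

For the example given, both derived relations come out positive: the receiver is expected to favor the issue and to agree with the communication. If the communication instead opposed the issue (negative CI), the same rule would imply a negative RI relation.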


Structural Balance Model of Recall 


The structural balance model of recall is 
presented in Figure 4. Each s digraph in 
Figure 4 is coordinated to a structure in 
which the elements are person, argument, 
and issue. The basic structure is therefore 
similar to the S-I-C and R-I-C triads in the 
communication model of Figure 2. The re- 
call model involves only one person, how- 
ever, and a simple triadic structure that in- 
cludes positive and negative attitudinal 
relations. These relations, which are repre- 
sented by directed positive and negative lines 
in each s digraph, refer to the person's atti- 
tude toward the issue, the person's attitude 
toward the argument, and the stand taken by 
the argument toward the issue. For example,
Figure 4c could represent an abstract
structure in which a person opposes capital
punishment (the issue), knows an argument 
that opposes capital punishment, and agrees 
with that argument. In Figure 4, half of the 
s digraphs are balanced; the other half are 
unbalanced. 
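
The split into four balanced and four unbalanced structures follows directly from the three signed relations in the recall model. A minimal sketch, assuming each relation is coded +1 or -1 and that a triad counts as balanced when the product of its signs is positive; the variable names are illustrative, and no attempt is made to match rows to the lettering of the panels in Figure 4.

```python
from itertools import product


def recall_structures():
    """List every sign combination for the person-argument-issue triad.

    Each row is (attitude, stand, agreement, balanced), where
      attitude  = person's attitude toward the issue,
      stand     = stand taken by the argument toward the issue,
      agreement = person's attitude toward the argument.
    """
    rows = []
    for attitude, stand, agreement in product((1, -1), repeat=3):
        balanced = attitude * stand * agreement == 1
        rows.append((attitude, stand, agreement, balanced))
    return rows


for row in recall_structures():
    print(row)
```

The capital punishment example corresponds to the row (-1, -1, 1): the person opposes the issue, the argument opposes it, and the person agrees with the argument, which gives a positive product and hence a balanced structure.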

The model has been applied to the situa- 
tion where a person is asked to recall argu- 
ments about some controversial issue. 
Meaningful material, therefore, is elicited 
toward which the person has a definite atti- 
tude. It is assumed that he will employ 
balanced abstract structures corresponding to 
s digraphs a, b, c, and d in Figure 4 in re- 
calling arguments, since the structures cor- 
responding to s digraphs e, f, g, and h are 
unbalanced and unlikely to survive in mem- 
ory. This assumption is consistent with the 
general proposition developed previously, 
namely, that the products of long-term infor- 
mation processing are likely to be abstract 
structures that are organized rather than dis-
organized. Hence arguments that are keyed
into balanced abstract structures will be more
available for recall.

This emphasis on abstract structures as
directive influences on the recall process
echoes Bartlett's (1932) view:


when a subject is being asked to remember, very 
often the first thing that emerges is something of
the nature of an attitude. The recall is then a 
construction, made largely on the basis of this 
attitude, and its general effect is that of a justifica- 
tion of the attitude [p. 207]. 


In the present model, the constructive nature 
of recall is also emphasized (see esos 
1970; Neisser, 1967, p. 292), and the con-
structive activity is assumed to be guided by
balanced abstract structures in accordance
with the demands of the immediate social 
situation. 

The recall model has served as the basis for 
an extensive program of research (Feather, 
1969a, 1969c, 1969d, 1970) mainly con- 
cerned with mapping abstract structures in 
relation to what is recalled. These studies 
have shown that subjects who were asked to 
report arguments that were pro and con on 
an issue (e.g., Vietnam) presented argu-
ments that tended to be consistent with their
own attitude toward the issue, where con- 
sistency was defined as in Figure 4. Thus a 
person in favor of an issue would report a 
large number of pro arguments with which 
he agreed and a large number of con argu- 
ments with which he disagreed (see balanced 
s digraphs a and b in Figure 4), but very 
few pro arguments with which he disagreed 
or con arguments with which he agreed (see 
unbalanced s digraphs e and f in Figure 4). 


Thus his recall was selective and appeared to 
be determined by balanced abstract struc- 
tures. 

The degree to which cognitive structures 
facilitate recall has been investigated in other 
contexts apart from the present one which 
involves controversial issues toward which 
the person has a definite attitude and for 
which the special recall model illustrated in 
Figure 4 was formulated. For example, 
Deese (1965) has investigated associative 
structures using the method of word associa- 
tion, and Bousfield (1953) has examined the 
extent to which words are recalled in mean- 
ingful clusters (for a recent review of clus- 
tering in recall see Shuell, 1971). Each of 
these different approaches emphasizes the 
concept of underlying structures directing 
the recall process. The degree to which 
these underlying structures would involve 
attitudes would vary according to whether 
the material was personally relevant (as as-
sumed in the present model) or relatively
neutral (as assumed in most of the studies
of "clustering").

Fig. 5. Signed digraphs for the attribution model representing four balanced structures.


Structural Balance Model of Attribution 
of Responsibility 


The structural balance model of attribu- 
tion of responsibility is presented in Figure 
5. Each s digraph is coordinated to a struc- 
ture in which person, self, and outcome 
(success or failure) are the elements and the 
relations are unit and attitudinal. As be- 
fore, the elements of the structure are 
mapped into points of the s digraph, the 
attitudinal relations into directed lines which 
may be positive or negative in sign cor- 
responding to the direction of the attitude 
(pro or con), and the unit relations into posi- 
tive or negative brackets. The model as- 
sumes that success is positively evaluated by 
the person, whereas failure is negatively 
evaluated. But the person may have either 
a favorable or an unfavorable attitude toward 
himself. The positive unit relation between 
self and outcome has been taken to imply 
that the self is seen as owning the outcome:
responsibility for the outcome is attributed 
by the person to internal factors within the 
self such as ability. The negative unit rela- 
tion between self and outcome has been taken 
to imply that the self is seen as disowning or 
not responsible for the outcome—responsi- 
bility for the outcome is attributed to factors 
outside the self such as luck. This coordina- 
tion of perceived responsibility to the posi- 
tive unit relation and perceived lack of 
responsibility to the negative unit relation is 
consistent with the interpretation of the 
positive or negative unit relation between 
source and communication discussed in rela- 
tion to the communication model. The at- 
tribution model, however, involves only one 
person, not two as in the communication
model.

The four s digraphs in Figure 5 may be
coordinated both to balanced abstract struc-
tures and balanced perceived structures. An
example of a balanced perceived structure
would be one coordinated to a situation in
which a person with negative self-
evaluation fails at a task and sees his failure
as due to lack of ability. Under such condi-
tions there would be no pressure to change
the perceived structure, since it corresponds
to an underlying balanced abstract structure
(s digraph d in Figure 5). An example of
an unbalanced perceived structure would be
one in which a person with high positive
self-evaluation succeeds at a task but is led to
believe that his success was due to external
factors such as good luck and not to his own
ability (see Figure 6). In this case there
will be a discrepancy between the perceived
structure represented in Figure 6 and a cor-
responding abstract structure (s digraph a
or s digraph c in Figure 5). There will be
pressure to resolve the discrepancy and this
may involve some form of cognitive adjust-
ment. For example, the person might de-
crease his self-evaluation (bringing about
correspondence with s digraph c) or restruc-
ture the situation so that the success can be
attributed to self after all (achieving cor-
respondence with s digraph a).

Fig. 6. Signed digraph representing an unbalanced perceived structure corresponding to a situation where success is attributed to external factors.

The attribution model has also led to an 
extensive program of research (Feather, 
1969b; Feather & Simon, 1971a, 1971b),
most of it involving mapping into the four 
balanced abstract structures assumed in the 
model. One of the main predictions so far 
tested has been that a person will attribute 
good outcomes (success) to himself when 
self-evaluation is positive but will disown
good outcomes when self-evaluation is nega-
tive. In contrast, bad outcomes (failure)
will be attributed to the self when there is
negative self-evaluation but will be disowned
when there is positive self-evaluation. This
hypothesis has been supported in a specific 
task situation in which high expectations of 
success have been taken to connote positive 
self-evaluation and low expectations of suc- 
cess to connote negative self-evaluation with 
respect to the task to be performed. Thus a 
highly confident person (positive self-evalua- 
tion) tended to attribute the outcome more 
to ability than to good luck if he succeeded, 
and more to bad luck than to lack of ability 
if he failed. An unconfident person (nega- 
tive self-evaluation) tended to attribute the 
outcome more to good luck than to ability if 
he succeeded, and more to lack of ability 
than to bad luck if he failed. Attribution of 
responsibility for success and failure, there- 
fore, appeared to follow balanced abstract 
structures in terms of the s digraphs in 
Figure 5. 
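
The four predictions just described reduce to a single rule: the unit relation between self and outcome takes whatever sign makes the person-self-outcome triad balanced. The sketch below assumes that self-evaluation and outcome are each coded +1 or -1 and that the unit relation is their product; the function name and the cause labels returned are illustrative choices.

```python
def predicted_attribution(self_evaluation, outcome):
    """Predict how an outcome is attributed under the balance assumption.

    self_evaluation: +1 (positive) or -1 (negative)
    outcome:         +1 (success, positively evaluated) or -1 (failure)
    Returns (unit, cause): unit is +1 when the outcome is owned by the self
    (internal attribution) and -1 when it is disowned (external attribution).
    """
    if self_evaluation not in (1, -1) or outcome not in (1, -1):
        raise ValueError("arguments must be +1 or -1")
    unit = self_evaluation * outcome  # sign that keeps the triad positive
    if unit == 1:
        cause = "ability" if outcome == 1 else "lack of ability"
    else:
        cause = "good luck" if outcome == 1 else "bad luck"
    return unit, cause


for self_evaluation in (1, -1):
    for outcome in (1, -1):
        print(self_evaluation, outcome, predicted_attribution(self_evaluation, outcome))
```

The four combinations reproduce the pattern reported for confident and unconfident persons: ability and bad luck for positive self-evaluation, good luck and lack of ability for negative self-evaluation.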


Summary 


The three models that have been described 
are the products of an overall strategy that 
attempts to deal with perceived and abstract 
structures by specifying elements and rela-
tions and by invoking the principle of bal-
ance to account for the ways in which the 
abstract structures become organized. Un- 
balanced perceived structures are assumed to 
be under pressure to change toward cor- 
responding balanced abstract structures and 
any discrepancy may result in cognitive or 
behavioral adjustments on the part of the 
person aimed at reducing the discrepancy.

The three models involve unit and atti- 
tudinal relations and thereby recognize the 
importance of the processes of association 
and evaluation in the formation of structures. 
But the attitudinal relations have been in- 
cluded only because each model involves per-
sonal elements. One can also conceive of
structures where evaluation is not important
and where the elements are impersonal, as-
sociated in terms of unit-forming characteris-
tics such as similarity and proximity. One's
cognitive map of the physical structure of a
town or city would be an example.


STRUCTURAL CORRESPONDENCE AND
STRUCTURAL DISCREPANCY


In presenting the three models described
in the previous section, it was implicitly as-
sumed that the elements in the perceived
structure would correspond to the elements
in the underlying abstract structure; for ex-
ample, in the case of the communication
model, that both perceived structure and
abstract structure would contain the same
set of elements, namely S (source or com-
municator), C (message or communication),
I (issue), and R (receiver or communica-
tee). In the sense that the abstract struc-
ture is considered to be an abstraction or
summary of previous experience, this as-
sumption appears to be reasonable. One
can conceive of a person forming a simplified
schema in which the common elements of a
situation are abstracted over time and be-
come incorporated into an abstract structure.
It would be difficult, for example, to imagine
a communication situation that did not in-
volve the four elements mentioned above.
In terms of the logic of abstraction, therefore,
one would expect correspondence or iso-
morphism between elements of the abstract
structure and elements within the perceived
structure. Indeed, over time the abstract
structure may come to exert a directive in-
fluence on the way in which the actual situa-
tion is perceived, permitting a simplified view
of present reality in terms of the basic ele-
ments involved in the abstract structure.
Also over time the abstract structure may be
enlarged as more common elements are ab-
stracted on the basis of experience.

What then is the basis of structural dis-
crepancy if there is correspondence between
elements of the abstract structure and ele-
ments within the perceived structure? Clear-
ly, structural discrepancy would depend on
the relations rather than the elements within
the perceived structure; increasing dis-
crepancy would occur as more and more
relations in the perceived structure had to be
changed in order to correspond to the under-
lying abstract structure. In these terms,
one index of the size of structural discrep-
ancy would be the minimum number of
relations that would have to be modified if
the perceived structure were to correspond
to an underlying abstract structure. Such
an approach would have the effect of equat-
ing size of structural discrepancy with the
degree of balance in the perceived structure
(see Harary et al., 1965, Ch. 13), size of
discrepancy increasing as degree of balance
becomes less. In Figure 7, for example, two
relations would have to be altered to bring
s digraph b into correspondence with one of
the three possible abstract structures. Struc-
tural discrepancy would be greater in this
case than for s digraph a where only one
relation would have to be changed to accom-
modate to the corresponding abstract struc-
ture. Signed digraph a could represent the
situation where source (S) and receiver (R)
are mutual friends with similar attitudes
toward an issue (I), and where source (S)
delivers a message or communication (C)
that is discrepant with his own attitude to-
ward the issue. The situation represented
by s digraph b would be similar except that
in this case, the two people are not mutual
friends; S likes R, but R does not like S.
Intuitively the latter situation appears more
discrepant from a balanced state than the
former, and indeed, in terms of the above
definition, it is more structurally discrepant.


The above definition of structural dis-
crepancy, however, is an oversimplification
for it neglects the possibility that relations
may differ in strength as well as in sign. A
single relation may be very difficult to change
because it is strong (e.g., intense liking for
a friend), whereas two relations may be
quite easy to change because they are weak.
Hence an index of structural discrepancy
should probably refer not only to the mini-
mum number of relations that have to be
altered to produce balance, but also to the
strength of those relations relative to other
relations in the perceived structure (i.e.,
number weighted by strength).
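
Both indices of structural discrepancy, the minimum number of relations to be changed and the same count weighted by strength, can be computed by exhaustive search for structures as small as those considered here. The following is a sketch under stated assumptions: relations are (element, element, sign) triples, strengths are arbitrary positive numbers supplied by the investigator, balance is tested by the two-set method, and the example structure is hypothetical rather than one of the digraphs in Figure 7.

```python
from collections import defaultdict, deque
from itertools import combinations


def is_balanced(relations):
    """True when the points split into two sets consistent with every sign."""
    adjacent = defaultdict(list)
    for a, b, sign in relations:
        adjacent[a].append((b, sign))
        adjacent[b].append((a, sign))
    side = {}
    for start in adjacent:
        if start in side:
            continue
        side[start] = 1
        queue = deque([start])
        while queue:
            point = queue.popleft()
            for other, sign in adjacent[point]:
                expected = side[point] * sign
                if other not in side:
                    side[other] = expected
                    queue.append(other)
                elif side[other] != expected:
                    return False
    return True


def discrepancy(relations, strengths=None):
    """Return (fewest sign changes, smallest total strength changed) for balance.

    The search tries every subset of relations, so it suits small structures only.
    """
    relations = list(relations)
    n = len(relations)
    strengths = [1.0] * n if strengths is None else list(strengths)
    best_count, best_cost = None, None
    for k in range(n + 1):
        for subset in combinations(range(n), k):
            changed = [(a, b, -s if i in subset else s)
                       for i, (a, b, s) in enumerate(relations)]
            if is_balanced(changed):
                cost = sum(strengths[i] for i in subset)
                if best_count is None:
                    best_count = k  # first hit uses the fewest changes
                if best_cost is None or cost < best_cost:
                    best_cost = cost
    return best_count, best_cost


# Hypothetical structure: one negative relation shared by two triads.
relations = [("P", "O", -1), ("P", "X", 1), ("O", "X", 1),
             ("P", "Y", 1), ("O", "Y", 1)]
print(discrepancy(relations))
print(discrepancy(relations, strengths=[10, 1, 1, 1, 1]))
```

In the hypothetical example one change (reversing the PO relation) restores balance, but when that relation is given a strength of 10 and the others a strength of 1, changing two weak relations is the cheaper route. This is the contrast between a strong single relation and two weak ones drawn in the paragraph above.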

Finally, there may be a tendency for new
relations to be induced in the perceived struc-
ture in line with the underlying abstract
structure, that is, the abstract structure may
determine the induction of new perceived
relations involving association or evaluation
or both in their various forms. For ex-
ample, a person listening to a friend's com-
munication that takes a position consonant
with his own attitude may perceive his friend
as having the same attitude toward the issue
as he has himself.

The questions raised in the present section
suggest some new directions in the analysis
of cognitive structures, particularly in regard
to the specification of structural discrepancy.
In the next section various ways in which the
three models may be further expanded are
considered and problems that require future
solution are briefly discussed.

Fig. 7. Unbalanced perceived structures and corresponding balanced abstract structures indicating degrees of structural discrepancy. [Panel labels: PERCEIVED STRUCTURE; ABSTRACT STRUCTURE.]


FURTHER EXPANSION OF THE 
THREE MODELS 


Elements and Relations 


The three models so far explored have 
been extremely simple ones involving no 
more than four elements and no more than 
two relations. In actuality, however, situa- 
tions are likely to be much more complex so 
that, in a communication situation, for ex- 
ample, a receiver may be exposed to a num- 
ber of communications about a variety of 
issues and he may be only one of a large 
audience listening to the source. It follows 
that perceived structures may be quite com- 
plex, involving a constellation of elements 
and relations. Abstract structures, on the
other hand, are likely to be simpler in their 
formal properties since they are the residues 
of experience, summaries or abstractions 
from what has gone before. 

It should be possible to expand the models 
described previously by adding new elements 
and new relations. One such extension has 
already been made in an attempt to con-
ceptualize the concept of "importance of the
issue" in the communication model. The
concept of importance is one that has been
introduced into a variety of theories. For
example, Festinger (1957) allowed for such
a concept when he included the importance of
the cognitive elements involved in a disson-
ant relationship as a factor influencing the
magnitude of dissonance. In relation to the
communication model (Feather, 1967b), it 
has been argued that one can represent an 
important issue "as one toward which the 
person has a strong attitude and as one that 
is closely bound together with other issues 
toward which the person also has strong 
attitudes of the same sign [p. 149]." Figure 
8 represents such a situation in which Issues 
I, J, and K are important to receiver R, and 
in which a communication C about issue I is 
sent by a positive source S. The stand
taken in this communication is discrepant
with R's attitude toward I in both s digraph
a and s digraph b. Obviously the perceived
structures corresponding to s digraph a and
s digraph b are unbalanced. It would be
possible to restore balance in the S-C-I-R 
semicycle if the RI attitudinal relation were 
to change from positive to negative (s 
digraph a) or from negative to positive 
(s digraph b). One would expect such 
changes to be resisted, however, since they 
would throw the semicycles involving I, J, 
and K out of balance. Thus, in the two 
cases represented, the receiver might dissoci- 
ate the source from the communication (a 
rather difficult cognitive adjustment to make, 
see Feather, 1965b), he might misperceive 
the stand taken by the communication (if 
there is some ambiguity) or he might change 
his attitude toward the source. All of these
would be effective cognitive adjustments that
would resolve the discrepancy.
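
The resistance of an important issue to attitude change can be illustrated by reversing one relation at a time and testing the resulting structure for balance. The coding below is hypothetical: the signs are one plausible reading of the situation described in the text (a liked source responsible for a communication opposing an issue that the receiver favors, with three favored issues bound together), and they are not taken from the figure itself. The helper names are likewise assumptions.

```python
from collections import defaultdict, deque


def is_balanced(relations):
    """True when the points split into two sets consistent with every sign."""
    adjacent = defaultdict(list)
    for a, b, sign in relations:
        adjacent[a].append((b, sign))
        adjacent[b].append((a, sign))
    side = {}
    for start in adjacent:
        if start in side:
            continue
        side[start] = 1
        queue = deque([start])
        while queue:
            point = queue.popleft()
            for other, sign in adjacent[point]:
                expected = side[point] * sign
                if other not in side:
                    side[other] = expected
                    queue.append(other)
                elif side[other] != expected:
                    return False
    return True


def with_reversed(relations, a, b):
    """Copy of the structure with the sign of the a-b relation reversed."""
    return [(x, y, -s if {x, y} == {a, b} else s) for x, y, s in relations]


# Assumed signs for illustration only.
perceived = [
    ("R", "S", 1),   # receiver likes the source
    ("S", "C", 1),   # source is responsible for the communication
    ("C", "I", -1),  # communication opposes issue I
    ("R", "I", 1), ("R", "J", 1), ("R", "K", 1),  # receiver favors I, J, K
    ("I", "J", 1), ("J", "K", 1), ("I", "K", 1),  # issues bound together
]

print("as perceived:", is_balanced(perceived))
for pair in (("R", "I"), ("S", "C"), ("C", "I"), ("R", "S")):
    print("reverse", pair, "->", is_balanced(with_reversed(perceived, *pair)))
```

Under these assumed signs, reversing the RI relation alone leaves the structure unbalanced because the semicycles through J and K turn negative, whereas dissociating the source from the communication, misperceiving the stand taken, or changing the attitude toward the source each yields a balanced structure.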

It will be noted that the above interpreta- 
tion of importance of the issue preserves the 
structural character of the communication 
model and assumes that balanced abstract 
structures exist that are somewhat more 
complex than those represented in Figure 2. 
Other possible expansions of the communica- 
tion model might include additional elements 
to allow for the group memberships of the 
source and of the receiver, and additional 
elements to represent the value base of the 
source and the receiver (perhaps enabling 
better representation of the perceived trust- 
worthiness of the source in terms of the 
values he holds). Additional relations 
might also be incorporated to take account of 
dominance or power relationships between 
source and receiver (e.g., the authority of
the source) and the role characteristics of
source and receiver (perhaps by integrating
the communication model with structural 
role theory; Oeser & Harary, 1962, 1964). 
Similarly, in the recall model, one might in- 
clude extra elements and unit relations to 
represent clusters of issues and arguments, 
and in the attribution model, additional ele- 
ments to represent other persons when at- 
tribution is investigated within a social 
context (Feather & Simon, 1971a). Such 
extensions are feasible and would enable ex- 
pansion of the structural models to wider 
areas while still operating within the mathe- 
matical framework of linear graph theory. 

Unbalanced signed digraphs in which Issues 
I, J, and K are important to receiver R. 

It is apparent that one psychological factor 
limiting the size and complexity of abstract 
structures in the three models presented is 
the general assumption of balance in the 
underlying abstract structures. It is prob- 
ably easier to justify this assumption in those 
cases where the number of elements and 
relations and hence the number of semicycles 
is small (as in the three models presented). 
Large, complex abstract structures may not 
be completely balanced, however, since they 
would contain large numbers of semicycles, 
some of which would involve many elements, 
and all of these semicycles would have to be 
balanced—a formidable task for the indi- 
vidual. For this reason, abstract structures 
of the kind presented in the three models 
may involve only a limited number of ele- 
ments. Highly complex structures are prob- 
ably beyond the capacity of most individuals, 
and simplification of complex information 
input so that it becomes coded in terms of 
smaller organized units is probably the nor- 
mal course of events. As has been argued, 
this simplification is an important part of 
adaptation, enabling the individual to achieve 
stability in the face of complexity and 
variation. 
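
The requirement that every semicycle be balanced can be checked mechanically for small structures. The following Python sketch is offered only as an illustration: it relies on the standard result that a signed graph is balanced exactly when its points can be divided into two sets with positive lines inside each set and negative lines between the sets. The function name, the list-of-triples representation, and the example triads are assumptions introduced here and are not part of the three models.

```python
from collections import deque


def is_balanced(relations):
    """Return True if every semicycle of the signed structure is positive.

    relations: iterable of (element, element, sign) with sign +1 or -1.
    Direction is ignored, as it is when semicycles are traced.
    """
    neighbours = {}
    for u, v, sign in relations:
        neighbours.setdefault(u, []).append((v, sign))
        neighbours.setdefault(v, []).append((u, sign))

    camp = {}  # element -> 0 or 1
    for start in neighbours:
        if start in camp:
            continue
        camp[start] = 0
        queue = deque([start])
        while queue:
            u = queue.popleft()
            for v, sign in neighbours[u]:
                # A positive relation keeps v in u's camp; a negative
                # relation places it in the opposite camp.
                wanted = camp[u] if sign > 0 else 1 - camp[u]
                if v not in camp:
                    camp[v] = wanted
                    queue.append(v)
                elif camp[v] != wanted:
                    return False  # some semicycle has a negative product
    return True


# Hypothetical P-O-X triads used only as an illustration.
print(is_balanced([("P", "O", +1), ("P", "X", +1), ("O", "X", +1)]))  # True
print(is_balanced([("P", "O", +1), ("P", "X", +1), ("O", "X", -1)]))  # False
```

A single pass over the relations suffices here, although the number of semicycles that must all be positive grows rapidly as elements and relations are added.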

However, it would be foolish to ignore the 
fact that individuals do differ in their 
capacity to cope with the welter of informa- 
tion they receive and that some seem able to 
develop more complex and differentiated ab- 
stract structures than others. Since it would 
be difficult to balance these structures com- 
pletely, other types of balance such as local 
balance may become important (see Harary 
et al., 1965). Local balance would occur, 
for example, if only the semicycles involving 
the self as an element were balanced—a 
plausible assumption in more complex struc- 
tures where self is an element. Whether 
there are general principles that underlie the 
particular inconsistencies that can be toler- 
ated in complex structures is an important 
question. The articulation of such principles 
as they might apply to structures of greater 
complexity is a task for the future, demand- 
ing both theoretical and empirical inquiry. 
Psychologically, however, it is important to 
allow in any conceptualization for people who 
have a complex view of existence that toler- 
ates some inconsistencies. And, in line with 
the above discussion, some inconsistencies 
may be almost inevitable when cognitive 
structures are complex. 
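
Local balance at a single element can be illustrated in the same spirit. The Python sketch below traces every semicycle that passes through a chosen element and checks that the product of its signs is positive. It assumes at most one relation between any pair of elements, it is intended only for small structures (the search grows quickly with size), and the function name and example structure are hypothetical.

```python
def locally_balanced_at(relations, element):
    """Return True if every semicycle through `element` is positive.

    relations: iterable of (a, b, sign) with sign +1 or -1; direction is
    ignored and at most one relation per pair of elements is assumed.
    """
    sign_of = {}
    neighbours = {}
    for a, b, sign in relations:
        sign_of[frozenset((a, b))] = sign
        neighbours.setdefault(a, set()).add(b)
        neighbours.setdefault(b, set()).add(a)

    def walk(current, visited, product):
        for nxt in neighbours.get(current, ()):
            sign = sign_of[frozenset((current, nxt))]
            if nxt == element:
                # Closing a semicycle needs at least three distinct elements;
                # otherwise we are merely retracing the last relation.
                if len(visited) >= 3 and product * sign < 0:
                    return False
            elif nxt not in visited:
                if not walk(nxt, visited | {nxt}, product * sign):
                    return False
        return True

    return walk(element, {element}, 1)


# Hypothetical structure: the triad containing "self" is balanced, while
# the triad B-C-D, which does not contain "self", is not.
structure = [
    ("self", "A", +1), ("self", "B", +1), ("A", "B", +1),
    ("B", "C", +1), ("C", "D", +1), ("B", "D", -1),
]
print(locally_balanced_at(structure, "self"))  # True
print(locally_balanced_at(structure, "C"))     # False
```

In this example the structure as a whole is unbalanced, yet it is locally balanced at the self.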


Strength of Relations 


Expansion of the present models requires 
some means of representing the strength of 
different relations, since at present the mod- 
els only enable predictions about sign (posi- 
tive or negative). Balance theory is in this 
respect somewhat akin to a two-valued logic 
when what is required in expanded models is 
a multivalued logic. The present approach 
enables one to deal with sets of relations such 
as: If P likes O and if P likes X then O will 
like X, but it does not enable one to answer 
questions such as: If P likes O with prob- 
ability p₁ and P likes X with probability 
p₂, what is the probability that O will like X? 
The concept of strength will be necessary if 
one is to predict the strength of new induced 
relations in a perceived structure and if one 
is to arrive at an adequate index of structural 
discrepancy. More generally, the question 
of whether there are equilibrium states for 
strengths of structural relations as well as 
for signs of structural relations (as in bal- 
anced structures) is an intriguing one, de- 
serving of close investigation. 

In regard to the induction of new relations, 
several principles have been advanced by the 
author in a study of how subjects predict in- 
terpersonal attraction (Feather, 1966). At 
the level of the dyad, a similarity principle 
appeared to hold. Thus, in two-person 
structures involving person (P) and other 
person (O), a predicted attitudinal relation 
tended to mirror a given unit or attitudinal 
relation in strength as well as in sign. For 
example, if a subject knew that P liked O a 
lot or that P and O both belonged to the 
same tightly knit group, there was a tend- 
ency for him to predict that O will very 
probably like P. At the level of the triad, a 
discrepancy principle taking account of both 
sign and strength of relations seemed to fit 
the data best. This principle asserts that 
when two given relations have the same sign, 
large discrepancies in their strength will be 
associated with a weak positive third rela- 
tion and small discrepancies with a strong 
positive third relation completing the triad. 
When two given relations are of opposite 
sign, large discrepancies in their strength 
will be associated with a strong negative 
third relation and small discrepancies with a 
weak negative third relation. 

The discrepancy principle may be applied 
to all triads in the three models discussed 
previously and is generalized mathematically 
in Figure 9 which represents any triad of re- 
lations containing elements P, Q, R. It is 
assumed in Figure 9 that relations PR and 
QR are given and that PQ has to be pre- 
dicted. As before, solid lines represent posi- 
tive relations and dashed lines represent 
negative relations. The numbers x and y 
refer to the strength of the PR and QR re- 
lations, respectively, and are assumed to fall 
between 0 and 1. The predicted strength of 
the PQ relation according to the discrepancy 
principle is indicated in Figure 9 for triads 
containing given relations of the same sign 
(s digraphs a and b) and of opposite sign 
(s digraphs c and d). The predicted 
strength will be between 0 and 1. 
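
The verbal statement of the discrepancy principle can be put into a small numerical sketch. The expressions shown in Figure 9 are not reproduced in the text, so the two functional forms used below are assumptions chosen only to agree with the verbal statement: 1 - |x - y| when the given relations have the same sign, and (x + y)/2 when they are of opposite sign (the discrepancy being taken on a signed scale and rescaled to lie between 0 and 1). The function name is likewise hypothetical.

```python
def predict_pq(x, y, same_sign):
    """Return (sign, strength) of the predicted PQ relation.

    x, y: strengths of the given PR and QR relations, each between 0 and 1.
    same_sign: True if PR and QR have the same sign, False otherwise.
    The functional forms are illustrative assumptions, not those of Figure 9.
    """
    if not (0.0 <= x <= 1.0 and 0.0 <= y <= 1.0):
        raise ValueError("strengths must lie between 0 and 1")
    if same_sign:
        # Small discrepancy in strength -> strong positive third relation.
        return +1, 1.0 - abs(x - y)
    # Opposite signs: discrepancy on the signed scale is x - (-y) = x + y;
    # halving keeps the result between 0 and 1. Large discrepancy ->
    # strong negative third relation.
    return -1, (x + y) / 2.0


print(predict_pq(0.9, 0.8, same_sign=True))
print(predict_pq(0.9, 0.8, same_sign=False))
print(predict_pq(0.0, 0.0, same_sign=True))
print(predict_pq(0.0, 0.0, same_sign=False))
```

Under these assumed forms, two given relations of negligible strength yield a predicted strength of 1 when their signs agree and of 0 when they differ.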

The two principles mentioned above are 
extremely simple ones and will undoubtedly 
require modification as they are tested em- 
pirically. Even now, it will be apparent that 
the discrepancy principle leads to paradoxi- 
cal predictions as x and y both approach 
zero, since the third relation would approach 
1 when the two given relations are of the 
same sign and 0 when the two given relations 
are of opposite sign (see Figure 9). More- 
over, one needs a formulation of strength 
that can be applied in a mathematically con- 
sistent way to structures containing any 
number of elements and relations. The 
issue is complicated by summation effects 
(Feather, 1966) that may occur in struc- 
tures. The possibility exists that there may 
be no general mathematical solution to these 
problems, but this has still to be shown. If 
there is no general solution, one will have to 
be content with structural models that pre- 
dict sign but not strength of relations, an im- 
portant achievement nevertheless.² 

FIG. 9. Predicted strength of PQ relation when 
strengths of PR and QR relations are given using 
the discrepancy principle for given relations of 
same sign (signed digraphs a and b) and unlike 
sign (signed digraphs c and d). 


Other Forms of Cognitive Bias 


While research based on the three models 
discussed in the present paper has strongly 
supported the assumption of underlying bal- 
anced abstract structures, the results have 
also shown that it may be necessary to con- 
sider the effects of other forms of cognitive 
bias. Chief among these has been a posi- 
tivity bias: a tendency for subjects to bias 
evaluation in the positive direction. In tests 
of the three models, this effect has appeared 
in a tendency (a) for the receiver of a com- 
munication to evaluate the source of the 
communication positively (Feather & Arm- 
strong, 1967; Feather & Jeffries, 1967), (b) 
for persons to recall more arguments they 
agreed with than arguments they disagreed 
with (Feather, 1970), and (c) for a person 
to attribute the success of another person 
more to ability than one's own success, and 
the failure of another person less to lack of 
ability than one's own failure (Feather & 
Simon, 1971a; Simon, 1969). Positivity ef- 
fects have also been found in other areas of 
social psychology (Feather, 1967b; Zajonc, 
1968). Indeed Rosenberg (1968) has re- 
cently referred to "the individual's need to 
so construct ‘reality’ as to emphasize hopeful 
prospects [p. 91]," a psychological need to- 
ward wish fulfillment that may interfere with 
the arousal of inconsistency and the restora- 
tion of consistency. Earlier, Rosenberg and 
Abelson (1960) had spoken of a tendency 
“to maximize expected gain and to minimize 
expected loss [p. 145].” In addition, other 
forms of cognitive bias have been explored 
(e.g., reciprocity, agreement, extremity). 
Some of these effects may be situation spe- 
cific, but others (like positivity) appear to 
have considerable generality. Whether they 
can be accounted for in structural terms is 
a question still to be answered. 


² The present models do not include representa- 
tion of ambivalent relations. A recent attempt has 
been made by Cartwright and Harary (1970) to 
treat the problem of ambivalence and indifference. 


Modes of Resolution 


One of the major problems in cognitive 
consistency theory is that of specifying which 
mode of resolution an individual will adopt 
to restore consistency. In the present con- 
text the problem is: Which relation will 
change when the perceived structure is un- 
balanced? In the case of the communication 
model, for example, will the receiver change 
his attitude toward the issue, alter his atti- 
tude toward the source, misperceive the 
stand taken by the communication, etc.? 

For each of the three models, one might 
expect changes to occur in the weaker rela- 
tions of a perceived structure first, but that 
such changes will be conditional upon the 
precise nature of the structure. If a change 
in a weak relation were to throw a lot of 
semicycles out of balance, for example, such 
a change might be resisted (see Figure 8). 
Some relations in a perceived structure may 
be more tied to the immediate situation than 
others (e.g., one’s attitudinal relation to a 
source who is obviously weak and ill-in- 
formed) and may be more easily changed 
than others which are less situation specific 
and more self-relevant. Furthermore, mul- 
tiple changes may occur in a perceived struc- 
ture as well as single changes. For exam- 
ple, a number of changes are allowed for in 
the congruity model (Osgood & Tannen- 
baum, 1955). Perhaps also there is a tend- 
ency to keep the number of changes needed 
to restore balance to a minimum and to 
maximize hedonic gain and minimize hedonic 
loss (Rosenberg & Abelson, 1960). Abel- 
son (1968a) has succinctly summarized a 
number of alternative hypotheses concerning 
modes of resolution. 
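
The suggestion that the number of changes needed to restore balance is kept to a minimum can be illustrated by exhaustive search over a small structure. The Python sketch below is a hypothetical illustration only: the function names, the coding of the S-C-I-R semicycle (positive RS, SC, and RI relations with a negative CI relation), and the added Issue J are assumptions, and the search is practical only for a handful of relations.

```python
from itertools import combinations


def is_balanced(relations):
    """True if the elements can be split into two camps consistently."""
    neighbours = {}
    for u, v, sign in relations:
        neighbours.setdefault(u, []).append((v, sign))
        neighbours.setdefault(v, []).append((u, sign))
    camp = {}  # element -> +1 or -1
    for start in neighbours:
        if start in camp:
            continue
        camp[start] = 1
        stack = [start]
        while stack:
            u = stack.pop()
            for v, sign in neighbours[u]:
                if v not in camp:
                    camp[v] = camp[u] * sign
                    stack.append(v)
                elif camp[v] != camp[u] * sign:
                    return False
    return True


def minimal_changes(relations):
    """Return every smallest set of relations whose reversal gives balance."""
    relations = list(relations)
    for k in range(len(relations) + 1):
        found = []
        for flipped in combinations(range(len(relations)), k):
            trial = [(u, v, -s if i in flipped else s)
                     for i, (u, v, s) in enumerate(relations)]
            if is_balanced(trial):
                found.append([relations[i][:2] for i in flipped])
        if found:
            return found
    return []


# Assumed coding of an unbalanced S-C-I-R semicycle.
semicycle = [("R", "S", +1), ("S", "C", +1), ("C", "I", -1), ("R", "I", +1)]
print(minimal_changes(semicycle))

# Issue I is now also tied to a further Issue J that R favors.
extended = semicycle + [("R", "J", +1), ("I", "J", +1)]
print(minimal_changes(extended))
```

For the isolated semicycle, reversing any one of the four relations restores balance. Once Issue I is tied to Issue J, reversal of the RI relation no longer appears among the minimal solutions, because it would unbalance the semicycle involving J.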

These comments concern cognitive adjust- 
ments and that is in line with the main em- 
phasis of the present paper. But it should 
be evident that overt behavioral adjustments 
may also occur in addition to or in place of 
covert adjustments. Confronted by incon- 
sistency, the individual may as a first step 
seek further information by exploration, 
communication, or other means, and this may 
lead to modification in existing structures 
(Berlyne, 1960, 1966; Newcomb, 1959), or 
he may engage in escape behavior and re- 
move himself from the source of inconsist- 
ency. What adjustment is made probably 
relates to the hierarchy of habits that the in- 
dividual has established in the course of his 
developmental history. For some individ- 
uals, overt action may be a dominant re- 
sponse without intervening cognitive adjust- 
ment; for others, cognitive adjustment may 
be dominant and may precede overt action. 
A complete understanding of modes of reso- 
lution, therefore, will require specification of 
the habit hierarchy for individuals in rela- 
tion to particular situations. The specific 
adjustments made may also reflect the in- 
fluence of motivational factors in addition to 
consistency seeking and inconsistency reduc- 
tion. A recent analysis of information-seek- 
ing behavior by the author has allowed for 
such extrinsic factors within an expectancy- 
value paradigm (Feather, 1967a). The 
whole general question of the relationship of 
behavioral adjustment to cognitive adjust- 
ment is a particularly thorny one in psychol- 
ogy. It may turn out that the relationship is 
correlative rather than causal (Atkinson, 
1969, pp. 123-126). 


Individual Differences 


The present approach allows for individ- 
ual differences both in perceived structures 
and in abstract structures. Subjects may 
differ in their immediate perceptions of the 
same situation and in the abstract structures 
they employ. As already indicated, for some 
persons the abstract structures may be com- 
plex and differentiated, for others simple and 
undifferentiated. Investigation of individ- 
ual differences in structures implies the need 
for a taxonomy of ways in which structures 
can differ. Zajonc (1968, pp. 320-338) has 
provided an excellent review of approaches 
to cognitive organization in terms of unde- 
fined elements, attributes, dimensions, cate- 
gories, concepts, and schemata. This paper 
will not attempt to cover the same ground. 
Of special relevance, however, are attempts 
to formulate structural aspects of cognitive 
organization (e.g., Scott, 1963, 1969; Zajonc, 
1960) employing concepts such as differen- 
tiation, complexity, unity, rigidity, centrality, 
etc., concepts that are in turn related to 
Lewin's pioneering attempts to use topology 
in the representation of a person's life space 
(Lewin, 1935, 1951). Various ways of 
distinguishing between structures have also 
been suggested in the analysis of sociometric 
structures and communication networks 
(Bavelas, 1950; Shaw, 1964). One should 
be able to give a more precise definition of 
these structural concepts using graph theory, 
and although some approaches have been 
made in this direction (e.g., in group struc- 
ture, Flament, 1963), much more needs to 
be done. Such translations would have im- 
mediate value since a person's conceptual 
organization has been taken by some as a 
means of understanding his personality, his 
cognitive style, his level of psychological de- 
velopment, etc. (e.g., Kelly, 1955; open- 
mindedness versus closed-mindedness, Ro- 
keach, 1960; concreteness-abstractness, 
Harvey, Hunt, & Schroder, 1961). 

A number of studies have focused on gen- 
eral personality variables such as dogmatism, 
category width, intolerance of ambiguity, 
concreteness-abstractness, self-esteem, etc., 
in relation to tolerance for inconsistency 
(Glass, 1968), but these studies are beset 
by difficulties of interpretation (Abelson, 
1968b). Thus, in the present context, tests 
of the recall model have shown no effects of 
dogmatism or intolerance of ambiguity on 
the types of argument available in recall 
(Feather, 1970), yet in another study two 
personality variables (dogmatism and intoler- 
ance of ambiguity) were found to influence 
preference for information that varied in con- 
sistency and novelty (Feather, 1969e). It is 
likely that such research would be more pro- 
ductive if the personality variables could be 
closely related to well-defined dimensions of 
the underlying abstract structures. One 
obvious dimension already discussed is the 
sheer size of the structure in terms of the 
number of semicycles it contains. It is a 
plausible assumption that individuals who 
are to maintain relatively complex and dif- 
ferentiated abstract structures containing 
large numbers of semicycles are more likely 
to be able to tolerate inconsistencies since, as 
argued previously, large complex structures 
are more difficult to balance completely than 
simple structures that are relatively un- 
differentiated. 


Summary 


The present section has been concerned 
with ways in which the three models might 
be expanded. The treatment has necessarily 
been rather summary, but new directions 
have been suggested that are feasible and 
that could be pursued in the future to enable 
an even more powerful representation of 
cognitive structures concerned with com- 
munication, recall, and attribution, and an 
even better understanding of the various 
cognitive adjustments and behavioral effects 
that may occur. 

The remaining sections of this article will 
move away from discussion of the three 
models to focus on some wider issues. 
These are concerned with the possibility of 
generalizing the principle of structural dis- 
crepancy to structures relating to the physi- 
cal world, the problem of development and 
change in abstract structures, and finally 
whether there are ways (apart from bal- 
ance) in which abstract structures might be 
organized. In conclusion, brief mention will 
be made of some approaches in the literature 
that are similar in certain respects to the 
present one. 


ABSTRACT STRUCTURES AND 
PHYSICAL REALITY 


Although previous discussion involving 
the three models has referred only to situa- 
tions in which the relevant structures contain 
personal as well as impersonal elements, it 
should be apparent that abstract structures 
may refer both to the physical environment 
and to the social environment. One’s way 
of structuring the world of physical objects 
is probably more stable than one's way of 
structuring social objects, but it is believed 
that the attempt to resolve structural dis- 
crepancy applies to both. In both cases a 
discrepancy between a perceived structure 
and an abstract structure may be assumed to 
lead to pressure to resolve the discrepancy 
by cognitive and/or behavioral adjustment. 
The familiar distorted room illusion of Ames 
is a good example of a case in which a per- 
ceived structure is discrepant with an ab- 
stract structure that refers to the physical 
environment (one’s conception of the way 
people usually appear in rooms); so is the 
perception of a picture hanging askew on 
the wall (it ought to hang straight). The 
strange figures that involve contour confu- 
sions likewise involve discrepancies between 
perceived structures and abstract structures 
based on one’s usual experience of contours 
in the physical world (see the disconcerting 
example in Figure 10). Hebb’s earlier re- 
search on the discrepancy between the ex- 
pected and the strange also falls into this class 
(Hebb, 1949), since the perceived structure 
(concerning, for example, the laboratory at- 
tendant dressed in unfamiliar clothes) is dis- 
crepant with an abstract structure (concern- 
ing the way the laboratory attendant usually 
dresses). 

FIG. 10. Perceived structure inconsistent with under- 
lying abstract structure concerning contours. 

Such discrepancies create tension and 
there may be a move to reinterpret informa- 
tion from the physical environment just as 
one may try to modify perceived structures 
that are related to social situations and that 
are discrepant with assumed abstract struc- 
tures (as in the examples discussed previ- 
ously relating to communication and attribu- 
tion). However, as previously indicated, it 
is probably harder to change one’s interpre- 
tation of physical reality than to change one’s 
interpretation of social reality (cf. Festinger, 
1957, p. 27). Hence, where the discrepant 
perceived structure refers to the physical en- 
vironment, direct behavioral adjustments 
may occur. These adjustments may involve 
the seeking of new information, particularly 
when the structural discrepancy is small (as 
in curiosity behavior, see Berlyne, 1960, 
1966), leading to the modification of existing 
structures or to the development of new 
ones. Alternatively, active escape from the 
situation may occur, particularly when the 
discrepancy is large (as in avoidance of 
the strange, see Hebb, 1949). Such avoid- 
ance would have the effect of preserving ex- 
isting abstract structures. 


DEVELOPMENT AND CHANGE IN 
ABSTRACT STRUCTURES 


Where perceived structures cannot be 
modified and where the structural discrep- 
ancy is consistently experienced, some change 
in the corresponding abstract structures may 
develop over time. Such changes occur in 
the course of cognitive development as the 
child grows older and as more and more 
novel and discrepant information is fed into 
the system (cf. Piaget). The old structures 
no longer prove adequate and are modified 
and elaborated to meet the changing circum- 
stances. Hunt (1961, pp. 267-288) has pro- 
vided an interesting account of accommoda- 
tive modification and growth in terms of the 
match between environmental circumstances 
and existing schemata, using the Piagetian 
stages of development. He suggests that 


any discrepancy between central processes and cir- 
cumstance beyond the limits of an organism's ca- 
pacity for accommodation evokes distress and avoid- 
ance, while any discrepancy within the limits of an 
organism's capacity for accommodation is a source 
of pleasurable interest or curiosity . . . where cir- 
cumstance and central process match perfectly, the 
result is stultifying boredom in which development 
fails [p. 269]. 


Such assumptions are obviously very similar 
to those implied by the work of Hebb (1949) 
and Berlyne (1960, 1966) and to those pre- 
sented in the previous section. 

The abstract structures involved in the 
three models that have been described (com- 
munication, recall, attribution) are probably 
well-established ones, end products of a lot 
of experience in social situations. At a more 
radical level, change in abstract structures 
may be forced by an extreme change in per- 
ceived structures. In regard to physical 
reality, the experiments concerning the ef- 
fects of reversal or distortion of the visual 
field probably involved such changes (e.g., 
Kohler, 1962). In regard to social reality, 
a sudden shift to a hypothetical culture in 
which success was negatively valued and fail- 
ure was not a disgrace would probably lead 
to basic changes in the abstract structures 
assumed for the attribution model. 

Finally, one may deliberately attempt to 
create new abstract structures by reexposing 
oneself to past information (if it is possible) 
and by refusing to interpret it in terms of 
existing structures that have proved inade- 
quate. Such a process may underlie the 
creation of new scientific theories and per- 
haps a willingness to dissociate oneself from 
abstract structures already available. 


ORGANIZATION OF ABSTRACT STRUCTURES 


In presenting the three models of com- 
munication, recall, and attribution earlier, it 
was assumed that the underlying abstract 
structures would be organized according to 
the principle of balance. In these models, 
both personal as well as impersonal elements 
were involved, and relations could be at- 
titudinal or unit. It was recognized that in 
more complex structures, involving large 
numbers of elements and relations, complete 
balance may not be possible but that there 
should be an overall tendency for the struc- 
ture to be well organized. The present sec- 
tion is concerned with whether or not there 
are other principles of organization apart 
from balance that may be applied to abstract 
structures. Is the balance principle a gen- 
eral one or is it only one of a number of 
principles underlying cognitive organization? 

This question highlights an important is- 
sue in cognitive psychology today. While 
there has been general agreement that cog- 
nitive organization occurs (witness the use 
of terms such as structures, schemata, clus- 
ters, chunks, etc.), not enough attention has 
been paid to how the structures emerge and 
what principles govern their organization. 
In the case of structures related to the 
physical world, the tendency toward formal 
organization of objects and events on the 
basis of principles of similarity, temporal 
and spatial contiguity, closure, symmetry, 
good continuation, etc., is undoubtedly im- 
portant and is a major contribution of the 
Gestalt psychologists, though one might not 
want to accept their notion of innate ante- 
cedents at this stage, but to leave the ques- 
tion open. In his discussion of cognitive 
structures, Neisser (1967, pp. 290-292) 
underlines the importance of temporal and 
spatial dimensions in the way structures are 
organized. Such dimensions presumably 
enable the reconstruction of physical and 
social events in their temporal order as well 
as in terms of their spatial characteristics. 
Also important in regard to abstract struc- 
tures that are keyed to the physical world 
are the various constancies related to size, 
shape, color, etc. These groupings and con- 
stancies give stability and consistency to ex- 
perience. From the empiricist point of view, 
they may be seen as summaries or abstrac- 
tions from continual experience with the 
physical environment, becoming essential 
aspects of the programs or abstract structures 
that underlie one's stable view of the physical 
world (cf. Hebb, 1949). A nativist ap- 
proach (as in Gestalt psychology) would see 
them as outcomes of innately determined 
brain processes, determining the form in 
which information input is cognized (see 
Neisser, 1967, p. 246). 

But what of social reality? Do the or- 
ganizational principles of Gestalt psychology 
also apply to the social world? Zajonc 
(1968, p. 323) has pointed to the dangers 
in making this assumption. Yet, as used by 
Heider (1958), the principle of balance is 
rather similar to the Gestalt principle of 
Prägnanz, the emphasis on “good form” 
and a fieldlike state of equilibrium. Funda- 
mental to this notion is that of a cognitive 
unit having parts that are of the same dy- 
namic character (Heider, 1944, 1946; Za- 
jonc, 1968, p. 339). When this similarity 
occurs, the parts are grouped together and 
a steady state exists. But when parts are 
of different dynamic character, disequilib- 
rium arises and the parts tend to become 
segregated from each other (Heider, 1946, 
p. 107). In his cogent discussion of the 
principle of balance, Zajonc (1968) argues 
that although forces toward balance have the 
same character as Gestalt forces toward 
“good figures” in the perception of forms, 
the structures considered by Heider have in 
addition another component, namely, the 
forces arising from particular attitudes (see 
also Heider, 1960, p. 168). According to 
Zajonc it has 


never been theoretically clear how these attitudinal 
forces interact with the “good figure” forces— 
whether the former are more or less important, or 
even whether they have the same dynamic proper- 
ties as Gestalt forces [p. 341]. 


The intrusion of these “attitudinal forces” 
has become evident in a number of studies 
testing the balance principle. Recently, con- 
sideration of the attitudinal relation between 
two persons (P and O) in a P-O-X struc- 
ture also involving impersonal object (X), 
has led Newcomb (1968) to distinguish be- 
tween balanced, unbalanced, and nonbalanced 
sets of relations. A balanced set of relations 
is one which a person accepts as it is; an 
unbalanced set of relations is one that he 
attempts to modify; a nonbalanced set of 
relations is one toward which he has a clear 
lack of preference for accepting or modifying 
the set of relations. According to New- 
comb, whenever the PO relation is negative 
(one person dislikes the other) the triad is 
nonbalanced and is often accompanied by in- 
difference and low engagement. Otherwise 
his definitions are the same as Heider’s. 
Newcomb’s discussion also indicates that the 
original definition of balance may be over- 
simplified and his review of the research 
evidence tends to support his view (see also 
Zajonc, 1968, Table 1). Thus, the balance 
principle is coming under increasing attack 
not only on theoretical grounds, but also in 
relation to how far it can handle all of the 
research evidence. It may end up as only 
one of a number of factors influencing cog- 
nitive adjustments. 
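
Newcomb's threefold distinction, as summarized above, can be set beside Heider's definition for the eight possible P-O-X triads. The following Python sketch is a minimal illustration under that summary: a triad is treated as nonbalanced whenever the PO relation is negative and otherwise classified as in Heider's definition (balanced when the product of the three signs is positive). The function names are hypothetical.

```python
from itertools import product


def heider(po, px, ox):
    """Heider's definition: balanced if the product of the signs is positive."""
    return "balanced" if po * px * ox > 0 else "unbalanced"


def newcomb(po, px, ox):
    """Newcomb's categories as summarized in the text above."""
    if po < 0:
        return "nonbalanced"  # negative PO relation
    return heider(po, px, ox)


# Tabulate all eight sign patterns (+1 positive, -1 negative).
for po, px, ox in product((+1, -1), repeat=3):
    print(f"PO={po:+d} PX={px:+d} OX={ox:+d}  "
          f"Heider: {heider(po, px, ox):<10}  Newcomb: {newcomb(po, px, ox)}")
```

The two classifications agree for the four triads with a positive PO relation and differ for the four in which P dislikes O.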

The use of the balance principle in the 
three models described previously should not 
be taken as endorsement of Gestalt principles 
of Prägnanz, closure, etc. Such principles 
have not fared well as unitary explanations 
(for a recent review of studies concerning 
closure see Holmes, 1968), and the under- 
lying assumption of innate determinants, 
particularly of spatial organization, is a hotly 
disputed one. Instead, in the three models 
described in the present article, the balance 
principle is used in a definitional sense to in- 
dicate what is meant by consistent and in- 
consistent sets of relations (something that 
is often left vague in alternative consistency 
theories, e.g., dissonance theory). In a very 
broad sense, each of the three models may be 
seen as presenting a structural set of rules 
which are organized in a consistent manner 
(balanced) and which specify the psycho- 
logic of how sets of elements are associated 
and evaluated. One way of interpreting the 
balanced abstract structures, therefore, is in 
terms of systems of rules that constitute in- 
ternally consistent theories about reality. 
For example, on the basis of one’s own ex- 
perience in task situations, one might formu- 
late the general rule that unexpected success 
or unexpected failure at a task will usually 
be caused by variable characteristics of the 
person or the situation (e.g., by an unusual 
increase or decrease in momentary effort or 
by good or bad luck), whereas expected out- 
comes will be caused by stable dispositional 
properties of the person or situation (e.g., by 
the presence or absence of ability or the ease 
or difficulty of the task). This general rule 
is captured in the abstract structures of the 
attribution model insofar as the person is 
concerned. Similarly, on the basis of past 
experience in communication situations, one 
might formulate the general rule that people 
whom one likes usually have attitudes to- 
ward issues similar to one’s own attitudes, 
and that their communications tend to be 
consistent with these attitudes. This gen- 
eral rule is captured in those abstract struc- 
tures of the communication model in which 
the attitudinal relations between source and 
receiver are positive. Thus, the balance 
principle enables one to structure or organize 
sets of general rules in a consistent manner 
and to predict the direction a particular rule 
will take within the total system of rules. 
In an important sense, therefore, the un- 
derlying abstract structures involved in the 
representation of social reality have a norma- 
tive function. They specify sets of rules 
making up internally consistent theories 
about what ought to be the case in any given 
social situation that has been experienced 
over time. It is likely that the normative 
aspect of abstract structures also extends to 
structures concerning physical reality. Con- 
stancies of size and shape, for example, may 
be considered to involve notions of what 
ought to be the case in the physical environ- 
ment. This quite general interpretation of 
abstract structures recognizes that one of the 
compelling characteristics of individuals is 
that they follow rules and are guided by 
normative principles (Peters, 1969; Sherif, 
1936). 

Why are the rules balanced or consistent 
in the three models presented? The answer 
to this question is beyond the scope of the 
present article. Yet, as indicated above, we 
do not want to prejudge the issue by appeal- 
ing to innately determined principles of or- 
ganization, though innate determinants may 
be part of the answer. Zajonc (1968) in- 
dicates that 


Whether or not these principles can be extended to 
refer also to semantic or social proximity, semantic 
or social similarity, semantic or social continuation, 
and semantic or social closure is an empirical ques- 
tion which has not yet been answered [p. 342]. 


As emphasized previously, modes of or- 
ganization may be learned in the course of 
an individual's adaptation to a changing 
world to give meaning, continuity, and sta- 
bility to variable information input, and such 
learning may be an essential part of early 
experience (Hebb, 1949). In addition, there 
are demands on a person from an early age 
toward behaving consistently, and in time, 
consistency may acquire incentive value in 
its own right, although one would expect 
individual differences to occur in such learn- 
ing (Feather, 1967). Finally, the real 
world may be constructed in a consistent way 
in the sense that, for example, friends do 
tend to like the same things, people do tend 
to like the things they own, etc. The bal- 
anced organization of abstract structures 
keyed to social contexts may therefore re- 
flect social reality. Certainly, if one relates 
abstract structures to sets of rules organized 
in a consistent way according to the balance 
principle, then these normative principles 
would be influenced by what usually happens 
in real social situations, since abstractions 
should incorporate modal events. 

Some of the ideas expressed above have 
their counterparts in the literature of con- 
sistency theory, though these related ap- 
proaches do not make the distinction between 
perceived structures and abstract structures 
and have little to say about the long-term 
effects of information processing in the 
formation of abstract structures. For ex- 
ample, some theories have specifically looked 
at organization in terms of rules of implica- 
tion, emphasizing a sort of psychological 
rationality in man and indicating principles 
of "psycho-logic" by which consistency is 
achieved (Abelson, 1968c; Abelson & Rosen- 
berg, 1958; McGuire, 1968; Rosenberg & 
Abelson, 1960). Jordan (1968) has related 
imbalance to conflicting implications about 
unit formation among sets of elements and 
relations and argues that “pUo and p-Uo is 
the canonic form for any form of imbalance, 
no matter how many entities are involved 
[p. 175]." In other cases more emphasis 
has been placed on conceptual good figures 
in line with the Gestalt tradition that the 
configurational laws of perception apply to 
cognition generally (Koffka, 1935). Thus, 
De Soto (1960, 1961) has shown that there 
is a pervasive preference for linear orderings 
when people assign orders and that this 
preference may be associated with an under- 
lying conceptual good figure or schema (see 
also De Soto & Albrecht, 1968). More re- 
cently, Davis (1967) has argued for a gen- 
eral principle of clusterability as a basis for 
organization, particularly within sociometric 
structures. The balance principle implies 
the formation of two clusters or cliques link- 
ing elements, with positive relations within 
each clique and negative relations between 
cliques (see the structure theorem, Cart- 
wright & Harary, 1956). Davis has sug- 
gested that more than two cliques may form 
under certain conditions that can be specified 
mathematically and that balance may be a 
special case of clusterability. Research by 
De Soto, Henley, and London (1968) does 
suggest that clusterable structures (more 
than two cliques) are as easily learned as 
balanced structures, but such research is still 
in its infancy (see also Peay, 1970). These 
different contributions indicate that the bal- 
ance principle may be only one likely prin- 
ciple underlying organization in abstract 
structures and that at the present stage, one 
should not exclude the possibility that a 
number of principles may be found to exist, 
some more general than others. The bal- 
ance principle, however, does seem to be a 
viable one in relation to the three models 
discussed in the present article, enabling a 
precise definition of how relations within the 
abstract structures are organized in regard 
to their signs. But, as previously indicated, 
it is coming under increasing attack as a 
unitary principle and in relation to the em- 
pirical evidence. It probably needs to be 
supplemented by consideration of other fac- 
tors (cf. the previous discussion of expanded 
models and of specific attitudinal effects; 
Newcomb, 1968; Zajonc, 1968). As a prin- 
ciple, it is more appropriate to abstract struc- 
tures keyed to social reality where both 
attitudinal and unit relations are involved.* 
Moreover, in time it may be superseded by 
more general principles such as clusterability. 
Finally, it should not be forgotten that the 
perceived structure is assumed to be organ- 
ized too and that the processes involved in 
this organization also need to be understood. 
In most cases, particularly with respect to 
the physical environment, perceived struc- 
tures and abstract structures tend to cor- 
respond. The interesting cases emerge 
when organization in the perceived structure 
is imperfect or abnormal (as in Figure 10 or 
in an unbalanced communication situation), 
since there is then a discrepancy between the 
perceived structure and an underlying ab- 
stract structure. Consideration of the ef- 
fects of these structural discrepancies has 
been a major thrust of the present article. 
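
The structure theorem cited above (Cartwright & Harary, 1956) lends itself to a brief illustration. The following Python sketch is not part of the original article; the function name `balanced_partition` and the three-element example (p, o, x) are assumptions introduced only for illustration. It tests whether a signed structure can be split into two cliques with positive relations within each clique and negative relations between them, which is the sense of balance used in the text.

```python
from collections import deque


def balanced_partition(nodes, signed_edges):
    """Return two cliques (sets) if the signed graph is balanced, else None.

    signed_edges is a list of (u, v, sign) with sign +1 or -1.
    Illustrative helper only; the name is not taken from the article.
    """
    neighbours = {v: [] for v in nodes}
    for u, v, sign in signed_edges:
        neighbours[u].append((v, sign))
        neighbours[v].append((u, sign))

    side = {}  # node -> 0 or 1 (which clique it is placed in)
    for start in nodes:
        if start in side:
            continue
        side[start] = 0
        queue = deque([start])
        while queue:
            u = queue.popleft()
            for v, sign in neighbours[u]:
                # A positive relation keeps v in u's clique; a negative one
                # forces v into the other clique.
                wanted = side[u] if sign > 0 else 1 - side[u]
                if v not in side:
                    side[v] = wanted
                    queue.append(v)
                elif side[v] != wanted:
                    return None  # contradictory placement: unbalanced
    first = {v for v in nodes if side[v] == 0}
    second = {v for v in nodes if side[v] == 1}
    return first, second


# p likes o, and both dislike x: balanced, with cliques {p, o} and {x}.
print(balanced_partition(["p", "o", "x"],
                         [("p", "o", 1), ("p", "x", -1), ("o", "x", -1)]))
# p likes o and x, but o dislikes x: unbalanced.
print(balanced_partition(["p", "o", "x"],
                         [("p", "o", 1), ("p", "x", 1), ("o", "x", -1)]))
```

A clusterability check in Davis's sense would relax the two-clique restriction; it is not attempted here.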


3 The present discussion has not included an 
analysis of the types of structures considered by 
Piaget. The three models have been concerned 
with structures containing both persons and things 
as elements and with relations that include evalua- 
tions as well as unit-forming tendencies. The 
mathematics is in terms of graph theory. Piaget's 
structures focus more on a psychologic involving 
lattice theory and group theory. But the Piagetian 
emphasis on adaptation and organization is basic 
to the present formulation. 


DISCREPANCY THEORY: RELATED 
APPROACHES 


The general paradigm involved in the 
present approach is one that focuses on structural 
discrepancies and their cognitive and 
behavioral effects. The use of a concept of 
discrepancy occurs in other theories in psy- 
chology and some of these have already been 
mentioned (e.g., Hebb’s analysis of the fear 
of the strange, Berlyne's discussion of curi- 
osity, Hunt's account of the match between 
environmental circumstances and existing 
schemata). The general approach of the 
consistency theories is also somewhat similar. 
Newcomb (1959), for example, has included 
a concept of discrepancy in his analysis of 
the strain toward symmetry, and Aronson 
(1968) has considered dissonance theory in 
terms of confirmation or disconfirmation of 
self-relevant expectancies. The earlier con- 
tributions of Festinger (1950, 1954) con- 
cerning informal social communication and 
social comparison also involved concepts of 
discrepancy. These approaches, however, 
have not distinguished between abstract 
structures and perceived structures. The 
following related approaches may also be 
briefly noted: the emphasis on general 
schemata that are built up over time and 
tested against reality (Bartlett, 1932; Piaget, 
1966), spontaneous changes in recall in terms 
of underlying ideal structures (Koffka, 
1935), generalized expectancies or "cognitive 
maps" that can be confirmed or disconfirmed 
by experience (Tolman, 1958), neural or- 
ganizations that become progressively re- 
fined as learning proceeds (Hebb, 1949), 
adaptation levels that are based on previous 
experience (Helson, 1964), discrepancies 
between perception and adaptation levels that 
underlie the learning of motives (McClelland, 
Atkinson, Clark, & Lowell, 1953, Ch. 
2), disparities in psychological events that 
define motives (Peak, 1968), images and 
plans (analogous to computer programs) 
that guide the form that behavior takes 
(Miller, Galanter, & Pribram, 1960), cate- 
gories or concepts that are refined on the 
basis of experience and to which information 
input is matched in some sense so as to 
achieve stability in perception and thought 
(Bruner, 1957; Brunswik, 1939; Harvey 
et al., 1961), and basic linguistic structures 
that enable us to understand the generation 
of sentences (Chomsky, 1957, 1965). 

All of these approaches are similar in some 
respects to the present one though they differ 
in the degree to which the structural con- 
cepts involved in them are articulated in 
formal terms. For example, even though 
the concept of expectancy is a structural 
concept involving the basic idea that one 
comes to learn the implications of a response 
(e.g., If in this situation (S) I perform this 
response (R), then outcome (X) will hap- 
pen; see MacCorquodale & Meehl, 1953), 
very little has been done to represent ex- 
pectancies and the discrepancy between 
expectancy and present experience using 
structural models incorporating mathematical 
treatments of structures. And the same 
criticism applies to most of the other con- 
cepts involved in the above list. 


CONCLUDING REMARKS 


The concepts employed in the present 
article obviously have wide generality. They 
focus on the individual’s attempt to achieve 
some degree of constancy or stability as he 
processes information from the physical and 
social world, and the consequences that fol- 
low when this constancy is violated. In a 
very general sense, the abstract structures 
involve internally consistent systems of rules 
or theories about what ought to be the case 
in the physical and social environment. The 
development of organized abstract structures 
appears to be a necessary part of an in- 
dividual's adaptation to a complex world.* 

The generality of these concepts and the 
fact that similar approaches have appeared 
in various areas of psychology suggest the 
emergence of a basic paradigm (Kuhn, 
1966). It is a paradigm that forces one to 
come to grips with the concept of structure, 
to consider different ways in which struc- 
tures may be organized and ways in which 
structures may be discrepant. The present 
paper has shown that it is possible to deal 
with these questions mathematically (albeit 
in a simple way), thereby providing a clear 
definition of structural organization, and that 
it is also possible to determine empirically 
what the perceived and abstract structures 
are and the effects that occur when there is 
discrepancy between them. Much remains 
to be done, however, in regard to the map- 
ping of cognitive structures, the identification 
of modes of organization, and the determina- 
tion of degrees of structural discrepancy. 
Further development in this area is an im- 
portant task for psychology since structural 
concepts lie at the heart of so much of psy- 
chological theory. 


4 The detailed elaboration of this idea in relation 
to development is, of course, a major contribution 
of Piagetian psychology (Piaget, 1966). 


THE DUPLICATION THEOREM OF SOCIAL 
RELATIONSHIPS AS TESTED IN THE 
GENERAL POPULATION 


WALTER TOMAN 1 


University of Erlangen-Nürnberg, Germany 


The effects of certain characteristics of a person's family constellation on 
his social behavior, on his choices of partners for life, and on the family he 
founds have been tested with two samples of the general population. The 
“duplication theorem,” stating that new social relationships tend to have 
more chances of permanence and success the closer they resemble earlier 
and earliest (intrafamilial) social relationships, has been confirmed on a 
larger scale. 


1 Requests for reprints should be sent to Walter 
Toman, Psychologisches Institut der Universität 
Erlangen-Nürnberg, 8520 Erlangen, Bismarckstrasse 1, 
Germany. 


Early social relationships have bearing on 
later social relationships. This thesis has 
been advanced by Freud (1953) who claimed 
explicitly that a person’s relations to his 
parents while a child affect his psychosexual 
development and, ultimately, his adult het- 
erosexual relationships. Adler (1920) ques- 
tioned what difference the presence of sib- 
lings would make for a person’s character 
development and adjustment in later life. He 
was not very systematic and described only 
a few sibling positions and their effects. Seg- 
ments of such effects have been sporadically 
studied by others (e.g., Koch 1955, 1956, 
1960; Krout 1939; Schachter 1964; Schoon- 
over 1959; Warren 1966). 

The author's thesis is the following: Sib- 
ling positions tend to determine role prefer- 
ences in social contacts and in enduring peer 
relationships outside the family. Friends 
and partners in love and/or marriage will 
tend to get along with one another better and 
for longer times, other things being equal, if 
their role preferences are compatible. But 
when are they compatible? 

Of many possible aspects by which such 
compatibility of role preferences may be de- 
fined, what was thought to be the simplest 
and most objective aspect was chosen: the 
age order and the sex distribution among a 
person's siblings. 

If a sibling configuration consists of n 
children, there are $2^n$ different patterns in 
which it may appear if only sex and age 
order are being varied. A person chosen 
randomly can hold any of $n \cdot 2^n$ sibling 
positions. Holding a given one of n possible 
positions in his sibling configuration, his n − 1 
siblings could be any of $2^{n-1}$ combinations of 
brothers and sisters. For the sake of sim- 
plicity, let us inspect what this means when 
n = 2. 

There are four different patterns of sibling 
configurations: two brothers, two sisters, a 
brother and a sister, a sister and a brother. 
A given person coming from a two-children 
configuration, if male, may be the older or 
the younger brother either of a brother or 
of a sister ; if female, she may likewise be the 
older or the younger sister either of a brother 
or of a sister. 
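
The counting argument can be checked by enumeration. The Python sketch below is not part of the original article; the function names are illustrative assumptions, and the codes follow the notation used in the text (b = brother, s = sister, with older siblings in parentheses before the person and younger siblings in parentheses after). For n = 2 it yields the four configurations and the eight basic sibling positions.

```python
from itertools import product


def sibling_patterns(n):
    """All 2**n sex patterns of n children, oldest first (b = boy, s = girl)."""
    return ["".join(p) for p in product("bs", repeat=n)]


def sibling_positions(n):
    """All n * 2**n sibling positions, written in the b(s) / (b)s notation."""
    positions = []
    for pattern in sibling_patterns(n):
        for rank in range(n):
            older, own, younger = pattern[:rank], pattern[rank], pattern[rank + 1:]
            code = (f"({older})" if older else "") + own \
                + (f"({younger})" if younger else "")
            positions.append(code)
    return positions


print(sibling_patterns(2))        # ['bb', 'bs', 'sb', 'ss']
print(sibling_positions(2))       # eight codes, from 'b(b)' to '(s)s'
print(len(sibling_positions(3)))  # 24, i.e., 3 * 2**3
```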

Sibling positions have ordinarily been 
maintained over many years in the context of 
the original family. They are a person’s 
earliest, most regular, and inescapable peer 
experiences. Social contacts with peers in 
new settings such as playground, church, 
school, clubs, etc., may be patterned after 
them. 

Each of the above-mentioned eight basic 
sibling positions implies a peer role distinct 
from those of all other sibling positions. 
One may assume, on the average, that the oldest 
brother of a brother—b(b)—is used to a 
senior position among peers and to contacts 
with a peer of the same sex only; that the 
oldest brother of a sister—b(s)—is also 
used to a senior role, but vis-à-vis a person 
of the opposite sex only; that the younger 
brother of a brother—(b)b—is accustomed 
to being led and protected by a male; and 
the younger brother of a sister—(s)b—by a 
female. 

Likewise, the older sister of a sister— 
s(s)—is used to a senior position among 
peers of the same sex; the older sister of a 
brother—s(b)—to a senior position among 
peers of the opposite sex; the younger sister 
of a sister—(s)s—may, on the average, be 
assumed to expect some kind of leadership 
and protection from a peer of the same sex; 
the younger sister of a brother—(b)s—from 
a peer of the opposite sex. 

All of them would tend to be unaccus- 
tomed to assuming opposite roles, respec- 
tively, or to maintaining contacts with a peer 
of a sex other than that of their respective 
sibling. Such an assumption would be plaus- 
ible by common sense, but is also compatible 
with learning theory (e.g., Hull, 1943) or 
psychoanalytic theory (e.g., Freud, 1953). 
Most generally stated, experiences of a cer- 
tain type generate preferences for similar 
experiences rather than different ones. This 
would at least hold for positive experiences. 
Negative experiences would create avoid- 
ance preferences, but if avoidance is not 
really possible in the primary milieu, if the 
brother or the sister are around to stay and 
cannot be escaped, as seems to be the case in 
most families, at least practically speaking, 
one would expect even such negative experi- 
ences with siblings to generate certain pref- 
erences for similar experiences and similar 
long-range relationships. As a matter of 
fact, so-called negative experiences with sib- 
lings and other family members are fre- 
quently embedded in a context of relative 
social comfort and positive attachments 
which are rarely outweighed by the negative 
experience. Even if they are, a child usually 
has no other option during his early and 
most formative years than to put up with 
them. 

It is also consistent with common sense, 
learning theory, and psychoanalytic theory 
that generalizations to new social contexts 
(such as playground, school, professional re- 
lationships, friends) occur automatically. 
Corrections of such generalized expectations 
through new experience are possible, to be 
sure, but those expectations tend also to 


codetermine new experience, the more so, 
the more pervasive and exclusive of others 
the original experiences have been. 

Among the new contexts to which the 
original experiences may be generalized are 
love relationships and marriage. Sibling 
roles learned in the original family may be 
conducive to different degrees of compatibil- 
ity among love and marriage partners. The 
following matches would be considered com- 
patible by the types of peer experiences that 
the partners gathered in their original fam- 
ilies: b(s)/(b)s, which means the older 
brother of a sister married to the younger 
sister of a brother; and (s)b/s(b), which 
means the younger brother of a sister mar- 
ried to the older sister of a brother. The 
reasons: In the first match, both partners get 
what they were used to in their original fam- 
ilies. He can lead and protect a female peer. 
She can accept being led and protected by 
him. This is true of the second match as 
well, although under different auspices: He 
gets leadership and care from his wife, she 
has somebody to guide and care for. In 
both matches the partners duplicate for one 
another a former sibling. The partners are 
maximally compatible by their sibling posi- 
tions. Their relationship is one of role 
complementarity. 

The following two pairs would, on the 
other hand, be judged as incompatible rela- 
tionships: b(b)/s(s), which means the older 
brother of a brother married to the older 
sister of a sister; and (b)b/(s)s, which 
means the younger brother of a brother mar- 
ried to the younger sister of a sister. The 
reasons: In the first match, neither the man 
nor the woman is used to life with a peer 
of the opposite sex by their sibling experi- 
ences, and both of them have the same age 
rank. Both are seniors, probably aspiring to 
lead and protect the other without finding 
the other inclined to oblige. In the second 
match, both are supposedly equally unpre- 
pared for life with a person of the opposite 
sex, and both are juniors. Both would tend 
to expect to be led and/or protected without 
finding such leadership or protection with 
their partner. In both matches the partners 
fail “maximally” to duplicate for one another 
a former sibling. The partners are minimally 
compatible by their sibling positions. 
Their relationship is the opposite of role 
complementarity. By sibling experience they 
have a rank and a sex conflict. 

This, in essence, illustrates the “duplication 
theorem,” which states more generally 
and more inclusively: Those of a person's 
extrafamilial social relationships have better 
chances of permanence and success that re- 
semble more closely than others the earlier 
and earliest (intrafamilial) social relation- 
ships that the person has experienced in his 
life (Toman 1959a, 1959b, 1959c, 1962a). 

There are, of course, intermediate degrees 
of complementarity (e.g., an older brother 
of both a sister and a brother married to the 
middle sister of two brothers: 
b(sb)/(b)s(b)), rank conflicts without sex 
conflicts (e.g., the older brother of a sister 
married to the older sister of a brother: 
b(s)/s(b)), and sex conflicts without rank 
conflicts (e.g., the older brother of a brother 
married to the younger sister of a sister: 
b(b)/(s)s). A rank order of the severity of 
conflict of sibling positions of love or marriage 
partners has been established as part 
of the study to be reported. It is too cum- 
bersome to describe in detail, although a 
computer program has been written to dis- 
tinguish nine such degrees of complementar- 
ity (even 12, if matches with single children 
among the partners were included). Suf- 
fice it here to make and explain the cruder 
distinctions upon which the statistical tests 
to be reported were based. 

Matches of high complementarity of sib- 
ling positions of partners comprise: (a) 
Complete complementarity: Both partners 
have siblings of the opposite sex only and 
complement each other by sibling-age-rank; 
for example, b(ss)/(b)s or (ss)b/s(bbb), 
which means the older brother of two sisters 
married to the younger sister of a brother, 
or the younger brother of two sisters mar- 
ried to the oldest sister of three brothers, 
respectively. 

(b) Partial complementarity: One or both 
partners have siblings of both sexes, but 
there is at least one sibling in the original 
family of each partner who duplicates the 
age-rank and sex of the spouse; for example, 
b(sb)/(bsb)s, which means the older brother 
of a sister and a brother married to the 
youngest sister of two older brothers and one 
older sister (the husband has a younger sis- 
ter among his siblings and the wife two older 
brothers among hers); or (s)b(s)/s(b), 
which means the middle brother of two sis- 
ters married to the older sister of a brother 
(among the two sibling relationships of the 
husband, one sibling, his older sister, dupli- 
cates the sibling role of his spouse). 

Matches of medium to low complementar- 
ity of sibling positions of partners comprise 
all other types except those in which single 
children are among the partners. There are 
rank conflicts without sex conflicts (e.g., 
b(ss)/s(b); both partners are the oldest, 
but both have siblings of the opposite sex 
and can thus be assumed to be used to life 
with a peer of the opposite sex; or b(ss)/ 
(s)s(b), which may be considered a milder 
rank conflict than the first one, since the 
wife has an older sister and is thus partly 
accustomed to the role of a junior, if a 
junior to a female). There are sex conflicts 
without rank conflicts (e.g., (b)b/s(ss)): 
she, an older sister of two sisters, 
would tend to offer guidance and care, and 
he, the younger brother of a brother, would 
be willing to accept that, even though they 
have not learned in their original families to 
play those parts vis-à-vis a person of the 
opposite sex. Finally there are rank plus 
sex conflicts (e.g., b(bb)/s(s)): the older 
brother of two brothers married to the older 
sister of a sister, or (bbb)b/(ss)s, the 
youngest of four brothers married to the 
youngest of three sisters, or (b)b(b)/s(ss), 
the middle one of three brothers matched 
with the oldest of three sisters; the sibling- 
role conflict of the last couple would be con- 
sidered slightly milder, that is, something 
less than a complete rank plus sex conflict, 
since the husband has had an older sibling 
too, not only a younger one, and could be 
expected to find leadership by the partner at 
least partly acceptable. 

Matches with single children among the 
partners are considered to represent the 
lowest degree of sibling role complementar- 
ity. The single child has supposedly had no 
peer experience at all in his original family. 
Yet if he hits upon a marriage partner who 
has had a sibling of the opposite sex (e.g., 
b/s(b)), their match would be deemed 
somewhat better than a match between a 
single child and a partner with siblings of 
the same sex only (e.g., b(bb)/s) or the 
match of two single children (b/s). The 
assumption is that the partner with an early 
peer experience can help the single child 
better than can another single child to fit 
into the current peer relationship: marriage. 

Repeating, three degrees of complementarity 
of sibling positions of (marriage) 
partners have been distinguished: high 
complementarity (matches with complete or 
partial complementarity of sibling positions of 
partners), medium to low complementarity 
(matches with partial or full rank, sex, or 
rank plus sex conflict of sibling positions of 
partners), and lowest complementarity 
(matches with at least one partner being a 
single child). 
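
The three-way distinction can be put in the form of a short program. The following Python sketch is not the author's computer program, which is not reproduced in the article and distinguishes finer degrees; it is a minimal illustration under stated assumptions. It assumes matches are written as husband/wife in the notation used above, and it reads "high complementarity" as: one partner has a younger sibling of the spouse's sex while the spouse has an older sibling of the partner's sex. All function and class names are hypothetical.

```python
import re
from typing import NamedTuple


class Position(NamedTuple):
    older: str    # sexes of older siblings, e.g. "bs"
    own: str      # "b" (brother) or "s" (sister)
    younger: str  # sexes of younger siblings


_CODE = re.compile(r"^(?:\(([bs]+)\))?([bs])(?:\(([bs]+)\))?$")


def parse_position(code: str) -> Position:
    """Parse a code such as 'b(sb)' or '(bsb)s' (hypothetical helper)."""
    m = _CODE.match(code.replace(" ", ""))
    if m is None:
        raise ValueError(f"not a sibling-position code: {code!r}")
    older, own, younger = m.groups()
    return Position(older or "", own, younger or "")


def duplicates(senior: Position, junior: Position) -> bool:
    """True if senior has a younger sibling of junior's sex and
    junior has an older sibling of senior's sex."""
    return junior.own in senior.younger and senior.own in junior.older


def complementarity(match: str) -> str:
    """Classify a match such as 'b(s)/(b)s' into the three cruder degrees."""
    first, second = (parse_position(part) for part in match.split("/"))
    for partner in (first, second):
        if not (partner.older or partner.younger):
            return "lowest"  # at least one partner is a single child
    if duplicates(first, second) or duplicates(second, first):
        return "high"        # complete or partial complementarity
    return "medium to low"   # rank conflict, sex conflict, or both


for example in ["b(s)/(b)s", "(s)b(s)/s(b)", "b(ss)/s(b)", "b(b)/s(s)", "b/s(b)"]:
    print(example, "->", complementarity(example))
```

The sketch does not separate complete from partial complementarity, nor the milder from the more severe conflicts described in the text.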


TESTS OF THE “DUPLICATION THEOREM” 


One critical test of the duplication theorem 
would be a comparison of divorced mar- 
riages versus intact marriages. If the the- 
orem has validity, divorced marriages should 
show a higher frequency of rank and/or sex 
conflict between the sibling positions of the 
partners than intact marriages. 

This was first tested with two groups of 
divorced and intact marriages (Toman 
1962b). The groups were matched by socio- 
economic status (education, approximate in- 
come, ethnic origin, and age). Pairs with 
partners who were single children were elim- 
inated. Intact marriages were to have lasted 
for at least 10 years and to have led to at 
least two children. 

Comparison of the compatibility of their 
sibling positions showed that of 16 married 
couples, 12 displayed high complementarity 
and 4 medium to low complementarity, 
whereas of the 16 divorced couples, 1 showed 
high complementarity and 15 medium to low 
complementarity. As mentioned before there 
were no single children among the couples. 
This distribution of frequencies tested with 
the Fisher-Yates correction formula is highly 
significant (χ² = 12.0, df = 1, p < .001). 
In other words, complementarity of sibling 
positions of partners seems clearly to bear 
on the outcome of marriage. Low comple- 
mentarity increases the likelihood of even- 
tual divorce. 

One might suspect that these effects could 
be attributable not so much to comple- 
mentarity or noncomplementarity of sibling 
positions of partners, but merely to the num- 
ber of siblings the partners had. Persons 
coming from larger families are less likely 
to divorce each other than are those coming 
from small families. While this aspect, num- 
ber of siblings, would be one of several other 
aspects by which the “duplication theorem” 
can be tested (and has been tested success- 
fully: see Toman, Preiser, Gasch, & Plattig, 
1967), the influence of mere number of sib- 
lings regardless of considerations of age- 
rank and sex of siblings would be somewhat 
more trivial and should be tested separately. 
It is logically evident that partners coming 
from a larger configuration of siblings are 
more likely than those coming from smaller 
configurations to have had the “right” sib- 
ling among them: one of the opposite sex 
and of an age-rank partly or fully identical 
with the spouse’s age-rank among his/her 
siblings. 

The samples of 16 divorced and 16 intact 
marriages showed no significant difference in 
numbers of siblings in their original families, 
although a tendency in this direction was ap- 
parent (see Table 1). 


TABLE 1 

NUMBER OF PARTNERS OF DIVORCED AND OF 
MARRIED COUPLES IN RELATION TO 
NUMBER OF THEIR SIBLINGS 

                        Male partnerᵃ               Female partnerᵇ
Status of couple    One sibling   Two or more   One sibling   Two or more
                                  siblings                    siblings
Divorced                12             4             9             7
Married                 10             6             6            10


As can be seen, partners of divorced mar- 
riages come from somewhat smaller families 
(i.e., from families with an “average” of 
2.31 and 2.50 children for male and female 
partners, respectively) than did the married 
partners (with an “average” of 2.50 and 
2.63 children in their original families for 
male and female partners, respectively), but 
the frequency differences are not significant. 
Even pooling the sexes still yields an insig- 
nificant chi-square (χ² = 1.6, df = 1, p < 
.25). Compatibility of partners by sibling 
positions itself may be considered the signif- 
icant source of variation in this sample of 
marital behavior. 
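
The pooled comparison can be retraced from the frequencies in Table 1. The Python sketch below is an illustration rather than the author's computation: it assumes that the table columns distinguish partners with one sibling from partners with two or more siblings, that male and female partners are simply added together, and that the pooled statistic is an uncorrected Pearson chi-square for a 2 × 2 table. The function name is hypothetical.

```python
def chi_square_2x2(a, b, c, d, yates=False):
    """Pearson chi-square for the 2 x 2 table [[a, b], [c, d]].

    With yates=True the continuity correction is applied.
    """
    n = a + b + c + d
    diff = abs(a * d - b * c)
    if yates:
        diff = max(diff - n / 2, 0)
    return n * diff ** 2 / ((a + b) * (c + d) * (a + c) * (b + d))


# Pooled over sexes: (one sibling, two or more siblings)
divorced = (12 + 9, 4 + 7)   # 21, 11
married = (10 + 6, 6 + 10)   # 16, 16

chi2 = chi_square_2x2(*divorced, *married)
print(round(chi2, 1))  # 1.6
```

Under these assumptions the result agrees with the pooled value reported in the text, which lends some support to the reading of the table columns adopted here.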

The test was repeated with all divorced 
couples (including those whose partners were 
single children) encountered among some 
2,300 families studied in Nuremberg and Zu- 
rich under the sponsorship of the Deutsche 
Forschungsgemeinschaft. The 2,300 fami- 
lies provided the expected frequency val- 
ues. Of 108 divorced couples whose data 
were complete, 18 showed high complemen- 
tarity of their sibling positions (expected 
frequency 32), 52 couples displayed medium 
to low complementarity (expected frequency 
49), and 38 couples displayed the lowest 
degree of complementarity (expected frequency 
27; χ² = 10.8, df = 2, p < .01). In 
other words, low complementarity of sibling 
positions of partners is more frequent and 
high complementarity is less frequent among 
divorced couples than in the population of 
marriages at large. 
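
The chi-square value just reported follows directly from the observed and expected frequencies given in the text. The short Python sketch below, which is not part of the original article and uses a hypothetical function name, computes the Pearson goodness-of-fit statistic for the three classes of complementarity.

```python
def chi_square_goodness_of_fit(observed, expected):
    """Sum of (O - E)^2 / E over all classes."""
    return sum((o - e) ** 2 / e for o, e in zip(observed, expected))


# High, medium to low, and lowest complementarity among 108 divorced couples
observed = [18, 52, 38]
expected = [32, 49, 27]  # derived by the author from the 2,300 families

chi2 = chi_square_goodness_of_fit(observed, expected)
print(round(chi2, 1))  # 10.8, with df = 3 - 1 = 2
```

Most of the statistic comes from the first and third classes, that is, from the deficit of highly complementary matches and the surplus of matches involving a single child.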

Even among divorced couples themselves, 
those who held out longer before the divorce 
occurred should have greater frequencies of 
high complementarity of sibling positions 
than those couples who were divorced early 
in their marriage. This is what the duplication 
theorem would lead one to expect. It 
predicts more permanence for those relation- 
ships that resemble early relationships more 
closely. Dividing the divorced couples that 
had provided adequate data into those who 
had been married for less than 10 years and 
those who had been married for 10 years 
or more before they were divorced yielded 
the frequency distributions for different de- 
grees of complementarity of sibling positions 
shown in Table 2. 


TABLE 2 

DURATIONS OF MARRIAGE AMONG DIVORCED 
PARENTS IN RELATION TO COMPLEMEN- 
TARITY OF THEIR SIBLING POSITIONS 

                                  No. of divorced      No. of divorced
Sibling positions of parents      couples married      couples married
                                  for less than        for 10 years
                                  10 years             or more
High complementarity                  2 (7)               13 (8)
Medium to low complementarity        22 (20)              20 (23)
Lowest complementarity               17 (14)              14 (16)

Note.—Expected frequencies are in parentheses. χ² = [illegible]; when corrected for continuity: [illegible]. 


As can be seen, the prediction is con- 
firmed. Complementarity of sibling position 
is a factor in determining how long couples 
stay together. The higher the comple- 
mentarity of their sibling positions, the 
greater the likelihood of postponement of 
ultimate divorce. 
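
The comparison in Table 2 can be sketched as a test of independence between complementarity class and duration of marriage. This is an illustration under the assumption that the expected frequencies were derived from the row and column totals of the table itself; the source does not state how they were obtained, and values computed this way need not match the printed, rounded ones exactly. The function names are hypothetical.

```python
def expected_under_independence(table):
    """Expected cell counts from row and column totals (row total * column total / N)."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)
    return [[r * c / n for c in col_totals] for r in row_totals]


def chi_square_independence(table):
    """Pearson chi-square statistic and degrees of freedom for a two-way table."""
    expected = expected_under_independence(table)
    statistic = sum(
        (o - e) ** 2 / e
        for obs_row, exp_row in zip(table, expected)
        for o, e in zip(obs_row, exp_row)
    )
    df = (len(table) - 1) * (len(table[0]) - 1)
    return statistic, df


# Observed counts from Table 2.
# Rows: high, medium to low, lowest complementarity.
# Columns: married less than 10 years, married 10 years or more.
table_2 = [
    [2, 13],
    [22, 20],
    [17, 14],
]

statistic, df = chi_square_independence(table_2)
print(f"chi-square = {statistic:.2f}, df = {df}")
```

With 2 degrees of freedom the conventional .05 critical value is 5.99, so a statistic above that value is consistent with the conclusion drawn in the text.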

Of 108 divorced parents, 29 had had one 
(or more) children born out of wedlock be- 
fore they married. The expected frequency 
based on the number of families in the sam- 
ple of the population at large that had had 
at least one child born out of wedlock would 
be eight. The probability of such a frequency
difference is very small (p < .001).
It appears that those parents whose mar- 
riages end in divorce are also more likely to 
have been trapped into their marriage to 
begin with by the arrival of children. Had 
it not been for those children, many of them 
may never have married. Note that children 
“on the way” at the time of marriage have 
not been included in these considerations. 
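
How small that probability is can be illustrated with an exact binomial tail. This is a sketch under an assumption: the source does not say which test was used, so the binomial model (each of the 108 couples having the population rate of 8/108) is swapped in here purely for illustration.

```python
from math import comb


def binomial_upper_tail(k, n, p):
    """P(X >= k) for X ~ Binomial(n, p), summed exactly term by term."""
    return sum(comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(k, n + 1))


n_couples = 108   # divorced couples in the sample
observed = 29     # couples with a child born before marriage
expected = 8      # expected count from the population at large

p_null = expected / n_couples  # assumed population rate under the null hypothesis
tail = binomial_upper_tail(observed, n_couples, p_null)
print(f"P(X >= {observed}) = {tail:.2e}")
print(tail < 0.001)  # True
```

Under this model the observed count lies more than seven standard deviations above its expectation, which agrees with the reported p < .001.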

At any rate, those 29 couples constitute a 
subsample of all divorces and the author 
could check with them again to see whether 
they show lower frequencies of high com- 
plementarity of sibling positions than the 
population of marriages at large. Among 
those 29 divorced couples who had had a 
child born out of wedlock before they married,
5 showed high complementarity (expected 9),
6 medium to low complementarity
(expected 13), and 18 lowest complementarity
(expected 7; χ² = 22.9, df = 2, p <
.001). In contrast, the distribution of those
parents from intact marriages who had had 
at least one child before they were married 
over the three classes of complementarity of 
sibling position of partners did not differ 
significantly from chance. 

Summarizing, it was found that divorced
couples showed lower degrees of comple- 
mentarity of their sibling positions than the 
population at large. Among divorced 
couples, those who divorced early showed 
lower degrees of complementarity of sibling 
position than those getting their divorce 
late. Those among the divorced couples who 
had had at least one child before they mar- 
ried showed lower degrees of complementar- 
ity than the population at large. Current 
long-range social relationships such as mar- 
riage do indeed tend to be less enduring and 
less successful, other things being equal, the 
less they resemble early (intrafamilial, in 
particular sibling) relationships for the part- 
ners involved. They end in divorce, do so 
early, and may have been precipitated in the 
first place by the unintended birth of a child. 

Indirect evidence suggests that more im- 
pressive results could probably have been 
obtained if those divorces that had not led 
to any children could also have been included 
in this study. Since all 2,300 couples studied 
were selected via schools, that is, their child- 
ren, by definition there were no childless 
couples in the sample. 

Divorced couples with partners who had 
been single children were included in this 
study. Hence it is more obvious than in 
the previous comparison of 16 divorced and 
16 married couples that number of siblings 
of partners may itself be among the deter- 
minants of marital success. In order to test 
whether number of siblings of divorced part- 
ners differed from number of siblings of mar- 
ried partners in the population at large, the 
mean numbers of siblings of the partners,
without regard to the partners’ sexes, were
computed for the three classes of complementarity
of sibling positions. Table 3
shows the results. 

As can be seen, the mean numbers of siblings
of the partners to divorced marriages
are consistently smaller than the mean numbers
of siblings of married partners at large.
In order to test for significance, classes of 
no sibling, one to three siblings, and four 
or more siblings were formed and the fre- 
quencies of divorced partners in each class 
compared with the frequencies of partners 
taken from the entire sample of the population
(2,300 families). Table 4 shows the results.


TABLE 3

NUMBERS OF SIBLINGS OF MARRIAGE
PARTNERS IN RELATION TO COMPLEMENTARITY
OF THEIR SIBLING POSITIONS

                               | Partners from  | Partners from
Sibling positions of partners  | [illegible]    | divorced
                               | marriages (a)  | marriages (b)
High complementarity           | 4.53           | 4.08
Medium to low complementarity  | 2.54           | 2.08
Lowest complementarity         | 2.54           | 2.30

Note. In marriages of lowest complementarity of sibling
positions (at least one partner being a single child), the average
number of siblings refers to the nonsingle child among the
partners.
(a) N = 2,174 (4,348 partners).
(b) N = 108 (216 partners).

It is apparent that divorced partners do 
come from somewhat smaller sibling con- 
figurations than would be expected by 
chance. It has already been suggested that 
larger numbers of siblings make it more 
likely for a partner to have a sibling rela- 
tionship among them that is complementary 
to that of a given potential spouse. In order 
to test what part sibling role complementarity 
plays regardless of the number of the part- 
ners' siblings, the latter should be held con- 
stant. This was accomplished by selecting 
those divorced couples whose partners had 
each had one sibling only, plus those couples 
one of whose partners had had one and the 
other two siblings. 

It was found that of 31 couples thus se- 
lected, 5 showed high complementarity (ex- 
pected frequency 11) and 26 of them medium 
to low complementarity (expected frequency 
20; χ² = 5.4, df = 1, p < .05). Obviously,
complementarity of sibling positions of partners
is effective regardless of number of
siblings of partners. Marriage partners who
had both come from two-children families
or from a two-children and a three-children
family displayed the same trend as did all
108 divorced couples: a lower incidence of
high complementarity and a higher incidence
of medium to low complementarity of sibling
positions than marriage partners of the population
at large.


TABLE 4

NUMBER OF SIBLINGS OF DIVORCED PARTNERS

          |                 No. of siblings
Partner   | No sibling | 1 to 3 siblings | 4 or more siblings
Male      | 24 (15)    | 62 (61)         | 22 (32)
Female    | 23 (16)    | 66 (60)         | 19 (32)

Note. Expected frequencies are in parentheses.
Male partners: χ² = 8.5, df = 2, p < .05;
female partners: χ² = [illegible], p < .05.

Another indicator of the success of a re- 
lationship such as marriage would be chil- 
dren. They come about as a resultant of 
many determinants. Besides ethnic, relig- 
ious, socioeconomic, political, geographic, 
and educational factors that influence the 
decisions of couples to have, or at least to 
risk the prospect of, children, we would add 
size of the partners’ original families and 
the relative ages of the partners’ parents. 
The duplication theorem would suggest that 
partners create families in accordance with 
their early experiences of family life and 
family relations. There is statistical evi- 
dence to this effect that has been reported 
elsewhere (Toman et al., 1967). 

But, other things being equal, we might 
still argue that more children mean more 
success of the marriage relationship. Unhappy
parents would tend to have fewer
children or none, or to wish they had none 
and decide at least to have no more. Happy 
parents, in turn, would tend to be able to 
cope with their children better and to be in- 
clined to have more of the same. 

If this were indeed so, as seems likely by 
common sense and ample though anecdotal 
clinical evidence, let us check again whether 
complementarity of sibling position according
to the duplication theorem (supposedly one
of a number of prerequisites for marital success)
is reflected, among other things, in the
number of children that parents have. Do
parents whose sibling positions are of high
complementarity have more children than
parents whose sibling positions are of low
complementarity? Table 5 gives the answer.


TABLE 5

NUMBER OF CHILDREN IN FAMILY AS RELATED
TO COMPLEMENTARITY OF THE PARENTS’
SIBLING POSITIONS

                               |          No. of children
Sibling positions of parents   | 1          | 2–4        | 5 and more
High complementarity           | 110 (146)  | 501 (482)  | 71 (53)
Medium to low complementarity  | 205 (222)  | 749 (732)  | 80 (81)
Lowest complementarity         | 182 (129)  | 388 (424)  | 30 (47)

Note. Expected frequencies are in parentheses. χ² = 48.4,
df = 4, p < .01.

As can be seen, parents of high comple- 
mentarity of sibling position had indeed sig- 
nificantly more children than expected by 
chance, whereas parents one or both of whom 
had been single children (i.e., parents of the 
lowest degree of complementarity) had sig- 
nificantly fewer children. Broadly speaking, 
high complementarity of sibling positions of 
parents, on the average, makes for “half a 
child” more than does lowest complemen- 
tarity. Since the average number of children 
in the families studied was approximately 
2.7, this quantity of “half a child” is not
negligible. 
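
To see which cells of Table 5 carry the effect, the chi-square statistic can be split into its per-cell contributions. This is an illustrative sketch using the observed and expected frequencies printed in Table 5; the middle column label ("2-4 children") is a reading of a damaged header and should be treated as an assumption, as should the helper name.

```python
ROWS = [
    "High complementarity",
    "Medium to low complementarity",
    "Lowest complementarity",
]
# The middle label is an assumed reading of a garbled column header.
COLS = ["1 child", "2-4 children", "5 or more children"]

OBSERVED = [[110, 501, 71], [205, 749, 80], [182, 388, 30]]
EXPECTED = [[146, 482, 53], [222, 732, 81], [129, 424, 47]]  # as printed


def cell_contributions(observed, expected):
    """Per-cell terms (O - E)^2 / E of the Pearson chi-square statistic."""
    return [
        [(o - e) ** 2 / e for o, e in zip(obs_row, exp_row)]
        for obs_row, exp_row in zip(observed, expected)
    ]


contributions = cell_contributions(OBSERVED, EXPECTED)
total = sum(sum(row) for row in contributions)

for row_name, obs_row, exp_row, con_row in zip(ROWS, OBSERVED, EXPECTED, contributions):
    for col_name, o, e, c in zip(COLS, obs_row, exp_row, con_row):
        direction = "more" if o > e else "fewer"
        print(f"{row_name:30s} {col_name:20s} {direction:5s} than expected, contribution {c:5.2f}")

print(f"chi-square = {total:.1f}")  # chi-square = 48.4
```

The largest single contribution comes from the surplus of one-child families among parents of lowest complementarity, followed by the deficit of one-child families among parents of high complementarity.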

Marriages at an early age could be ex- 
pected to be based on less experience with 
potential partners than marriages at a later 
age. One could assume that persons select- 
ing a partner at a later age have had potential 
access to all those partners whom they might 
have selected at an earlier age, plus those 
whom they have come to know since then. 
Hence if people go by the duplication theo- 
rem at all, wittingly or unwittingly, when 
selecting permanent mates, those with more 
experience should heed it better than those 
with little experience. In other words, peo- 
ple marrying relatively late should be more 
likely to choose partners of complementary 
sibling positions than would be those marry- 
ing relatively early. Dividing partners into 
those married below the median age of mar- 
riage and those married at or above the 
median age, we obtain Table 6. 

As can be gathered from Table 6, male 
and female partners alike tended to choose 
partners of complementary sibling position 
more often than chance when they had mar- 
ried late. They tended to choose such part- 
ners less frequently than chance when they 
had married early. 
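
The significance levels attached to Table 6 can be checked without statistical tables, because a chi-square variate with 2 degrees of freedom has the closed-form upper-tail probability exp(-x/2). The sketch below applies that identity to the two statistics reported with Table 6 (18.1 for husbands and 30.9 for wives, each with df = 2); the function name is illustrative.

```python
from math import exp


def chi2_upper_tail_df2(x):
    """P(X > x) for a chi-square variate with exactly 2 degrees of freedom."""
    if x < 0:
        raise ValueError("a chi-square statistic cannot be negative")
    return exp(-x / 2.0)


# Chi-square values reported with Table 6, each with df = 2.
reported = {"husband's age at marriage": 18.1, "wife's age at marriage": 30.9}

for label, statistic in reported.items():
    p_value = chi2_upper_tail_df2(statistic)
    print(f"{label}: chi-square = {statistic}, p = {p_value:.2e}")
```

Both tail probabilities fall well below .001. The shortcut holds only for df = 2; other degrees of freedom require the incomplete gamma function.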

TABLE 6

AGES AT MARRIAGE IN RELATION TO COMPLEMENTARITY OF SIBLING POSITIONS

                               | Husband's age at marriage (a) | Wife's age at marriage (b)
Sibling positions of parents   | Less than     | 27 years      | Less than     | 24 years
                               | 27 years      | or more       | 24 years      | or more
High complementarity           | 275 (311)     | 387 (351)     | 281 (322)     | 381 (340)
Medium to low complementarity  | 453 (454)     | 513 (512)     | 459 (470)     | 507 (496)
Lowest complementarity         | 288 (251)     | 248 (285)     | 313 (261)     | 223 (275)

Note. Expected frequencies are in parentheses.
(a) χ² = 18.1, df = 2, p < .001.
(b) χ² = 30.9, df = 2, p < .001.


One component of the decision of parents
to have another child may be the configuration
of children that they have had thus far.
This may be so regardless of complementarity
or noncomplementarity of sibling positions
of parents, and may tend to confound
the efficacy of the duplication theorem.

As it turned out, parents did indeed re- 
spond differentially to the sexes of their 
children born to them thus far. If they had 
boys first or boys only, they were more apt 
to have still another child than were those 
who had girls first or girls only (see Table 
7). If their first child was a boy, b, they 
had, on the average, 1.51 more children. If 
their first child was a girl, s, they had, on 
the average, 1.38 more children. If their 
first two children were boys, bb, they had 
1.07 more children ; if their first two children 
were girls, ss, they had merely .71 more chil- 
dren. If their first three children were boys, 
bbb, they had still 1.06 more children; if 
their first three children were girls, sss, they 
had only .50 more children, etc. 
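
The averages quoted above follow directly from the frequency rows of Table 7. The sketch below recomputes them for the one-child and two-child configurations, whose rows are legible. One assumption is made and labeled in the code: the open-ended last column ("5 etc.") is counted as exactly 5 subsequent children.

```python
# Families with 0, 1, 2, 3, 4, and "5 etc." subsequent children (Table 7).
# Assumption: the open-ended last class is counted as exactly 5 children.
FREQUENCIES = {
    "b": [72, 108, 59, 38, 19, 9],
    "s": [73, 117, 67, 20, 16, 8],
    "bb": [54, 29, 16, 18, 2, 1],
    "bs": [54, 28, 22, 3, 6, 1],
    "sb": [54, 36, 9, 10, 2, 1],
    "ss": [63, 34, 10, 5, 2, 1],
}


def mean_subsequent_children(frequencies):
    """Weighted mean of 0..5 subsequent children, weighted by family counts."""
    families = sum(frequencies)
    children = sum(k * f for k, f in enumerate(frequencies))
    return children / families


for configuration, frequencies in FREQUENCIES.items():
    mean = mean_subsequent_children(frequencies)
    print(f"{configuration:>2}: {sum(frequencies)} families, mean = {mean:.2f}")
```

Under that assumption the recomputed means agree with the printed ones (1.51, 1.38, 1.07, .96, .87, and .71), and the row totals agree with the printed numbers of families.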

If one assumes that parents try for more 
of the same, if they like what they got so far, 
one could argue that they wanted boys a little 
more than they did girls. This would not 
preclude that they may not have been de- 
lighted to have a girl after a series of boys, 
but trying for another child they would, of 
course, risk getting still another boy. They 
seemed more willing to take that risk than 
did the parents of a succession of girls who, 
in turn, may have been equally or even more 
delighted to get a boy. 

One could argue otherwise: Parents are
more satisfied by a succession of girls and
need no more children, particularly since it
could also be boys. Be that as it may, the
differential effect that particular configurations
of children have on the decisions of their
parents to have more children or not has been
demonstrated.

TABLE 7

NUMBER OF SUBSEQUENT CHILDREN IN
FAMILY IN RELATION TO DIFFERENT SEX
DISTRIBUTIONS AMONG CHILDREN
BORN THUS FAR

Children  | Frequencies of families by         | Total   | Average no.
born      | no. of subsequent children         | no. of  | of subsequent
thus far  |  0 |   1 |  2 |  3 |  4 | 5 etc.   | families| children
b         | 72 | 108 | 59 | 38 | 19 |  9       | 305     | 1.51
s         | 73 | 117 | 67 | 20 | 16 |  8       | 301     | 1.38
bb        | 54 |  29 | 16 | 18 |  2 |  1       | 120     | 1.07
bs        | 54 |  28 | 22 |  3 |  6 |  1       | 114     |  .96
sb        | 54 |  36 |  9 | 10 |  2 |  1       | 112     |  .87
ss        | 63 |  34 | 10 |  5 |  2 |  1       | 115     |  .71

[The rows for configurations of three children are
largely illegible in the source. The surviving entries
read: (label lost) 17; (label lost) 15; bss 17; sbb 13;
bbs + bsb 29; ssb + sbs 47. The table note is also
illegible.]


Some such argument could perhaps be
made in favor of all the data and findings
reported here. The reader may have little
trouble accepting the incidence of divorce or
the short duration of marriage before divorce
occurs as some indication of failure of
the relationship. He may be less apt to accept
number of children born to a couple as
an indicator of relative marital success, and
above-average age of partners at the time of
marriage as an indicator of their greater maturity
when choosing a spouse. What can
hardly be denied are the deviations of frequency
distributions from chance that were
observed and reported and that had not been
known before. One may say in favor of the
duplication theorem that it is at least at no
great loss to explain those various deviations
from chance in a somewhat consistent
manner.

To show the scope of this consistency, it 
should be mentioned that the duplication
theorem has been shown to apply also to 
friendships among persons of the same sex. 
Those friends who were of complementary 
sibling-age-ranks developed longer lasting 
friendships, particularly when one or both of 
them had come from monosexual sibling con- 
figurations, than did those friends who were 
of identical or similar sibling-age-rank (see 
Toman, 1969).

The duplication theorem applies also to 
the parent-child relationship. It could be 
shown that the parent and same-sex child
identify more readily with one another if 
they hold similar or identical sibling posi- 
tions. Parent and opposite-sex child, on the 
other hand, interact better with one another 
if their sibling positions are complementary 
to each other (Toman, 1962a, 1965, 1969). 

Finally, the duplication theorem has been 
found to apply also to losses of persons suf- 
fered in the original family. They create ex- 
pectations of further loss and, as a conse- 
quence, poorer, more haphazard, choices of 
partners, friends, and spouses, more conflict- 
laden relationships (in the present case de- 
fined in terms of incompatibility of sibling 
positions of partners) and greater likelihoods 
of new losses than expected by chance (see 
Toman, 1962a, 1965, 1969). Among the two 
samples of the normal population (taken 
from Nuremberg and Zurich), it was found 
that parents of both sexes who had suffered 
the loss of a parent themselves took signifi- 
cantly longer than expected by chance to 
have children of their own. More often than 
parents without such losses, they were either
unmarried or divorced. This was significant
for fathers regardless of whether they had
lost their father or their mother, for mothers 
only when they had lost their fathers (To- 
man et al., 1967). 


Discussion 


These effects of family constellation char- 
acteristics (and other effects reported else- 
where) are only one among several groups of 
effects on long-range social behavior. They 
are not overwhelmingly strong themselves, 
but wherever family constellation character- 
istics known to be effective and independent 
of each other occur in combinations, their 
influence becomes more pervasive. The fact 
that they could be demonstrated at all may be 
considered a surprise in view of the fact that 
these effects extend over time spans of some 
20 years. 

These effects do not bridge the time spans 
by magic, to be sure. The sibling roles 
learned in the original family, first at a time 
when the child experiences social contacts 
other than within the family only rarely and 
briefly, may be applied in new social contexts 
such as on playgrounds, around church, in 
school and clubs, etc. By the time those 
other contexts begin to provide more substantial
and extensive opportunities for experience,
patterns for evaluating those opportunities
have already been set. These
patterns are capable of modification and cor- 
rection, but any such changes will be harder 
to accomplish, the longer and more exclu- 
sively those patterns have already prevailed. 
Individuals learn to generalize from family 
members to playmates and classmates, col- 
leagues, acquaintances, friends, and ulti- 
mately those friends whom they decide to 
keep or marry. If these generalizations hap- 
pen to work in new contexts, the patterns of 
evaluation and social behavior are being con- 
firmed. If they lead to negative experiences 
repeatedly or even once, but drastically or 
very unexpectedly, social avoidance behavior 
will result and perpetuate itself, since
avoidance precludes further and possibly
corrective experiences.

One may argue that the duplication theorem
would plausibly hold where family circumstances
were favorable. Under unfavorable
circumstances, especially when due to
conflicts among family members and losses
of family members, the individual should
fare better when searching for other or even
opposite opportunities for permanent social
relationships than those encountered in his 
family. Persistent anecdotal evidence and 
some statistical data suggest that even in- 
dividuals coming from unfavorable circum- 
stances tend to do better and to be more suc- 
cessful in new social contacts when they do 
not turn to the opposite of what they had at 
home. Mild changes for the better appear 
to be more trustworthy than radical changes 
for the better. 

Socioeconomic strata also exert influences 
on permanent social relationships. Families 
of the lower classes tend to marry sooner, to 
come from larger families, and to have more 
children, but also more frequently to suffer 
losses such as absences or deaths of fathers, 
separations from mothers, losses of siblings, 
etc. This includes also temporary losses 
(absences of family members for months or 
years until they eventually reappear) and 
partial losses (discoveries, e.g., that father 
has been in jail, that he is a drunkard, that 
mother never went to high school, or that she 
has had and given away a child before she 
was married, etc.). 

The sample of the population studied can
be called urban middle class including small 
percentages of lower-middle and of upper- 
middle class. The samples of the schools 
from which the parents were taken had been 
matched by type, size, and relative location. 
There is no denying, though, that we have
studied only Western European urban indus- 
trial populations. In rural areas, in other 
geographic regions, under different ethnic or 
political conditions, some of the influences 
we have traced may be contaminated by 
other determinants. Studies of smaller sam- 
ples of urban populations in the United 
States, however, showed very similar results 
to those reported here. 

With increasing size of a sibling configuration,
there are more middle siblings and
hence more multiple sibling roles. More
pronounced roles (oldest, youngest, only
boy, only girl, etc.) are in the minority.
Consequently, all sibling roles may appear 
somewhat blurred in those larger sibling 
configurations. With the overall trend of 
our times toward urbanization and smaller 
families, however, sibling roles may tend to 
become generally more conspicuous and the 
present findings relevant for increasing num- 
bers of family constellations. 

The study presented is in the tradition of 
the work of Bowlby (1951), Glueck and 
Glueck (1959), Hollingshead and Redlich 
(1958), Strotzka (1965), and others active 
in the area of the social psychology of long- 
range interpersonal relationships and of so- 
cial psychiatry. The data and findings are 
of a type that may have escaped the notice 
of factor-analytic personality research (e.g., 
Brill & Beebe, 1955; Cattell, 1964; Eysenck,
1961), even though this multivariate technique
of analysis requires as much inclusiveness
as possible in its respective primary
data. 


COMPONENTS OF ATTENTION ¹


MICHAEL I. POSNER ² AND STEPHEN J. BOIES

University of Oregon


The study of human attention may be divided into three components. These
are alertness, selectivity, and processing capacity. This paper outlines
experimental techniques designed to separate these components and examine
their interrelations within comparable tasks. It is shown that a stimulus
may be used to increase alertness for processing all external information, to
improve selection of particular stimuli, or to do both simultaneously.
Development of alertness and selectivity are separable, but they may go on
together without interference. Moreover, encoding a stimulus may proceed
without producing interference with other signals. Thus, the contact
between an external stimulus and its representation in memory does not appear
to require processing capacity. Limited capacity results are obtained when
mental operations such as response selection or rehearsal must be
performed on the encoded information.


The subject of attention is a broad one.
There are many definitions of it and subcategories
included under it. In a recent book,
Moray (1970) has proposed at least six different
meanings of the term attention in current
psychological research. It would be
premature to propose any single taxonomy
or exhaustive set of categories for the term.
However, both in looking at the various definitions
that Moray proposes and in reviewing
the literature, there do appear to be three
major topics under which studies of attention
might be grouped.

First is the notion of alertness. Maintaining
attention in the sense of alertness ³ is
presumably involved in human ability to
perform in long, boring tasks like those
that psychologists design to study vigilance
(Mackworth, 1970). The ability to develop
and maintain an optimal sensitivity to external
stimulation is also studied when subjects
(Ss) receive a warning to prepare
themselves to take in information. The foreperiod
of a reaction time task may be considered
as a miniature vigilance situation
where alertness must be developed rapidly
and maintained over a relatively brief interval.
Recent studies have investigated the
time course (Bertelson, 1967) and the brain
activity (Karlin, 1970; Näätänen, 1970;
Posner & Wilkinson, 1969) during this interval.
It appears that there is some similarity
between the brain processes which take
place during the foreperiod and those involved
in vigilance tasks (Wilkinson &
Haines, 1970). These studies provide some
rationale for including alertness as a general
component of attention.

¹ Portions of this paper were presented in an invited
address to the Western Psychological Association,
Los Angeles, April 1970. The research included
here was sponsored by National Science
Foundation Grants GB 5960 and 21020 and by the
Advanced Research Projects Agency of the Department
of Defense monitored by the Air Force
Office of Scientific Research under Contract No.
F44620-67-C-0099, all to the University of Oregon.
The authors are grateful to Barbara Lewis and
Larry Moore for help in running the subjects.

² Requests for reprints should be sent to Michael
I. Posner, Department of Psychology, University
of Oregon, Eugene, Oregon 97403.

³ The term alertness is used here rather than
arousal to avoid the emotional connotation of many
uses of the latter term. The degree of separation
of the terms is not clear at this point.

A second sense of attention is the ability 
to select information from one source or of 
one kind rather than another (Egeth, 1967;
Treisman, 1969). Studies of selective attention
may require that Ss report information
from a particular sensory modality, spatial 
location, or content (letters rather than dig- 
its). Broadbent (1958) has argued that at 
least some aspects of selection are performed 
by filtering mechanisms which block out or 
attenuate input. Other theories have stressed 
additional operations which are performed 
on information of particular importance 
(Deutsch & Deutsch, 1963; Norman, 1968).
These views bear a relationship to physiological
models of attention that have stressed
either a tuning process which blocks input
from unselected sources (Hernandez-Peón,
1966) or a general alerting reaction which
enhances input from whatever source (Näätänen,
1970). The former view is more in
line with filtering mechanisms, while the lat- 
ter fits better with the idea of additional 
operations performed on highly processed in- 
put. One way of viewing this issue is to ask 
whether Ss can prepare for signals in a way 
which is not selective. If they prepare for a 
visual signal, is the capacity to process audi- 
tory signals decreased or enhanced over an 
unprepared state? Thus, the separability of 
alertness and selection as internal processes 
has important consequences for a theory of 
attention. 

A third sense of attention relates to the 
idea of a limited central processing capacity 
(Broadbent, 1958). It is well known that 
Ss have difficulty in handling two tasks 
simultaneously, even when they wish to do 
so. Signals which arrive during the reaction 
time (RT) period tend to be delayed (Wel- 
ford, 1968). Yet it is not clear what aspects 
of signal processing are subject to the single- 
channel limitation. Some have argued that 
the single channel extends to almost all processing
(Welford, 1968), while others have
seen it as applying mainly to response ex- 
ecution (Reynolds, 1964). A limited capac- 
ity mechanism suggests that any two opera- 
tions requiring it will interfere with each 
other. Thus, a study of the ability of Ss to 
time share mental operations can lead to an 
understanding of what aspects of processing 
require access to the limited capacity system. 

Most recent articles related to attention in- 
volve one of these three components (see 
e.g., Koster, 1969; Sanders, 1967, 1970).
Unfortunately, the experiments used to study 
each of the components are different and 
there has been little attempt to develop tasks 
which allow each of them to be separated and 
interrelated. Moray (1970) suggests the 
importance of doing so as follows: 


It might well be, for example, that the relation be- 
tween selective listening and arousal is such that 
arousal level acts as a parameter which will alter 
the over-all efficiency of selection and rejection as 
it varies. But, no systematic investigation has so 
far been carried out [p. 85]. 


This paper is an attempt to develop means
for studying the separation and interrelationship
of these components of attention at the
behavioral level. 

Consider first a task which involves all 
three components mixed in a fairly complex 
way. The Ss see a single letter for a brief 
period of time, followed after a variable in- 
terval by a second letter. The task is to re- 
spond as rapidly as possible "same" if the 
two letters have the same name; otherwise,
"different" (Posner, 1969; Posner, Boies,
Eichelman, & Taylor, 1969). The first letter
has two functions. It serves as a warning
signal and also tells the S what letter he is 
looking for. Thus, one might expect it to 
increase alertness and to introduce some bias 
of the perceptual system toward tests related 
to the configuration of the first letter. The 
bias might involve, for example, activation of 
trace systems related to the physical con- 
figuration of the letter (Posner, 1969). 
What happens during the interval between 
the two letters depends greatly on how the 
attention (processing capacity) of S is di- 
rected. For example, if he knows that all 
matches will be physically identical, he main- 
tains his speed on physical matches much 
better than if some matches must be based on 
the name (Posner, 1969). If the first letter 
disappears, there is a rapid increase in times 
for a physical match, while name matches re- 
main relatively constant, suggesting that Ss 
keep the letter name in mind. However, if 
the physical form remains present during the 
interval, physical matching remains constant 
while name matches increase in efficiency 
until they resemble the physical matches.
This suggests that Ss generate a visual code 
of the opposite case (Boies, 1969; Posner, 
1969). These results indicate that Ss are 
very flexible in the coding they use for let- 
ters and similar visual material (Cohen, 
1969; Tversky, 1969), but their choices are 
constrained by the rivalry between physical 
and name codes for Ss' limited processing 
capacity. 
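
The distinction between physical and name matches can be made concrete with a small sketch of the response rule. It assumes, as the mention of the "opposite case" suggests, that letters sharing a name differ physically in being upper versus lower case; the function and its return labels are hypothetical and are not part of the experimental software.

```python
def classify_pair(first, second):
    """Return (response, basis) for a letter pair in a name-match task."""
    if len(first) != 1 or len(second) != 1:
        raise ValueError("each stimulus must be a single letter")
    if first == second:
        # Identical in form, hence also identical in name.
        return "same", "physical match"
    if first.lower() == second.lower():
        # Same name, different physical form (assumed: upper vs. lower case).
        return "same", "name match"
    return "different", "no match"


for pair in [("A", "A"), ("A", "a"), ("A", "B")]:
    response, basis = classify_pair(*pair)
    print(f"{pair[0]} then {pair[1]}: respond '{response}' ({basis})")
```

A physical match can be decided from the visual form alone, whereas a name match requires the name of the first letter to be available, which is why the two kinds of "same" response can follow different time courses across the interval.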

This brief description of the letter-match- 
ing sequence suggests that it may be a useful 
task for separating the three components of 
attention described above. The experiments 
reviewed in this paper are an effort to do so.
Moreover, they attempt to connect the operations
of this experimental paradigm to the
three theoretical components of attention
outlined above. Two major questions are 
considered. First, Can the warning function 
and the selective function of the first letter be 
separated? If so, how do the two functions 
relate? Second, What is the relationship 
between these two components of attention 
and central processing capacity? 


PREPARATION 


Many investigators have studied the period 
of time between a warning signal and a sig- 
nal related to the required response (Bertel- 
son, 1967). Of primary interest are studies 
where S has little uncertainty about when the 
response-related signal will occur. This is 
achieved by using blocks of trials with a fixed 
warning interval. Whether the response 
task involves reaction time  (Bertelson, 
1967), signal detection (Egan, Greenberg, 
& Schulman, 1961), or other types of pro- 
cessing (Leavitt, 1969), the results are simi- 
lar. Performance is worse with no warning 
and improves as the warning interval in- 
creases to some optimal value. This value 
is most frequently about .2–.5 second. An
interval longer than optimal tends to produce
a decrease in performance, but this is usually
not as pronounced, particularly when feedback
is used to keep motivation high.

Figure 1 indicates functions obtained using 
two-letter matching tasks (Posner & Wilkin- 
son, 1969). Both tasks involved a warning 
signal followed after a foreperiod by a pair 
of letters. The foreperiod was constant for 
a block of trials but varied between blocks. 
The Ss had a single key which they pressed 
when the letter pair was defined as “same” 
for that task. The solid line is for a task 
which required S to press a key if the letters 
were physically identical, while the dotted 
line required him to press the key if both 
letters were either vowels or consonants. 
Although RT differs greatly in the two tasks, 
the time course of preparation is very similar 
and like that described in the studies cited 
above. 


ENCODING 


If S is provided with part of the informa- 
tion he needs to perform a speeded task, his 
RT when he receives the remainder will be 
reduced (Cohen, 1969; La Berge, Van 
Gelder, & Yellott, 1970; Leonard, 1958). 
If the time between the two signals is varied,
it is possible to discover how long it takes to
encode the first signal in a form which is optimal
for processing the second. If the first
signal is a letter that must be matched against
a second letter, the improvement should represent
the encoding of the information from
the first letter needed for the match. In
order to reduce the involvement of general
preparation in such functions, a warning signal
was given .5 second before the first letter.
It was supposed that preparation could be
maintained during the time between presentation
of the first letter and the presentation
of the second letter.

(Figure caption, partly lost: ... interval collapsed across four days; after Posner & Wilkinson, 1969.)


Method 


Experiments were run on a PDP-9 computer.
Displays were presented on a Tektronix 503 scope
using a P 11 phosphor which decays to .01 of
its energy in 20 milliseconds (msec.). The warning
signal was a cross presented .5 second before
the first letter. The first letter was presented
directly below the warning signal and remained
on until S's response. After a variable interval
(interstimulus interval [ISI]), the second letter
was presented adjacent to the first. At a distance
of 16 inches the two letters were foveal (3 degrees
of visual angle). The time between the two letters
was varied between blocks of trials.

Two separate experiments were run. In the first
experiment, S responded “same” if the letters were
physically identical; in the second, “same” was defined
as both vowels or both consonants. Experiment I
used only capital letters, while Experiment
II used mixed uppercase and lowercase letters.
The computer chose the letters at random from an
alphabet of 15 letters with the restriction that half
of the trials in each block be “same.” In these experiments,
Ss had two keys and were instructed to
press the left one if the letters were defined as
“same” and the right one if “different.”

In Experiment I, each of eight Ss ran in six 40-trial
blocks for each of four days. Each block contained
one of six intervals between the two letters.
The intervals were assigned randomly so that each S
had all six blocks on each day. Experiment II was
substantially the same except that each block contained
four physical matches (e.g., AA), eight
name matches (e.g., Aa), 12 vowel-consonant
“same” responses, and 24 “different” responses.

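The block composition described for Experiment II can be sketched in code. This is only an illustration under stated assumptions: the source does not list the 15 letters used, so the alphabet below is hypothetical, as are the function names and the random assignment of case and order within a pair.

```python
import random

# Hypothetical 15-letter alphabet; the source does not say which letters were used.
VOWELS = "AEIOU"
CONSONANTS = "BCDFGHJKLM"

# Trials per block reported for Experiment II (48 in all, half "same").
BLOCK_COUNTS = {"physical": 4, "name": 8, "category": 12, "different": 24}


def random_case(letter, rng):
    """Return the letter in upper or lower case with equal probability."""
    return letter.upper() if rng.random() < 0.5 else letter.lower()


def make_pair(kind, rng):
    """Build one letter pair of the requested match type."""
    if kind == "physical":      # identical in form, e.g. AA
        letter = random_case(rng.choice(VOWELS + CONSONANTS), rng)
        return letter + letter
    if kind == "name":          # same name, different case, e.g. Aa
        letter = rng.choice(VOWELS + CONSONANTS)
        pair = [letter.upper(), letter.lower()]
        rng.shuffle(pair)
        return "".join(pair)
    if kind == "category":      # different letters, both vowels or both consonants
        first, second = rng.sample(rng.choice([VOWELS, CONSONANTS]), 2)
        return random_case(first, rng) + random_case(second, rng)
    if kind == "different":     # one vowel and one consonant
        pair = [random_case(rng.choice(VOWELS), rng),
                random_case(rng.choice(CONSONANTS), rng)]
        rng.shuffle(pair)
        return "".join(pair)
    raise ValueError(f"unknown trial kind: {kind}")


def classify(pair):
    """Return the lowest level at which the pair counts as a match."""
    a, b = pair
    if a == b:
        return "physical"
    if a.upper() == b.upper():
        return "name"
    if (a.upper() in VOWELS) == (b.upper() in VOWELS):
        return "category"
    return "different"


def make_block(rng):
    """Return one shuffled block with the Experiment II composition."""
    kinds = [kind for kind, n in BLOCK_COUNTS.items() for _ in range(n)]
    rng.shuffle(kinds)
    return [make_pair(kind, rng) for kind in kinds]
```

Calling `make_block(random.Random(0))` yields 48 pairs, of which 24 call for a “same” response under the vowel-consonant instruction.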

Results 


First, it is clear from Figures 2 and 3,
as found previously (Posner & Mitchell,
1967), that the level of match has systematic
effects on the reaction time. Moreover,
the encoding function for all three
levels is strikingly similar in both experiments.
The optimal RT is at 500 msec. in
all functions. This finding appears to be
robust over the distribution of intervals, 
since a control study using intervals of 0, 1, 
50, 150, 250, and 500 showed almost the 
same function as that obtained for the first 
five points of Figure 2. The same result is 
also obtained in encoding functions discussed 
in the next section (see Figures 4 and 5). 
There is some tendency for sharpest im- 
provement in RT to occur later for more 
complex matches. Most of the improvement 
for physical matches appears to come in the 
first 150 msec. This is much less true of
vowel-consonant matches, which show a
sharp improvement between 250 and 500
msec. The extent of improvement appears
to be slightly greater for vowel-consonant
matches than for other types, but name
matches do not show much more improvement
than physical matches.

(Figure caption fragment: NI = name identity, PI = physical identity.)

In general, encoding and preparation func- 
tions appear to be rather similar. Both show 
an optimal point at about 500 msec. Both 
tend to give greater improvement for vowel- 
consonant than for physical matches. The 
next experiment is designed to compare them 
more directly. 

PREPARATION AND ENCODING 

If S is given 500 msec. to prepare, his RT
will be about optimal. If he is given information
at the end of that 500 msec. about
what he is to select, he shows a further improvement
which also goes on for about 500
msec. Are these two improvements due to

The additivity of improvement due to
preparation and encoding suggests that the
two processes can proceed in parallel without
interference. This is true even at 150
msec. when the functions are changing
rapidly. Moreover, the additivity indicates
that the amount of improvement due to encoding
is the same regardless of the initial
level of alertness.
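The additivity claim can be checked against the values in Table 2 with a few lines of arithmetic. The sketch below is illustrative only: the dictionary layout and function names are assumptions, and the Name encoding entry at 500 msec. is partly illegible in the source and is read here as 98. Under strict additivity the “both” improvement would equal the sum of the separate preparation and encoding improvements, so the gap computed below would be zero.

```python
# Improvement in RT (msec.) from Table 2, by match level and interval.
# The name/500/encoding value is partly illegible in the source; 98 is an assumed reading.
TABLE_2 = {
    "physical": {
        150: {"preparation": 58, "encoding": 53, "both": 101},
        500: {"preparation": 122, "encoding": 73, "both": 195},
    },
    "name": {
        150: {"preparation": 42, "encoding": 58, "both": 111},
        500: {"preparation": 127, "encoding": 98, "both": 201},
    },
    "vowel-consonant": {
        150: {"preparation": 7, "encoding": 124, "both": 157},
        500: {"preparation": 170, "encoding": 150, "both": 296},
    },
}


def additivity_gap(cell):
    """Observed 'both' improvement minus the sum of the separate improvements."""
    return cell["both"] - (cell["preparation"] + cell["encoding"])


def additivity_gaps(table):
    """Return the gap for every (match level, interval) cell of the table."""
    return {
        (match, interval): additivity_gap(cell)
        for match, by_interval in table.items()
        for interval, cell in by_interval.items()
    }


if __name__ == "__main__":
    for (match, interval), gap in additivity_gaps(TABLE_2).items():
        print(f"{match:16s} {interval:4d} msec.  gap = {gap:+d} msec.")
```

A gap is simply a difference of tabled means; judging whether a given gap is reliably different from zero would require the variability data, which the table does not report.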

It remains to be shown that the encoding
function is really related to specific informa- 
tion from the letter presented. A striking 
finding in the physical match encoding func- 
tion is a significant interaction between in- 
terval and type of response (same vs. differ- 
ent) F = 39. df = 2/9, p < .05. Recently a 
number of investigators have pointed out that 
the response “different” does not appear to 
be the mirror image of the response “same” 
(Bamber, 1969; Nickerson, 1965; Smith &
Nielsen, 1970; Tversky, 1969). In
experiments where there is an interval between
the two items to be matched, “same”
responses are faster than “different” responses
(Bamber, 1969; Nickerson, 1965).
Figures 7 and 8 show the data from the preparation,
encoding, and “both” functions for [...]

[...] relative efficiency in processing “same” as compared
to “different” stimuli increases as a
function of interval, but only when S has
been given the first letter. The relative increase
in efficiency of the “same” response
as compared to “different” shown in the
RTs is also reflected in the error rates. No
such change is found when the interval involves
the period between a warning signal
and the letter pair. Nor does it occur for
name or vowel-consonant level matches.
The finding that “same” and “different”
responses are identical in simultaneous presentation
has not been universal (Krueger,
1970; Posner & Mitchell, 1967). This appears
to depend on many factors. It was
suspected that one such factor was assign- 
ment of hands. To check on this, a control 
study was run which assigned six Ss “same” 
on the left and six Ss "same" on the right. 
The experiment involved ISIs of 0, 50, 150,
and 500 msec. with a 500-msec. WI. The
physical match instruction was used. The 
data for left-hand "same" Ss replicated Fig- 
ure 7 perfectly. For Ss assigned “same” 
on the right, however, "same" was faster 
even with simultaneous presentation. How- 
ever, regardless of hand assignment, the sig- 
nificant divergence over time between “same” 
and "different" responses was found (see 
also Figure 2 and Corballis, Lieberman, & 
Bindra, 1968). 

The finding that presentation of the first 
letter improves the efficiency of handling an 
identical letter suggests that the first letter 
changed S's sensitivity to a letter of identical
form. One way of conceptualizing this
is in terms of Sokolov’s neural model idea
(Sokolov, 1963). The first letter serves as
a model of what S is looking for. When the
second letter matches it, processing proceeds
rapidly; if it does not, further tests are made.
A similar view would be that the first letter
activates internal units representing the
stored information concerning that letter
(Morton, 1969; Posner, 1969), which may
then be more sensitive to identical stimulation
for some period of time. The latter idea
would presumably allow for activation of related
items necessary to handle improvements
at the name and vowel-consonant
levels, since access to past experience is
clearly needed for these matches. Both of
these ideas are consistent with recent ideas
about “same” and “different” responses to
successive stimuli (Bamber, 1969; Tversky,
1969). What is new is that the divergence
over time indicates that at least part of the
advantage of “same” is due to the encoding
of the first letter rather than in the criterion
governing the match itself.


TABLE 2

AMOUNT OF IMPROVEMENT IN REACTION TIME
FROM PREPARATION, ENCODING, AND BOTH

                        Intervals
Match                  150     500

Physical
  Preparation           58     122
  Encoding              53      73
  Both                 101     195
Name
  Preparation           42     127
  Encoding              58      98
  Both                 111     201
Vowel-Consonant
  Preparation            7     170
  Encoding             124     150
  Both                 157     296

Note. Improvement is measured by subtracting RT at
specified interval from RT at 0/0 for preparation and “both”
function and 500/0 for encoding.

a In milliseconds.

(Figure caption fragment: ... interval for “both” functions of physical match task.)


The results of this section suggest that
encoding and preparation are separable functions
with different effects on behavior.
However, it appears possible to carry on
both simultaneously as efficiently as either
one alone. In some ways this provides an
answer to Moray's question concerning
whether selectivity is more efficient for an
alerted organism than for an unalerted one.
According to these data, whether S has had
a previous warning signal does not change
the efficiency with which he encodes a letter,
at least up to the level of a name match.
Nor does the relative bias in physical matching
for “same” as opposed to “different”
differ for alerted and unalerted conditions.
These data strongly suggest that a warning
signal does not improve the process of encoding
information from the first letter.

So far, we have defined two separate functions
having to do with attention and observed
something of their relationship. It
now remains to relate each of them to the
third component of attention.


PROCESSING CAPACITY 


One sense of attention involves competi- 
tion between signals for some limited capac- 
ity system (James, 1890). The idea is that 
many tasks place demands on a central 
limited capacity system and thus tend to in- 
terfere. This basic idea has led to a number 
of experiments which have attempted to mea- 
sure the attention demands of one task by its 
interference with a secondary task (Welch, 
1898; Welford, 1968). One way of doing 
this is to use a probe as the standard second- 
ary task. For example, Näätänen (1970) 
measured the evoked potential to auditory 
clicks, which were probes occurring during 
a visual RT task. Posner and Keele (1967) 
and Ells (1969) used a simple or choice RT 
task as a probe during a movement task. 
Foss (1969) used a probe RT to study 
processing of sentences. The advantage of 
a probe RT task is that the experimenter 
can measure both the RT to primary and 
secondary tasks and thus be certain what 
effects each is having on the other. In 
studying movements, it was found that the 
probe had little effect on the primary move- 
ment task, but that the probe RT reflected 
the central processing demands of the primary
task in a very sensitive way. For
example, prior to the movement, probe time
was a function of the number of alternative
movements and was unrelated to target
width. During the movement, probe time
was a function of target width and unrelated
to number of alternatives (Ells, 1969).
These findings suggested very strongly that
probe RTs could provide a dynamic picture
of the central processing demands of primary
tasks.

The probe technique applied to the letter- 
matching sequence might indicate how prep- 
aration and encoding are related to central 
processing capacity. Of particular interest 
is whether preparation for a visual signal 
served to improve or retard RT to signals 
coming from another modality (Hernandez- 
Peón, 1966; Näätänen, 1970). Another 
question is whether encoding itself requires 
processing capacity in the sense of interfer- 
ence with a probe RT. Finally, there is the 
question of what aspects of letter matching 
seem to demand the greatest involvement of 
processing capacity. 


Method 


The same apparatus was used as described previously.
A trial consisted of a warning signal
followed after a WI by the first letter, which
remained on until the presentation, after an ISI,
of a second letter adjacent to it. After each
trial, feedback was provided on the correctness and
RT for the letter match. The values of the WI,
ISI, and intertrial interval (ITI) are given for
each experiment in Table 3. On half of the trials,
50 decibels of white noise was turned on in S's left
ear at one of eight positions in the trial. The probe
occurred equally often in each of the eight positions.
The Ss were instructed that the letter task was
primary and that they were to press their right
index finger if the letters had the same name and
their right middle finger if the names were different.
They were told to press their left index finger
whenever they heard the white noise burst. Feed- 
back on the letter-matching task did not occur until 
both responses were complete. No feedback was 
provided on the noise RTs. 
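The probe arrangement just described (a probe on half of the trials, falling equally often at each of eight positions) can be sketched as a scheduling routine. This is a minimal illustration: the function name is hypothetical, and balancing the positions within each block of trials is an assumption, since the source states only that the positions occurred equally often.

```python
import random


def probe_schedule(n_trials=48, n_positions=8, rng=None):
    """Return one entry per trial: a probe position (1..n_positions) or None.

    Half of the trials carry a probe, and each position is used equally often.
    Balancing within a single block is an assumption made for this sketch.
    """
    rng = rng or random.Random()
    n_probe = n_trials // 2
    if n_trials % 2 or n_probe % n_positions:
        raise ValueError("trial count cannot be balanced across probe positions")
    per_position = n_probe // n_positions
    schedule = [pos for pos in range(1, n_positions + 1) for _ in range(per_position)]
    schedule += [None] * (n_trials - n_probe)   # trials with no probe
    rng.shuffle(schedule)
    return schedule
```

With the default of 48 trials, each position receives three probes, so any one position is probed on 1/16 of the trials.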

In Experiment I, Ss ran six 48-trial blocks per
day. Each block was identical in form. All the
first letters were uppercase and the second letters
lowercase, so all matches were at the name level.
In Experiment II, half the blocks were all uppercase
(physical match) and half were identical to
Experiment I (name match).


TABLE 3

INTERTRIAL, INTERSTIMULUS, AND WARNING INTERVALS AND FIRST-LETTER
EXPOSURE DURATIONS (IN SECONDS) FOR EACH EXPERIMENT

ITI | WI | Exposure duration | ISI | Probe positions (milliseconds following warning signal)

l 1 5 5 | -3000, -2000, 150, 600, 1100, 1300, 1600, 1900
T 5 1 1 | -500, 150, 550, 650, 800, 1000, 1650, 1800
5 | .05 1 | -500, 150, 550, 650, 800, 1000, 1650, 1800

All Ss were run individually for two days. Data 
from the second day only were used for those Ss 
whose median probe RTs were under 1 second for 
all positions. Twelve Ss were included in the 
analysis of each experiment (a total of three were 
excluded for not meeting the criterion). 
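The inclusion rule stated above (median probe RT under 1 second at every probe position) is easy to express in code. The sketch below assumes a simple data layout, a mapping from probe position to a list of one S's probe RTs in msec.; the names and the example values are hypothetical.

```python
from statistics import median

CRITERION_MS = 1000  # median probe RT must be under 1 second at every position


def position_medians(probe_rts):
    """Median probe RT (msec.) at each probe position for one S."""
    return {position: median(rts) for position, rts in probe_rts.items()}


def meets_criterion(probe_rts, criterion_ms=CRITERION_MS):
    """True if the S's median probe RT is below the criterion at all positions."""
    return all(m < criterion_ms for m in position_medians(probe_rts).values())


if __name__ == "__main__":
    # Hypothetical data for one S at two probe positions.
    example = {1: [400, 450, 500], 2: [600, 700, 2000]}
    print(position_medians(example), meets_criterion(example))
```

Using the median makes the rule insensitive to an occasional very long RT: in the example, the single 2000-msec. response at position 2 does not by itself exclude the S.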


Results 


The basic results of each experiment
are the mean of the median probe RTs
at each position and the mean of the median
letter match RTs at each probe position.
These are shown in Figures 9, 10, 11, and 12.
Differences in letter match times between
physical and name matches are consistent
with the instructions and suggest Ss used a
visual code for the former and the letter
names for the latter (Posner, 1969).

The first point of interest is that the RT
to the probe is longer when it is in the ITI
than when it occurs after the warning signal.
Some evidence on this point appears in Figure 9
where the two probes after the warning
are slightly faster than those during the ITI.
However, this improvement does not reach
significance. In Experiment II, RTs at
Position 2 are faster than at Position 1, but
again this is not significant. However, at
Positions 3 and 4, each S shows a definite
improvement in RTs over the times obtained
during the ITI (p < .01, by sign test).
Thus, a prepared S gives faster RTs to the
secondary task than an unprepared S, even
though the S is instructed to prepare for the
visual signal. An analysis of the distributions
at each probe position revealed no evidence
of anticipation errors (e.g., RTs to the
noise of less than 200 msec.). Every effort
Ss was specific to the visual letter-matching
The auditory
-5 of the trials and
eight different Positions,
volved the times for the letter task and none
referred to times on the probe task,
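The sign test reported above for Positions 3 and 4 can be reproduced with an exact binomial calculation. The sketch below is an illustration, not the authors' computation: the function names are hypothetical, and it assumes that all twelve Ss included in the analysis showed the improvement, which is how "each S" is read here.

```python
from math import comb


def upper_tail(n, k):
    """P(X >= k) for X ~ Binomial(n, 1/2)."""
    return sum(comb(n, i) for i in range(k, n + 1)) / 2 ** n


def sign_test_p(n_positive, n_negative, two_sided=True):
    """Exact sign-test p value; ties are assumed to have been dropped beforehand."""
    n = n_positive + n_negative
    if two_sided:
        return min(1.0, 2 * upper_tail(n, max(n_positive, n_negative)))
    return upper_tail(n, n_positive)


if __name__ == "__main__":
    # Twelve Ss, all improving: well below the reported .01 level.
    print(sign_test_p(12, 0))
```

With twelve of twelve Ss in the same direction, the two-sided value is 2/4096, which is consistent with the reported p < .01.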

t of interest is that probe 
i as a con- 
first letter, 
on for only .5 second 
after the first let- 
6) was increased 
However, when the first letter was
present for 1 second (Figure 11), RT to the


until 300-500


and 6). From previous
results (see Figures 2 and 3), it would seem
likely that the encoding of the first letter was
pretty much complete by the time the probe
began to show interference.

Third, it is clear that probe times are
systematically increased at certain points.
Probes which occur shortly after the second
letter are always long (Figure 9, Probe Position 7,
and Figure 11, Probe Positions 7 and
8). These times decrease quite consistently
from 100-400 msec. after the second letter.
This result suggests that Ss have trouble
processing the probe during the response
phase of the letter-match task. If probes are
delayed until close to the letter-match RT,
they drop back to nearly the same values as
obtained during the ITI (Figure 9, Probe
Position 8). Probes which occur between
the two letters are increased if they fall close
to the occurrence of the second letter (see
Figure 11, Probe Position 6). This increase
does not appear to be due to the actual
appearance of the second letter. [...]
distributions were computed for probes 500
[...] in Experiment
[...] Probe Position 6), it was
found that few of the RTs were long enough
to occur after the second letter. Even with
these excluded from the analysis, RTs to
[...] still remained longer
[...] ITI and WI for all
[...] S

Do the probe RTs really reflect the processing
requirements of the letter-matching
task? With the long ISI it might be possible
for Ss to switch from processing the letter
to listening for the probes. In order to
check on this, an experiment was run in
which the first letter was turned off randomly
after 50, 150, 500, or 1000 msec. The
experiment was otherwise identical to the
name letter-matching condition of Experiment II.
Figure 13 shows the results for
seven Ss who ran in this experiment for five
days. The data represent probe times on
Days 4 and 5 for trials which had only 50-msec.
exposure of the first letter and those
for which the first letter was on for 500
msec. The RT results together with generally
very low error rates indicate that Ss are
not switching away from the letter task. If
they were, they would miss many of the
50-msec. exposures. Moreover, the probe
curves from this study are quite similar to
those illustrated in the previous studies.
Even if the first letter was turned off after
50 msec., S does not show substantial interference
with the probe until 500 msec.
Another possibility is that Ss are learning
the distribution of the probes and respond
most rapidly to those in the middle of the
distribution. This possibility seems rather
remote, in part because of the low frequency
of probes at any position (only on 1/16 of
the trials), but also because of the complex
shape of the RTs as a function of probe position.
Probe Positions 4 and 5 are the two
mid positions and should be fastest based on
an expectancy idea. These show very different
results in both Figure 9 and in the name
match condition of Figure 11. Moreover,
the probes at Position 8 are quite fast although
they are at the end of the distribution.
In addition, the results shown in Figure 13
illustrate that the point at which the
first letter is turned off has small but systematic
effects on the probe RTs in the direction
which would be expected if Ss were
showing interference from letters which were
removed somewhat more rapidly than those
which remained present. Thus, it appears
that probe RT reflects the attention demands
of the primary task.

FIG. 13. Probe RTs compared for exposure durations of the
first letters of 50 and 500 msec.


CONCLUSIONS 


The introduction outlined three component 
processes involved in the study of attention.
These are alertness, selectivity, and processing
capacity. In this paper each of these
components was identified with a particular
experimental operation. Alertness was
studied by varying the time between a warning
signal and a pair of letters which S was
required to match. Since the warning signal
told S nothing about what letters he was to
receive, it provided only rather minimal selective
information. Selectivity was studied
by providing one of the two letters so that S 
knew what he was to look for. Improve- 
ments in RT to the second letter provide a 
function related to encoding of the first let- 
ter. Both alertness and selectivity were 
found to improve performance in the letter- 
matching task. In both cases, the time re- 
quired to reach optimal performance was 
about 500 msec. This value appeared to be 
surprisingly consistent both between the two 
processes and over various levels of process- 
ing (physical, name, vowel). The encoding 
function appeared to be somewhat steeper 
over the first 150 msec. than the preparation 
function. As the level of encoding required 
by the match increased, the steepness of the 
function appeared to be somewhat reduced. 

The main consistent difference between 
the preparation and encoding functions was 
in the rate of responding to identical versus 
different letter pairs. The preparation in- 
terval improved physical matches and differ- 
ent responses equally. The encoding of one 
letter appeared to improve processing of
physical matches more than of mismatches.
This divergence between “same” and “different”
responses was particularly sharp in the
first 150 msec. For higher levels of instructions,
no such divergence was shown.

The preparation and encoding functions
are additive at least for physical and name
matches. This is true even after only a 150-msec.
interval when both are improving
rapidly. The additivity between the two
functions suggests that encoding may proceed
with nearly equal efficiency regardless of the
initial level of alertness. The bias introduced
from the first letter appears to develop
just as rapidly whether S has been prepared
by an earlier warning signal or not. This
provides one kind of answer to Moray’s
question about the relationship of selection
and alertness. The additivity also implies
that the two operations can be carried on
simultaneously without interference. If, for
example, S must both extract the name of a
letter for later matching and use it as a
warning signal, he may perform both functions
with as great efficiency as either one
alone.

This conclusion is similar to that reached
by Posner and Wilkinson (1969), in which
[...] measured by a negative shift
in the electroencephalogram (contingent negative
variation), was unaffected by making
the warning signal a letter which had to be
[...] work suggests a relationship between alertness,
as used in this paper, and the contingent
negative variation. If this relationship is
correct, then alertness might be thought of
as a general and widespread increase in cortical
activation. Results of studies with the
contingent negative variation support that
view and suggest that this improvement is not
confined merely to tasks in which there is an
overt motor response (Karlin, 1970). Thus,
the improvement does not appear to be confined
to the motor system. How does alertness
improve task efficiency? The failure of
the initial level of alertness to vary the efficiency
of encoding indicates that alertness
does not influence the earliest stages of processing.
However, it is possible that with
our method, the parallel development of
alertness and encoding obscures the importance
of the initial state of alertness so that
it does not affect the encoding function.
Perhaps a more interesting possibility is that
the early stages of processing are unaffected
by level of alertness. Studies of the relation
of early components of the evoked potential
to alertness (Karlin, 1970; Näätänen, 1970)
are not conclusive on this point.

If preparation is viewed as a general process
which is not selective, there must be
other mechanisms which account for the obvious
ability of Ss to select information.
The first letter appears to have a very specific
effect on selection of a new letter which
resembles it. If the first letter is viewed as
activating a trace system related to the process
of recognition (Morton, 1969; Posner,
1969), this activation may serve to make
selection of already active areas of memory
more efficient. Thus, in the “both” condition,
a single letter would lead to a general
alertness and a specific activation, which contribute
additively to the overall reduction in
RT. Of course, the activation of such areas
of memory could come either from internal
or external sources.

The nearly perfect time sharing between 
preparation and encoding suggests that at 
least one of these mental operations does not 
require central processing capacity. If they 
both did, they would be expected to interfere. 
In order to further test whether alertness 
and encoding were free of demands on pro- 
cessing capacity, the attention demands of 
the letter-matching tasks were measured by 
introduction of a simple probe RT task 
to white noise. The results suggest that 
a warning signal serves to reduce RT 
both to the main task and to the probe 
task, even when probes were introduced on 
only half the trials. Moreover, the encoding 
of the first letter did not seem to interfere 
with the probe task, provided Ss had one 
second before the second letter was intro- 
duced. This was the case even when the 
first letter was turned off so that S had to 
encode quickly. This suggests that encoding 
of a letter does not require processing capacity
as we have defined it here. Of course,
the reasonableness of the conclusion depends
on the assumption that probe RT will be
sensitive to processing demands of the primary
task.

Interference with the probe was significant
when it was introduced 500 msec. prior to
the second letter. The interference was not
due to the physical occurrence of the second
letter, since the effect was still present when
all RTs above 500 msec. were eliminated.
What aspect of the letter matching causes
interference with the late probes? One possible
explanation is that such mental operations
as rehearsal or generation of distinctive
features for testing, which follow encoding
of the first letter, require processing capacity,
while encoding itself does not.

These results suggest that attention in the
sense of central processing capacity is related
to mental operations of which we are conscious,
such as rehearsing or choosing a response,
but is not related to the contact between
the input and long-term memory that
leads to the letter name. Several lines of
evidence have tended to relate single-channel 
effects to response selection rather than all 
forms of stimulus processing (Herman & 
Kantowitz, 1970; Keele, 1970). Moreover, 
other research has indicated that conscious 
awareness is itself rather late in the sequence 
of mental processing (Fehrer & Raab, 1962; 
Libet, 1966). If single-channel effects were 
tied to conscious processing, one would not 
expect to see them closely correlated with encoding
of the first letter (Deutsch & Deutsch,
1962; Norman, 1968), but rather with operations
on the retrieved product of such encoding.
The data obtained in this study appear
to be consistent with such a view.


STATISTICAL DECISION MODEL FOR
AUDITORY WORD RECOGNITION


A statistical decision model for auditory word recognition is proposed which
accounts for response dependencies in a multitrial recognition task. An experimental
test of the model is described in which the effect of increasing
stimulus word frequency is to increase the criterion level of mismatch used
in rejecting hypotheses. Additional data are presented which suggest that
(a) hypothesis word frequency is correlated with stimulus word frequency,
and (b) high-frequency words may be phonetically distinct from low-frequency
words. The implications of these findings for “sophisticated
guessing” and decision theoretic explanations of the word-frequency effect
are discussed.


It is a well-known finding that words
which are common in the language are more
readily identified than are words which are
uncommon in their occurrence (Howes,
1954; Howes & Solomon, 1951). While
this word frequency effect has become a
relatively routine observation in studies of
auditory and visual word recognition, an
adequately documented explanation for the
phenomenon has as yet eluded investigators
in the area. Until quite recently, the most
successful accounts have explained the word
frequency effect as the result of a strategy
which the subject (S) uses in guessing the
correct word on those trials where the stimulus
word is not correctly perceived (e.g.,
Pollack, Rubenstein, & Decker, 1960; Savin,
1963; Solomon & Postman, 1952). In this
“sophisticated guessing” model, it is assumed
that even when S has incorrectly perceived
the stimulus word, he has still received
enough information about phonetic characteristics
of the word to rule out all but a restricted
set of alternatives from which he
chooses a word at random to serve as his
guess. In addition, it is assumed that the
selection of a response from the pool of available
alternatives is carried out with a bias in
favor of choosing words of high frequency
ahead of words of low frequency. According
to this theory, the word frequency effect
arises from the greater likelihood of S’s correctly
guessing a high-frequency stimulus on
those occasions when he has not correctly
perceived it.

This view of the word recognition process 
has been challenged by Broadbent (1967), 
who proposes instead that the word fre- 
quency effect is due to a criterion bias, as 
conceived in signal detection theory. Ac- 
cording to the signal detection formulation, 
the presentation of a stimulus word evokes, 
to some extent, every word in < 's repertoire 
as a potential response. Associated with 
each possible response word is a decision axis 
for the particular response alternative in 
question. Information gained from the 
presentation of a stimulus (the "observa- 
tion") determines a point on the decision 
axis associated with each word in S's reper- 
toire. The “observation” corresponds to 
phonetic and structural information about the 
word which has been perceived, and a high 
value on the decision axis corresponds,
therefore, to a condition of phonetic-acoustic
match between potential response and perceptual
input, while a low value on the decision
axis corresponds to an acoustic mismatch
between input and response word.
According to Broadbent's theory, S must 
decide among the alternative words in his 
repertoire by comparing their values on the 
decision axis for the available perceptual in- 
put and by selecting the word with the 
highest value, that is, the one which provides 
the best match to the stimulus. This deci- 
sion is complicated, however, by the fact that 
S is biased in the direction of accepting lower 
values on the decision axis for high-fre- 
quency words than for low-frequency words 
as a criterion for responding. This assump- 
tion is justifiable in light of the fact that 
high-frequency words have higher a priori 
subjective probabilities of occurring in the 
context of the experiment than do low- 
frequency words. The resulting criterion 
bias leads S to favor high-frequency hypotheses
over low-frequency hypotheses
when their values on the decision axis are 
the same. The S will thus respond more 
readily with the correct word when it is a 
word of high frequency than when it is of 
low frequency. Broadbent’s attempt to in- 
validate the sophisticated guessing theory by 
looking at data on the word frequencies of 
errors made in response to high- and low- 
frequency stimuli has been disputed in a re- 
cent paper by Catlin (1969),³ who argues

³ Catlin's argument follows the urn model for
sophisticated guessing of Pollack et al. (1960),
which assumes that the effect of a stimulus presentation
is to select a sample of balls from an urn
representing S's response hierarchy. All of the
balls in the sample selected correspond to words
which bear a phonetic relationship to the stimulus
word, as it has been in part perceived. While a
portion of the balls in this “restricted” urn correspond
to the stimulus word, the remaining balls
represent erroneous responses. Catlin maintains
that the assumption of an acoustical equivalence of
high- and low-frequency words implies that the
number of balls in the restricted urn corresponding
to high-frequency (E_H) and low-frequency (E_L)
errors does not depend on the frequency of the
stimulus word. From this assumption, he arrives
at the conclusion that the proportion of high-frequency
errors E_H/(E_H + E_L) is independent of
stimulus word frequency, a result which is contrary
to Broadbent's conclusion. It might here be
pointed out that Catlin's assumption that E_H and E_L
are independent of stimulus word frequency implies
also that the size of the sample of balls drawn from
the original urn is a function of stimulus word

that a modification in the interpretation of
the effects of acoustical equivalence of high- 
and low-frequency words yields a guessing 
model which can account for Broadbent’s 
data. In addition, Morton (1968) has 
shown that both the sophisticated guessing 
and the decision theory models require re- 
vision in order to account for the data pre- 
sented by Brown and Rubenstein (1961), so
that at present no clear empirical basis exists
to distinguish between the two points of view. 
In the present study, an attempt was made to 
demonstrate criterion bias in auditory word 
recognition, using mathematical and experi- 
mental techniques which allow us to provide 
a metric for the decision axis and to actually 
locate the distributions for response words 
and the differential criteria which, it is hy- 
pothesized, are used for high- and low- 
frequency words. 


A DECISION MODEL FOR SEQUENTIAL
WORD RECOGNITION


In order to investigate the effect of word 
frequency on factors determining S’s choice 
of response in auditory word recognition, a 
sequential recognition procedure was em- 
ployed. In such a task, S is given an oppor- 
tunity to make a series of consecutive guesses 
about the identity of a masked stimulus word, 
as the degree of masking is reduced (and 
the signal-to-noise ratio [S/N] increased) 
over a series of trials (Frederiksen, 1967, 
1969). For example, a stimulus word might 
be presented 15 times, starting at an S/N of 
—6 decibels (db.) on the first trial, and with 
an increase in S/N of 1 db. on each subsequent
trial. After each trial, S writes
down his guess as to the identity of the
masked word.
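
The presentation schedule in this example can be written out as a short sketch. The function and argument names are illustrative only; the starting level, step size, and number of trials are the ones given in the example above.

```python
def snr_schedule(start_db, step_db, n_trials):
    """S/N (in db.) on each of n_trials consecutive presentations."""
    return [start_db + step_db * i for i in range(n_trials)]

# The example from the text: 15 presentations, starting at -6 db.,
# with the S/N raised by 1 db. on each subsequent trial.
schedule = snr_schedule(start_db=-6, step_db=1, n_trials=15)
print(schedule)
```

With these values the final trial falls at +8 db., which agrees with the range of S/N's reported for the present study.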
In the sequential task, S’s response on 
Trial i + 1 is clearly dependent not only on
the acoustical character of the stimulus on
that trial, but on his response on the previous
trial (i) as well. Since S is required to

frequency, since the restricted urn always contains
balls representing the stimulus word, and the number
of balls corresponding to the stimulus word
depends on the stimulus word frequency. Whether
this assumption is consistent with [...] presentation
of a stimulus [...] balls from the initial
urn is a question worth pondering.

write down his current hypothesis after each
trial, response statistics, such as the trial on 
which he first writes the correct word, can 
be computed. Through an analysis of re- 
sponse statistics, obtained over a series of 
recognition problems, estimates can be ob- 
tained of parameters which reflect the under- 
lying decision processes operating in per- 
ceptual recognition. 


The Decision Model 


In the terms of statistical decision theory, 
we suggest that S’s response on any par- 
ticular trial is determined by his choice of a 
criterion used in rejecting his previous re- 
sponse. On each trial, S is thought to com- 
pare his current hypothesis with information 
available from the masked stimulus, and to 
test for a disparity between the two. Since 
the disparity between hypothesis and percep- 
tual input to the decision system varies from 
moment to moment, it projects a distribution 
of values on a judgmental (match-mismatch)
axis, as in the top drawing in Figure 1. 
When the degree of mismatch on a particular 
trial exceeds a criterion level, the S rejects 
his present response and reenters his re- 
sponse hierarchy to select, on perceptual 
grounds, a new response. If, on the other 
hand, the degree of mismatch fails to reach 
the criterion level, the S instead perseverates 
with his previous response. The area under
the probability density function to the left of
the criterion corresponds to the probability
(α) that S will perseverate in this way.
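
As a minimal numerical sketch of this idea, the probability of perseveration can be computed as the area of a mismatch density lying to the left of the criterion. The normal form of the density, the unit standard deviation, and all numerical values below are assumptions made for illustration; they are not estimates from the experiment.

```python
from statistics import NormalDist

def perseveration_probability(criterion, mean_mismatch, sd=1.0):
    """Area under a normal mismatch density to the left of the criterion."""
    return NormalDist(mean_mismatch, sd).cdf(criterion)

# Two hypothetical hypotheses judged against the same criterion:
# the second mismatches the perceptual input more on the average.
alpha_1 = perseveration_probability(criterion=1.0, mean_mismatch=0.5)
alpha_2 = perseveration_probability(criterion=1.0, mean_mismatch=1.5)

# The same hypothesis judged against a more lenient (higher) criterion.
alpha_lenient = perseveration_probability(criterion=2.0, mean_mismatch=0.5)

print(f"alpha, hypothesis 1:        {alpha_1:.3f}")
print(f"alpha, hypothesis 2:        {alpha_2:.3f}")
print(f"alpha, raised criterion:    {alpha_lenient:.3f}")
```

A larger average mismatch lowers the computed probability, while a higher criterion raises it; these are the two ways in which hypotheses are taken to differ in the decision model.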

The second drawing in Figure 1 shows
one way in which two hypotheses might differ,
namely, in their positions on the decision
axis. Two distributions are shown, one for
a tenable hypothesis (Hypothesis 1) and the
other for one that is less so (Hypothesis 2).
Since the average degree of mismatch is
higher for Hypothesis 2 than for Hypothesis
1, the probability α is smaller for the second
hypothesis than for the first, indicating that
an S currently holding the second hypothesis
will be more likely to reenter his response
hierarchy and select a new response than will
an S holding the first one. An experimental
variable which might affect the expected
degree of hypothesis mismatch on successive
trials is the step-size change in S/N
occurring on each trial. A large reduction
in noise level from one trial to the next
would presumably reveal more information
about the discrepancy between stimulus and
hypothesis than would a small change in
S/N.

FIG. 1. An illustration of the effects of stimulus-hypothesis
congruence and criterion shift on α, the
probability of perseveration, as it is represented in
the decision model. [Three drawings, each plotted on
the decision axis running from match to mismatch, with
the criterion marked; the second drawing shows separate
distributions for Hypothesis One and Hypothesis Two.]
The drawing at the bottom of Figure 1 
illustrates a second way in which two hypotheses
may differ, namely with regard to
the criteria which are used by S in his deci- 
sion to reject them. Under the circum- 
stances, even though the hypotheses are equal 
in their degree of match to the perceptual 
input, S will reject one more readily than the 
other. In particular, following Broadbent's 
(1967) suggestion, Criterion 1 in the figure 
might correspond to the criterion for reject- 
ing a low-frequency hypothesis word, while 
Criterion 2 might correspond to that used in 
rejecting a high-frequency word. The S
would then be biased toward accepting less
discrepant information as grounds for rejecting
the low-frequency hypothesis than he
would in order to reject a hypothesis of high
frequency.

The key parameter in each of these illustrations
is α, the probability of perseverating
with one's previous hypothesis on a given 
recognition trial. Direct estimation of the 
value of this parameter for real data (by 
simply counting perseverative responses) is 
complicated by the fact that an S can emit the 
same response on two consecutive trials for 
either of two reasons: (a) he can refuse to 
reject his previous hypothesis when its de- 
gree of mismatch has failed to exceed his 
criterion, or (b) he can resample his previous 
response after having first rejected it, since 
sampling from the response hierarchy is car- 
ried out with replacement (cf. Frederiksen, 
1969). While direct measurement of α is
thus ruled out, indirect measurement of α is
possible through the application of a mathematical
model for the word recognition
process.
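
The difficulty with simple counting can be made concrete with a small worked calculation. The decomposition below follows from the two reasons listed above; the function name and the numerical values are hypothetical, chosen only to show the size of the bias.

```python
def repeat_probability(alpha, resample_same):
    """Probability of emitting the same response on two consecutive trials.

    alpha:         probability of perseverating (previous response not rejected)
    resample_same: probability that, after a rejection, sampling with
                   replacement returns the previous response again
    """
    return alpha + (1.0 - alpha) * resample_same

alpha = 0.5          # hypothetical true probability of perseveration
resample_same = 0.2  # hypothetical chance of redrawing the same response
observed = repeat_probability(alpha, resample_same)
print(f"true alpha = {alpha:.2f}, observed repeat rate = {observed:.2f}")
```

Whenever the chance of redrawing the previous response is greater than zero, the observed repeat rate exceeds the true value of α, so a raw count of repeated responses would overestimate it.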


The Mathematical Model 


In the mathematical model, it is assumed
that whenever S rejects a previous hypothesis,
he reenters his response hierarchy
to select, with replacement, a new response.
His probability of selecting the correct re- 
sponse is assumed to be the value one would 
predict in the absence of intertrial response 
dependencies; it thus will depend on S/N 
but not on the number of previous trials to 
which S has been exposed. The probability
that the new response is correct is assumed,
therefore, to be a linear function γ + θs_i of
the S/N s_i on the particular trial (i). The
parameter γ represents the probability that S
responds correctly at the reference S/N,
which is the S/N on Trial 1. Theta repre- 
sents the change in probability correct for a 
unit change in S/N. Ample evidence sup- 
porting this linearity assumption for situa- 
tions in which successive trials are inde- 
pendent exists in the literature and here 
needs no detailed citation (e.g., Hawkins & 
Stevens, 1950; Miller, 1947). There certainly
is no reason to suspect great departures
from linearity for the range of S/N's
employed in the present study (—6 db. to +8 
db.). In any event, if serious departures 
from the model predictions are encountered, 
alternative functions relating probability cor- 
rect to S/N could be considered, such as the 


logistic or the normal ogive. The function
γ + θs_i can be thought of as giving the probability
of S's being correct on Trial i that
would be expected if S's response on this
trial were independent of his response on
Trial i − 1.

The trial-to-trial response dependencies,
which are more characteristic of a sequential
recognition task, are dealt with in the model
by assuming that there is a fixed probability
α (for a given set of experimental conditions)
that S perseverates with his previous
response on any given trial, rather than reenter
his response hierarchy to select a new
response (with probability 1 − α). This assumption,
combined with the assumption of
linearity, is sufficient to derive a theoretical
probability distribution of first recognition
trials obtained in a group of Ss, for a set of
equivalent items. A first recognition trial is
defined as the trial on which S states for the
first time the correct response word. Letting
P_j stand for the probability that S is
correct for the first time on Trial j, and setting
s_j = 0 for j = 1 (which merely establishes
the S/N on Trial 1 as the reference
level), we can write


\[ P_1 = \gamma \qquad [1] \]

for S's probability of being correct for the
first time on Trial 1;

\[ P_2 = (1 - \gamma)(1 - \alpha)(\theta s_2 + \gamma) \qquad [2] \]

for his probability of being correct for the
first time on Trial 2; and

\[ P_j = (1 - \alpha)(\theta s_j + \gamma)(1 - \gamma) \prod_{i=2}^{j-1} [1 - (1 - \alpha)(\theta s_i + \gamma)] \qquad [3] \]

for his probability of being correct on
Trial j for the first time. Finally, the
probability P_F that S fails to recognize at
all during the k trials of the experiment is
given by

\[ P_F = (1 - \gamma) \prod_{i=2}^{k} [1 - (1 - \alpha)(\theta s_i + \gamma)] \qquad [4] \]

Together, Equations 1 through 4 specify the
theoretical probability distribution of first
recognition points in the sequential recognition
task. It should be noted that since we
have chosen to fit our mathematical model to 
the distribution of first recognition trials, no 
data beyond the trial on which S first writes 
the correct response word are considered in 
the analysis. Admittedly, information is lost 
in using this response statistic, since Ss may
reject the correct response on the next trial
and then return to it later. However, the
advantage in using this response statistic lies
in the relative simplicity of the equations for
the probability distribution and in the experimental
independence of α from the word
frequency of the stimulus. The details of the
derivation of the above equations and of the
maximum likelihood estimation procedure
are presented elsewhere (Frederiksen, in
preparation). Fitting the theoretical probability
distribution to experimental data provides,
in addition to estimates of γ and θ, an
estimate of the probability α that S perseverates
with his current response on any given
trial. Maximum likelihood estimates of the
three model parameters can be routinely calculated
for any such distribution, using a
computer program that employs an iterative
solution for maximizing the log likelihood
function.⁴ Note again that it is through the
inclusion of a perseveration parameter α that
the mathematical theory has taken into account
the dependency of behavior on Trial
i + 1 on that of Trial i. The mathematical
model, however, makes no assumptions about
the sensory-perceptual basis of the decision
to perseverate. It is only in the terms of the
decision model that the probability α is interpreted
as the likelihood that the degree of
mismatch between hypothesis and acoustic
(perceptual) input fails to exceed a criterion
level. The mathematical model thus has
here the status of a measurement model,
which in its application enables us to study
probability of perseveration as it relates to
other characteristics of the experimental
situation.
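
The distribution of first recognition trials and its log likelihood can be sketched in a few lines. This is an illustration only and not the estimation program referred to in the text: the function names, the parameter values, and the observed counts are hypothetical, and the recursion simply assumes that, on each trial after the first, a not-yet-correct S rejects his previous response with probability 1 − α and then samples the correct word with probability θs_i + γ.

```python
import math

def first_recognition_distribution(alpha, gamma, theta, snr):
    """Return ([P_1, ..., P_k], P_fail) for a sequential recognition problem.

    snr[i] is the S/N on trial i + 1 relative to Trial 1, so snr[0] == 0.
    """
    probs = [gamma]          # Trial 1: no previous response to perseverate with
    not_yet = 1.0 - gamma    # probability that no correct response has occurred
    for s in snr[1:]:
        # Reject the previous response, then sample the correct word.
        q = (1.0 - alpha) * (theta * s + gamma)
        probs.append(not_yet * q)
        not_yet *= 1.0 - q
    return probs, not_yet

def log_likelihood(counts, fail_count, alpha, gamma, theta, snr):
    """Multinomial log likelihood (up to a constant) of observed counts of
    first recognition trials; fail_count is the number never recognized."""
    probs, p_fail = first_recognition_distribution(alpha, gamma, theta, snr)
    ll = sum(n * math.log(p) for n, p in zip(counts, probs) if n > 0)
    if fail_count > 0:
        ll += fail_count * math.log(p_fail)
    return ll

# Hypothetical parameter values and a 15-trial, 1 db. per trial schedule.
snr = [float(i) for i in range(15)]
probs, p_fail = first_recognition_distribution(
    alpha=0.5, gamma=0.05, theta=0.01, snr=snr
)
print([round(p, 4) for p in probs], round(p_fail, 4))
```

The trial probabilities and the failure probability sum to one, as a probability distribution must. An iterative maximizer applied to `log_likelihood` would be one way to obtain the parameter estimates, though the procedure actually used by the author is not described here.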


⁴ Copies of the FORTRAN program, written for
the IBM 360, are available from the author.


AN EXPERIMENTAL TEST OF THE
DECISION MODEL


The present experiment was designed to
test some implications of the decision model
for sequential word recognition outlined
above. Experimental manipulations were
sought which would vary (a) the position of
the hypothesis distribution on the judgmental
(match-mismatch) axis, and (b) the criterion
employed by S in rejecting his previous
hypothesis. In order to satisfy Requirement
a, stimuli were presented using each
of three different rates of change in S/N:
0 db. per trial, 1 db. per trial, and 2 db. per
trial. It was hypothesized that the distributions
obtained for these three conditions
would be similarly ordered in their positions
on the decision axis.

In order to influence S’s choice of criterion 
as specified in Requirement b, stimulus 
words belonging to two word frequency 
classes were presented, in the belief that 
manipulating stimulus word frequency would 
have the effect of altering hypothesis word 
frequency in a similar direction. Such a cor- 
relation between stimulus and hypothesis 
word frequency is not without experimental 
support (cf. Broadbent, 1967; Brown & Ru- 
benstein, 1961), and data necessary to test 
for this effect would certainly be available 
in the results of the present study. Accord- 
ingly, anticipating that this correlation be- 
tween stimulus word frequency and hy- 
pothesis word frequency would be found to 
occur, it was further hypothesized that the 
criterion used by S in rejecting hypotheses 
formed in response to high-frequency words 
would exceed the criterion used in rejecting 
hypotheses formed in response to low-fre- 
quency words. Finally, note that while this 
second hypothesis resembles the assumption 
underlying Broadbent's explanation for the 
word frequency effect, we are here dealing 
with criteria used in rejecting previously 
held hypotheses, whereas Broadbent asserts 
that differential criteria are employed in the 
selection of new responses. However, since 
the rationale underlying the present hypothe- 
sis involves the notion that high-frequency 
words have a higher subjective probability 
of occurring in the context of the laboratory 
experiment than do low-frequency words 
and since this notion appears to be as ap- 
plicable to response selection as it is to re- 
sponse rejection, a verification of the second 
hypothesis should have some bearing on our
judgment about the plausibility of Broadbent's
model for the selection of responses.


TABLE 1
RESULTS OF THE EXPERIMENT DEALING WITH CRITERION SHIFT IN AUDITORY WORD RECOGNITION

               0 db. per trial      1 db. per trial      2 db. per trial
Item           High       Low       High       Low       High       Low
               (columns give word frequency)
Parameter
  Alpha        .730       .509      .533       .250      .413       .000
  Gamma        .050       .034      .?41       .033      .077       .024
  Theta        .000       .000      .008       .012      .036       .015
Log L        −15.471    −17.420   −30.448    −34.211   −20.143    −18.804
Chi-square      -          -        6.167     16.272     6.631      5.607
df              0          0        9         10         5          4
p               -          -       <.75       <.10      <.25       <.25
z              .62        .03       .08       −.68      −.22      −1.64ᵃ

ᵃ Since an inverse normal transformation for p = 0 is indeterminate, alpha was
arbitrarily set to .05 for the purpose of computing a z score.


Method


Subjects. The Ss in this experiment were
10 undergraduates attending Brandeis University.
Each S volunteered his services and was paid $2.00
for his participation in the study. Those Ss who
reported a history of hearing defects were excluded
from the sample.

Procedure. In the present experiment, the Ss
were presented a set of 72 masked disyllabic stimulus
words to recognize. Each word was presented
for a series of consecutive trials, and after each
recognition trial, the S's task was to write down
his guess as to the identity of the masked word.
The 2 × 3 factorial design called for two levels of
Thorndike-Lorge frequency (high and low) and
three rates of change in S/N (0 db. per trial, 1 db.
per trial, and 2 db. per trial). The S/N on Trial 1
was in all cases −6 db. The total number of trials
for the three rates of change in S/N were 10, 15,
and 8, respectively. The order of presentation of
the six conditions was randomized so that within
any block of 12 recognition problems, two instances
of each condition occurred. The 12 words assigned
to each combination of word frequency and
rate of change in S/N were randomly selected from
the Thorndike-Lorge tables (Thorndike & Lorge,
1944) subject to the following constraints: (a)
Words having Thorndike-Lorge frequencies (general
count) above 27 were classified as high frequency,
with all other words classified as low
frequency; (b) all words were of two syllables;
(c) words assigned to the six conditions were balanced
as to location of the stressed syllable; (d)
each word was chosen to have at least one phonetically
similar word of the same frequency class
available. Stimuli were spoken by a student radio
announcer and were recorded on magnetic tape
using an Ampex Model PR-10 tape recorder. Reproduction
was by means of a Magnacorde Model
1024 tape recorder and a Telex St-11 Stereo Headset.

Each S was presented with the same series of 72 
word recognition problems along with 12 practice 
problems. The loudness of the stimulus was ad- 
justed by each S to a comfortable listening level 
before the experimental series was begun. Then, 
S pressed a key to begin each recognition problem, 
and through suitable control circuitry, the Magna- 
corde tape deck was programmed to halt at the con- 
clusion of each problem. The Ss thus paced them- 
selves and were allowed to rest whenever they de- 
sired to do so. Trials within a problem, however,
were presented at a fixed rate of one trial every
four seconds.


Results 


By computing maximum likelihood estimates
of the model parameters for each of
the six conditions, separate estimates of α, γ,
and θ were obtained for each of the experimental
conditions. These results are presented
in Table 1. The fit of the mathematical
model can be evaluated for the 1 and
2 db. per trial data using a chi-square test
of goodness of fit, and can be seen to be satisfactory
in each case (with p taking on values
between .75 and .10). Comparing the γ's
obtained for high- and low-frequency words
for each rate of change in S/N, it can be seen
that in every case, the high-frequency words
are recognized more frequently at the reference
S/N (−6 db.) than are the low-frequency
words. Thus, the word frequency
effect is evident in these data. At the same
time, there is no consistent effect of word frequency
on the slope parameter θ, which suggests
that there is no perceptual advantage
of one kind of word over another.

The present model enables us to directly
test the criterion-shift theory of the word
frequency effect. It is clear that if hypothesis
word frequency is correlated with stimulus
word frequency, lowering the average
frequency of the stimulus words should result
in a decrease in the probability α that the
level of mismatch fails to exceed criterion,
due to a criterion shift in the direction of
increasing the readiness of the S to reject
hypotheses. Increasing the average word
frequency, however, will have the opposite
effect, that of increasing the probability α
that S will perseverate with his former response.
On the other hand, increasing the
step-size change in S/N occurring on each
trial (at a fixed level of word frequency)
should result in a decrease in α, since the
average degree of mismatch should be greater
when the change in noise level is considerable
than when it is slight. The experimental
results clearly support both of these predictions.
For each rate of change in S/N, the
estimate of α for high-frequency words is
greater than it is for low-frequency words.
Moreover, the effect of increasing the step-size
change in S/N was to increase the average
discrepancy between hypothesis and perceptual
input to the decision process, with a
consequent decrease in the value of the parameter
α.

FIG. 2. Theoretical distributions on the decision
axis corresponding to the three rates of change in S/N
employed in the experiment dealing with criterion shift
in word recognition. (The two criteria are for high-frequency
words [C_H] and low-frequency words [C_L].
The origin was arbitrarily set at C_L, and the unit was
chosen to be C_H − C_L.) [Horizontal axis: decision
axis, running from match (negative values) to mismatch
(positive values).]

Since the α parameters obtained for high-
and low-frequency words (for a fixed rate of
change in S/N) can be interpreted as cutting
off two areas of a normal distribution, it is
possible to solve for the mean and standard
deviation of this distribution (and in the
process establish a scale for the decision
axis) if we set the origin of the scale at the
low-frequency criterion and define the unit
of measurement as the difference between the
high- and low-frequency criteria. If we assume
in addition that the difference between
the criteria for high- and low-frequency
words is independent of the step-size change
in S/N, a metric has then been specified for
the decision axis because we then have a
common unit. The three distributions corresponding
to the 0, 1, and 2 db. per trial
conditions can then be located on this common
scale. If z_H denotes the z score corresponding
to the probability α obtained for the
high-frequency words and if z_L is the z score
corresponding to α obtained for the low-frequency
words, then the mean of the distribution
is given by


\[ \mu = \frac{-z_L}{z_H - z_L} \]

while the standard deviation of the distribution
is given by

\[ \sigma = \frac{1}{z_H - z_L} \]

The relative spacing and widths of these distributions
for 0, 1, and 2 db. per trial conditions
are shown in Figure 2. The ordering
of the three distributions is in the hypothesized
configuration and is taken as support
for the interpretation that (a) the expected
degree of mismatch is positively related to
the amount of change in noise level, and (b)
differential criteria for rejection of hypotheses
are employed for high- and low-frequency
words.
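
The scaling just described can be checked numerically. The sketch below places the origin at the low-frequency criterion and takes the distance between the two criteria as the unit, then solves for the mean and standard deviation of the mismatch distribution. The function name is illustrative, and the two α values are the estimates reported in Table 1 for the 1 db. per trial condition.

```python
from statistics import NormalDist

def decision_axis_scale(alpha_high, alpha_low):
    """Mean and standard deviation of the mismatch distribution on a scale
    with origin at the low-frequency criterion and unit equal to the
    distance between the high- and low-frequency criteria."""
    std_normal = NormalDist()
    z_high = std_normal.inv_cdf(alpha_high)
    z_low = std_normal.inv_cdf(alpha_low)
    sigma = 1.0 / (z_high - z_low)
    mu = -z_low * sigma
    return mu, sigma

# Alpha estimates for the 1 db. per trial condition (high, low frequency).
mu, sigma = decision_axis_scale(alpha_high=0.533, alpha_low=0.250)
print(f"mean = {mu:.3f}, standard deviation = {sigma:.3f}")

# Consistency check: the areas to the left of the two criteria
# (at 0 and at 1 on this scale) reproduce the two alpha values.
fitted = NormalDist(mu, sigma)
print(round(fitted.cdf(0.0), 3), round(fitted.cdf(1.0), 3))
```

Because `inv_cdf` is undefined at a probability of exactly 0, a condition with an estimate of .000 cannot be entered directly, which is the situation the table note on the arbitrary value of .05 addresses.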


Discussion 

The results of this experiment are clearly
consistent with predictions derived from the
decision theory model, and lend support to
the notion that differential criteria are associated
with words belonging to different frequency
classes. The present findings show
in particular how such multiple criteria are
used in making perceptual decisions in the
context of a sequential word recognition
task. They show how the amount of discrepant
information necessary for rejecting
high-frequency responses exceeds that necessary
for rejecting low-frequency hypotheses.
This difference is comparable in magnitude
to that produced by increasing the step-size
change in S/N from 0 db. per trial to 1 db.
per trial (see Figure 2). These results are
also consistent with Broadbent's criterion-placement
model for response selection, although
they do not necessarily imply that
such a model is correct. Unfortunately, the
acceptance of a single model for response selection
must await further research. However,
as will be seen from the data on word
frequencies of errors, it is possible that phonetic
correlates of word frequency may play
a part in leading S to select responses which
belong to the same frequency class as that of
the stimulus. This finding, in turn, has implications
for a choice of response-selection
model.


TABLE 2
PROPORTIONS OF HYPOTHESIS WORDS IN EACH
OF THE WORD FREQUENCY RANGES

Stimulus word           Hypothesis word frequency
frequency         AA (>100)     100-21        20-1         Total
AA (>100)         .336 (66)     .328 (64)     .336 (66)    (196)
100-21            .266 (154)    .334 (194)    .400 (232)   (580)
20-1              .216 (144)    .310 (206)    .474 (315)   (665)
Total             .253 (364)    .322 (464)    .425 (613)   (1441)

Note. Raw frequencies are given in parentheses.

In order to understand more clearly why 
Ss set a higher criterion for rejecting hy- 
potheses developed in response to high-fre- 
quency stimuli than to low-frequency stimuli, 
a tally was made (and is shown in Table 2) 
of the number and proportion of hypotheses
falling in each of three Thorndike-Lorge frequency
ranges for stimulus words in each of
the same three frequency ranges. Note that
in tallying the hypothesis word frequencies,
correct responses were excluded from consideration.
It is evident that the majority
of hypotheses developed in response to low-frequency
stimuli belong in the low-frequency
class, while equal numbers of high-,
medium-, and low-frequency hypotheses are
developed in response to high-frequency
stimuli. This result is clearly inconsistent
with Broadbent's (1967) conclusion that errors
of high frequency are in a constant ratio
(regardless of stimulus word frequency)
to errors of low frequency. It should be
pointed out again that the association illustrated
in Table 2 between stimulus word frequency
and hypothesis word frequency has
been found previously in at least two studies.
Brown and Rubenstein (1961) reported a
positive relationship between the frequency
class of the stimulus and that of responses
to the stimulus; Broadbent (1967) observed
a trend similar to this in his disyllabic data,
but not in his data for monosyllabic words.

The relation observed here between the
word frequency of stimulus and that of hy- 
pothesis is also at variance with predic- 
tions from sophisticated guessing theory, as 
Broadbent (1967) has pointed out. Evi- 
dently, while the effect of the stimulus pre- 
sentation is to delineate a restricted set of 
alternatives from which S chooses his re- 
sponse, this restricted set of potential re- 
sponses is far from being random with re- 
spect to word frequency. The above data 
indicate that the set contains a dispropor- 
tionate number of words belonging to the 
same frequency class as the stimulus. An 
implication of this finding, as Brown and 
Rubenstein (1961) pointed out, is that 
stimulus words carry information as to their 
frequency class even when presented with an 
accompanying masking sound. 
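
The dependence of hypothesis word frequency on stimulus word frequency can be recomputed from the raw counts given in parentheses in Table 2. The sketch below only repeats that arithmetic; the dictionary layout and names are illustrative, and the counts are those printed in the table.

```python
# Raw counts from Table 2. Rows are stimulus word frequency ranges;
# the three entries in each row are hypothesis words falling in the
# AA (>100), 100-21, and 20-1 ranges, in that order.
counts = {
    "AA (>100)": [66, 64, 66],
    "100-21": [154, 194, 232],
    "20-1": [144, 206, 315],
}

def row_proportions(row):
    """Proportion of hypotheses in each frequency range for one stimulus class."""
    total = sum(row)
    return [n / total for n in row]

proportions = {label: row_proportions(row) for label, row in counts.items()}
for label, props in proportions.items():
    print(label, [round(p, 3) for p in props])

grand_total = sum(sum(row) for row in counts.values())
print("total hypotheses tallied:", grand_total)
```

The proportion of low-frequency (20-1) hypotheses rises from roughly a third for the highest-frequency stimuli to nearly half for the lowest-frequency stimuli, which is the disproportion referred to above.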

The correlation obtained between stimulus 
word frequency and hypothesis word frequency
suggests, then, the additional theo-
retical hypothesis that low-frequency words 
might be characterized by particular phonetic 
features which are common only to infre- 
quently occurring words. As a preliminary 
test of this "phonetic hypothesis," a linguis- 
tic comparison was made of the high- and 
low-frequency stimulus words employed in 
the present experiment. In particular, 
counts were made of the number of occur- 
rences of Zipf's (1935) "high-complexity" 
and “low-complexity” phonemes. These two 
groups of phonemes, Zipf argued, differ in 
articulatory difficulty, and on the basis of 
“the principle of least effort,” should be (and 
are) similarly ordered with respect to over- 
all frequency of occurrence in the spoken 
language. Zipf's figures for the percentage 
of frequency of occurrence of each of eight 
phonemes in spoken English are reproduced 
in the first column of Table 3. The eight 
phonemes include the unvoiced and voiced 
stops, and the two nasals n and m. Also 
given in Table 3 is a new set of within-word- 
class frequency counts, in which the number 
of occurrences of each phoneme have been 
tallied within each of the two classes of 
words, belonging to different frequency 
groups. In every case, there is an increase
in the percent frequency of occurrence of 
high-complexity phonemes (d, b, g, and m) 


TABLE 3
FREQUENCIES OF OCCURRENCE OF ZIPF'S HIGH- AND
LOW-COMPLEXITY PHONEMES IN STIMULUS
WORDS OF TWO FREQUENCY LEVELS

                   Percent frequency    High-frequency       Low-frequency
                   in English           stimulus words       stimulus words
Phoneme            speechᵃ              Frequency     %      Frequency     %
Total phonemes                          181                  192
t                   7.13                 11          6.1      10          5.2
p                   2.04                  6          3.3       4          2.1
k                   2.71                  1           .6       4          2.1
Total unvoiced     11.88                 18          9.9      18          9.4
d                   4.31                  5          2.8      10          5.2
b                   1.81                  1           .6       4          2.1
g                    .74                  3          1.6       6          3.1
Total voiced        6.86                  9          5.0      20         10.4
n                   7.24                  8          4.4       9          4.7
m                   2.78                  4          2.2       7          3.6

ᵃ Data are taken from Zipf (1935, pp. 75 and 79).


as one goes from high-frequency words to 
low-frequency words. No such consistent 
increases are found for low-complexity pho- 
nemes (t, p, k, and n)—a result which is 
exactly as Zipf would have predicted. 

The usual statistical test of the equality 
of proportions cannot be applied to the data 
in Table 3, since it is unreasonable to assume 
that there is an equal chance (under the null 
hypothesis) for each phoneme within a word 
to belong to a particular phoneme class. In 
order to obtain an indication of the statistical 
reliability of the difference in frequencies of 
occurrence of high- and low-complexity pho- 
nemes for the two classes of words, it was 
decided instead to look at the proportions of 
words of high and low frequency that con- 
tain at least one phoneme from a given set. 
These results are given in Table 4. 

If we compare the proportions of words 
containing unvoiced and voiced stops, we see 
that while equal percentages of high- and 
low-frequency words contain one or more 
unvoiced stops (z = .47), twice as many 
low-frequency words contain one or more 
voiced stops as do the high-frequency words 
(z = 2.19, p < .025). If we include the 
nasals n and m, respectively, in the groups 
of low- and high-complexity phonemes, we 
find again that equal percentages of high- 
and low-frequency words contain at least one 
TABLE 4

PROPORTIONS OF LOW- AND HIGH-FREQUENCY STIMULUS WORDS WHICH CONTAIN
AT LEAST ONE PHONEME FROM A GIVEN SET

                                 High-frequency words     Low-frequency words
Phoneme set                      Frequency   % (P_H)      Frequency   % (P_L)       z

t, p, k (unvoiced)                  14       [illeg.]     [illeg.]    [illeg.]      .47
d, b, g (voiced)                 [illeg.]    [illeg.]     [illeg.]    [illeg.]     2.19
t, p, k, n (low complexity)         20       [illeg.]     [illeg.]    [illeg.]      .24
d, b, g, m (high complexity)        11          22        [illeg.]    [illeg.]     2.60


low-complexity phoneme (z = .24) and that 
twice as many low-frequency words contain 
at least one high-complexity phoneme as do 
high-frequency words (z = 2.60, p < .005). 

While these results bearing on the “pho- 
netic hypothesis” must be considered pre- 
liminary, they do point to the need for a 
systematic investigation of the phonetic cor- 
relates of word frequency, perhaps using dif- 
ferent techniques for sampling representative 
groups of high- and low-frequency words. 
It is not known, at present, the extent to 
which the above results are dependent on the 
particular constraints used in selecting the 
samples of high- and low-frequency stimuli 
employed in the present study. In any event, 
it is clear that with disyllabic materials, Ss, 
for whatever reason, do tend to form hy- 
potheses which are in the same frequency 
class as the stimulus word, and this tendency 
introduces systematic differences into the Ss' 
auditory recognition performance as a result 
of criterion bias in their rejection of previously 
held hypotheses. 

Finally, the existence of phonetic features 
which distinguish words of high frequency 
from words of low frequency raises the additional 
possibility that such phonetic differences 
can themselves account for the word 
frequency effect. Suppose that those phonemes 
correctly perceived by S at a given 
S/N determine a response hierarchy composed 
primarily of words belonging in the 
same frequency class as that of the stimulus 
word. Zipf’s law would then predict that the 
size of such a hierarchy would be smaller 
when the stimulus word was of high fre- 
quency than it would be if the stimulus word 


were of low frequency. This result suggests 
the possibility that on a simple (unsophisticated) 
guessing basis, high-frequency stimu- 
lus words will still be more often correctly 
guessed than will low-frequency words. The 
ability of such a mechanism to account for 
the magnitude of the word-frequency effect 
of course remains questionable, even if it can 
be established in the future that the phonetic 


differences between high- and low-frequency 
words are substantial. 
ON THE WORD FREQUENCY EFFECT: 
COMMENTS ON THE PAPERS BY J. CATLIN AND L. H. NAKATANI 

D. E. Broadbent considered four models for the word frequency effect. He pre- 
sented evidence rejecting the "sophisticated guessing" model and supporting a 
response bias model based on signal detection theory. J. Catlin and L. H. 
Nakatani have both proposed that Broadbent's arguments are faulty and that 
the "sophisticated guessing" account should be maintained. It is here suggested 
that it is their arguments that are at fault, and Broadbent's that stand. 


The tendency for common words to be 
detected at lower signal-to-noise ratios 
than rare words, or to be detected more 
often at the same signal-to-noise ratio, is 
known as the word frequency effect. 
Broadbent (1967) discussed four possible 
explanations for this, labeling them the 
“pure guessing," "sophisticated guessing," 
“observing response," and "criterion bias" 
models. Green and Birdsall (1958) ana- 
lyzed word recognition in terms of a signal 
detection model in which each word has its 
own decision axis.  Broadbent's fourth 
model is similar to this, but in developing 
it he employed Luce's (1959) choice theory 
approach to provide an approximation to 
signal detection theory. He presented 
evidence which rejected the first three 
models and favored the last one. Recently, 
however, both Catlin (1969) and Nakatani 
(1970) have proposed that his reasoning is 
fallacious, and they have each concluded 
that the sophisticated guessing theory 
should be reinstated. The present note 
critically examines their arguments. 

The sophisticated guessing model (New- 
bigging, 1961; Savin, 1963; Solomon & 
Postman, 1952) proposes that when a stimu- 
lus word is below its intelligibility threshold, 
some of its component features may never- 
theless be detectable and may serve to 
delimit a small class of words consistent 
with them. Subjects’ guesses are not then 
drawn at random from their whole vocabu- 
lary, but come from the subset of words 
so delimited, and the more common words 
are more likely to serve as guesses. Thus 
when the stimulus word presented in a 
recognition task is not detected, it is more 
probable that it will be guessed correctly if 
it is common than rare, with the result that 
frequent words will get higher correct 
detection scores. To illustrate this, let 
us assume a somewhat simplified situation 
in which presentation of the stimulus word 
results either in its detection, with probability 
$P(d)$, or in the delimitation of a set of 
words consisting of $N_H$ high-frequency 
words and $N_L$ low-frequency words, one 
word being selected from this set as a guess. 
For simplicity, assume that each high- 
frequency word has the same probability 
of appearing as a guess, and that the same 
is true for the low-frequency words, but 
that the probability of any one high- 
frequency word being guessed is $x$ times 
($x > 1$) greater than the probability of 
any one low-frequency word being guessed. 
The probability of guessing an individual 
high-frequency word, when it has not been 
detected, is then $P(g) = x/(xN_H + N_L)$; 
for a low-frequency word, it is 
$P(g) = 1/(xN_H + N_L)$. When a stimulus word 
is presented, the probability that it will be 
correctly reported is then $P(r) = P(d) 
+ [1 - P(d)]P(g)$, and the value of $x$ will 
determine the magnitude of the word frequency 
effect. 
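
The simplified model above can be written out directly. This is only an illustrative sketch in Python: the function names are invented here, and the detection probability of .10 used in the example is a hypothetical value, not one taken from any experiment.

```python
def guess_probability(x, n_high, n_low, high_frequency):
    """P(g): chance that one particular word in the delimited set is the guess.

    Each high-frequency word carries weight x (x > 1) and each low-frequency
    word carries weight 1, so the weights sum to x * n_high + n_low.
    """
    weight = x if high_frequency else 1.0
    return weight / (x * n_high + n_low)


def report_probability(p_detect, p_guess):
    """P(r) = P(d) + [1 - P(d)] P(g): correct report by detection or by guess."""
    return p_detect + (1.0 - p_detect) * p_guess


# Hypothetical illustration: 4 high- and 2 low-frequency words, x = 2.
p_g_high = guess_probability(2, 4, 2, high_frequency=True)
p_g_low = guess_probability(2, 4, 2, high_frequency=False)
p_r_high = report_probability(0.10, p_g_high)
p_r_low = report_probability(0.10, p_g_low)
print(p_g_high, p_g_low, p_r_high, p_r_low)
```

With the same detection probability for both classes of word, the whole of the difference between the two report probabilities comes from the guessing term, which is why $x$ governs the size of the effect in this model.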

The “error ratio” shall be defined as the 
ratio of the number of errors in which a 
high-frequency word is reported to the 
number of errors consisting of a low- 
frequency word when a given stimulus 
word has been presented for detection. 
The terms “high-frequency error ratio” and 
“low-frequency error ratio" will be used 
for the error ratios given when, respectively, 
high-frequency or low-frequency stimulus 
words are presented. Broadbent (1967) 
derived from the sophisticated guessing 
model the prediction that the high-fre- 
quency error ratio should be less than the 
low-frequency error ratio. If his arguments 
are applied to the present simplified illus- 
tration, they imply that when a high- 
frequency stimulus word is presented, the 
proportion of the responses which are 
high-frequency errors will be $[1 - P(d)](N_H - 1)x/(xN_H + N_L)$ 
and the low-frequency error proportion will be 
$[1 - P(d)]N_L/(xN_H + N_L)$, giving a high-frequency 
error ratio of $x(N_H - 1)/N_L$. But 
when a low-frequency stimulus word is presented, 
the error ratio will be $xN_H/(N_L - 1)$, 
which will be greater. Moreover, 
to account for the magnitude of the word 
frequency effect, the difference between the 
two error ratios must be substantial. To 
illustrate this for the present example, let 
us suppose that $P(r|HF)$, the probability 
of correctly reporting a high-frequency 
stimulus word, is .33, and $P(r|LF) = .13$, 
where HF = high frequency and LF = low 
frequency (these particular recognition 
probabilities were obtained by Broadbent, 
1967, in an experiment in which subjects 
attempted to identify monosyllables presented 
in noise). Then $P(r|HF) - P(r|LF) 
= .33 - .13 = [1 - P(d)]x/(xN_H + N_L) 
- [1 - P(d)]/(xN_H + N_L)$, which gives 
$x(5 - N_H) \geq 5 + N_L$, from which it follows 
that $N_H < 5$. This would imply a considerable 
difference between the two error 
ratios. 
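
The step from the difference in recognition probabilities to the bound on $N_H$ is compressed in the text. One way to fill it in is sketched below; it assumes, as the simplified illustration does, that $P(d)$ is the same for high- and low-frequency stimulus words, and it uses only the fact that $0 \leq 1 - P(d) \leq 1$.

```latex
\begin{align*}
P(r \mid HF) - P(r \mid LF)
  &= [1 - P(d)]\,\frac{x - 1}{xN_H + N_L} = .20 \\
\text{since } 1 - P(d) \leq 1:\qquad
  \frac{x - 1}{xN_H + N_L} &\geq .20 \\
  5(x - 1) &\geq xN_H + N_L \\
  x(5 - N_H) &\geq 5 + N_L > 0 \\
  \text{hence}\qquad N_H &< 5 .
\end{align*}
```

The last line follows because $x > 1$ and the right-hand side is positive, so $5 - N_H$ must itself be positive.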

If the sophisticated guessing model is 
correct, it appears, on Broadbent's interpre- 
tation of it, that the low-frequency error 
ratio should be found to exceed the high- 
frequency error ratio, but his experiments 
failed to demonstrate such a difference and 
Broadbent therefore rejected this theory. 
Catlin (1969), however, claims that the 
arguments which led him to predict 
unequal error ratios are wrong. His 
objection proceeds as follows. He starts 
with the assumption that on the average, 
the acoustical characteristics of high- and 
low-frequency words do not differ, that is, 
partial information will delimit similar 
acoustical neighborhoods in each case: 


Now, given that words of similar frequency . . . 
show no greater tendency to share acoustic features 
than do words of dissimilar frequency, the frequency 
distribution of words (excepting the stimulus word 
itself) in neighborhoods delineated by low-frequency 
words should be the same as the frequency distri- 
bution of words in neighborhoods delineated by 
high-frequency words [Catlin, 1969, p. 505]. 


Using an urn model, there will be $S$ balls 
for the stimulus word, $E_H$ balls for high- 
frequency potential errors contained in its 
acoustic neighborhood, and $E_L$ balls for 
the low-frequency words in the neighbor- 
hood. The error ratio, $E_H/E_L$, will then 
be independent of the frequency of the 
stimulus word, and Broadbent's experi- 
mental data, therefore, are completely con- 
sistent with sophisticated guessing theory. 

Although the suggestion that the acousti- 
cal properties of words are unrelated to 
their frequency of usage has not been 
demonstrated to be correct, and quite 
possibly does not hold exactly, it is probably 
an acceptable approximation, and has in 
practice been incorporated into sophisti- 
cated guessing theory. It underlies both 
Broadbent's and Catlin's arguments. Their 
difference lies in the implications they take 
to follow from it. Broadbent's argument 
has been discussed. Catlin claims that 
"the principle of acoustical equivalence 
implies that the frequency distribution of 
words other than the stimulus word in the 
acoustical neighborhood of a stimulus word 
will be independent of stimulus-word 
frequency [p. 505]." However, Catlin 
does not demonstrate that the exclusion of 
the stimulus word from the relevant 
acoustical neighborhood can be derived 
from the assumption that the frequency 
and acoustical characteristics of words are 
independent; he treats this as a self- 
evident consequence. It is important, 
therefore, to examine whether it does in 
fact follow from this assumption. 

In approaching this question, it will be 
useful to consider a hypothetical dialect 
of English which contains only the following 
monosyllables ending in "at": high- 
frequency words—cat, hat, pat, fat; low- 
frequency words—tat, vat. For each 
high-frequency word, we suppose $P(g) = .2$; 
for each low-frequency word, .1 (i.e., we 
take $x = 2$). It is then natural to suppose 
that if a word is presented and not recog- 
nized, but the partial cues “monosyllable” 
and “terminal -at” are detected, then these 
will delimit the whole of the “-at subset” 
above, whether these partial cues have in 
fact come from the high-frequency stimulus 
word “cat” or from the low-frequency 
stimulus word “vat.” Since these partial 
cues exclude any other word in the language 
from occurring as a response, it follows 
that if “cat” is presented but not correctly 
identified, the error ratio will be .6/.2 = 3, 
but when “vat” is presented, it will be 
.8/.1 = 8. It is not possible to envisage 
any operation on these partial cues which 
could realize Catlin's expectation that 
high- and low-frequency error ratios should 
be the same unless the assumption that the 
subject has detected only cues common to 
both the possible stimulus words under 
consideration is violated. This assumption 
requires that the word emitted as a response 
will depend on the structure of the subset 
alone, and not in any way on the actual 
stimulus word presented. When the re- 
sponse happens to coincide with the stimu- 
lus word, this will not be counted as an 
error, and thus the different error ratios 
given above necessarily follow. Obviously 
the same sort of result would be true of any 
other "common" subset, that is, any subset 
delimited by cues common to both the 
stimulus words being considered, such as, 
in this instance, “- -t” or “-a-.” 

We now need to examine whether this 
conclusion will also hold for "specific" sets, 
that is, subsets delimited by partial cues 
peculiar to one or the other stimulus word, 
but not to both. When “cat” is presented 
and not fully identified, the partial cues 
detected might be “monosyllable” and 
“ca-,” and these might determine the 
specific subset: high-frequency words—cat, 
can; low-frequency words—cad, cam. When 
“vat” is presented, detection of “va-” might 
delimit: high-frequency words—vac, van; 
low-frequency words—vat, valve. If we 
assume that for each high-frequency word 
in these subsets $P(g) = .333$, and for each 
low-frequency word, .167 (i.e., we again 
take $x = 2$), it follows that when “cat” is 
presented and “ca-” detected, the resulting 
high-frequency error ratio is .33/.33 = 1.0; 
when “vat” is presented and “va-” detected, 
the low-frequency error ratio is 
.66/.167 = 4. Thus a difference in the 
error ratios is present and is in the same 
direction as before when the subsets 
specific to each stimulus word are con- 
sidered. It is, of course, obvious that we 
get this result because we have assumed 
that the two specific subsets have essentially 
the same structure. We could avoid this 
result by assuming that the “ca-” subset 
is constituted by high-frequency words—
cat, can, cap, cab, catch, each with $P(g) 
= .167$; low-frequency words—cad, cam, 
each with $P(g) = .083$; it would then 
follow that when “cat” is presented, and 
this subset delimited, the resulting high- 
frequency error ratio would be .66/.167 = 4, 
the same ratio as for the “va-” subset. 
Evidently, by increasing the number of 
high-frequency words in the ''ca-" subset 
still further, we could produce an even 
larger high-frequency error ratio, and the 
point could be reached where the advantage 
to the overall high-frequency error ratio derived 
from trials on which specific subsets of 
this type are activated could just compensate 
[illegible] when common subsets are delimited, 
[illegible] be achieved. But this success would have 
[illegible] the language, and 
not just the comparison of “cat” and 
“vat,” then, in general, partial cues from 
high-frequency stimulus words must tend 
to delimit subsets which have a higher 
proportion of high-frequency words in them 
than [illegible] the partial cues come 
from low-frequency stimulus words. Thus, 
the two populations of partial cues must be 


different: to achieve acoustical neighbor- 
hoods which are the same when the stimulus 
word is excepted in the sense that they give 
Catlin’s prediction, the essence of the 
“principle of acoustical equivalence” has 
had to go by the board. 
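
The worked examples above (the common "-at" subset and the specific "ca-" and "va-" subsets) all follow one rule, which can be checked mechanically. The sketch below is illustrative only: `error_ratio` and the set names are invented here, the word lists are those given in the text, and exact fractions are used so that the rounded figures .33 and .167 do not obscure the results.

```python
from fractions import Fraction


def error_ratio(stimulus, high_words, low_words, x):
    """High-frequency errors divided by low-frequency errors for one subset.

    Each high-frequency word is guessed with probability x/(x*N_H + N_L) and
    each low-frequency word with probability 1/(x*N_H + N_L). A guess that
    coincides with the stimulus is not an error, so it is left out.
    """
    denom = x * len(high_words) + len(low_words)
    p_high, p_low = Fraction(x, denom), Fraction(1, denom)
    high_errors = sum(p_high for w in high_words if w != stimulus)
    low_errors = sum(p_low for w in low_words if w != stimulus)
    return high_errors / low_errors


AT_SET = (["cat", "hat", "pat", "fat"], ["tat", "vat"])
CA_SET = (["cat", "can"], ["cad", "cam"])
VA_SET = (["vac", "van"], ["vat", "valve"])
CA_ENLARGED = (["cat", "can", "cap", "cab", "catch"], ["cad", "cam"])

ratios = {
    "cat, -at subset": error_ratio("cat", *AT_SET, x=2),
    "vat, -at subset": error_ratio("vat", *AT_SET, x=2),
    "cat, ca- subset": error_ratio("cat", *CA_SET, x=2),
    "vat, va- subset": error_ratio("vat", *VA_SET, x=2),
    "cat, enlarged ca- subset": error_ratio("cat", *CA_ENLARGED, x=2),
}
for label, ratio in ratios.items():
    print(label, ratio)
```

The two ratios for the specific subsets agree only after the "ca-" subset has been given more high-frequency members than the "va-" subset, which is the departure from acoustical equivalence that the argument turns on.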

It might seem that if it will keep sophisti- 
cated guessing theory afloat, this ballast, 
though precious, could happily be sacrificed. 
But in fact, not even this will save Catlin's 
prediction comfortably. For it is not 
sufficient to jettison acoustical equivalence. 
The difference in the specific subsets that 
would be required to equalize the high- and 
low-frequency error ratios when the stimu- 
lus words we choose to compare are "cat" 
(high) and "vat" (low) will have the 
opposite effect, worsening the disparity 
between the error ratios, when it is "van" 
(high) and "cam" (low) that come under 
examination. The number of ad hoc 
assumptions that would be required to 
balance out this and all other such contrast- 
ing effects for the language as a whole 
would clearly be so large, and so arbitrary, 
that even if this is possible, it would 
obviously be quite unacceptable. Thus it 
seems fair to conclude that Catlin's defense 
of sophisticated guessing theory must fall 
away. 

Nakatani (1970) attacks Broadbent's 
distinction between the "pure guessing" 
and the signal detectability "criterion bias" 
models. He claims to demonstrate that 
the two are “mathematically equivalent"; 
this rests on a proof that the models 
“both predict the same set of confusion 
matrices." If this demonstration is valid, 
its implications are far-reaching; they re- 
quire to be considered carefully. Nakatani 
proves this "equivalence" for 4 X 4 con- 
fusion matrices, but as he notes, his proof 
can be generalized to any confusion matrix. 
In assessing what weight should be given 
to his argument, it will be convenient to 
apply it to the 2 X 2 probability matrix 
for the standard yes-no signal detection 
task. In terms of what Swets (1961) has 
called “high-threshold theory,” this is pre- 
dicted by the following matrix: 

\[
M_1 =
\begin{array}{c|cc}
 & \text{Yes} & \text{No} \\ \hline
\text{SN} & p + g - pg & (1 - p)(1 - g) \\
\text{N} & g & 1 - g
\end{array}
\qquad [1]
\]

$M_1$ gives the probabilities of the responses 
Yes and No when the subject is presented 
with signal plus noise (SN) or with noise 
(N) alone. The true probability of detect- 
ing the signal is given by p; the correspond- 
ing probability for noise is effectively zero, 
and there is a guessing rate parameter, g. 
Using Luce's (1959) "choice theory” param- 
eters to give an approximation to the signal 
detectability model, we get: 


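\[
M_2 =
\begin{array}{c|cc}
 & \text{Yes} & \text{No} \\ \hline
\text{SN} & \dfrac{\alpha}{\alpha + v} & \dfrac{v}{\alpha + v} \\[1ex]
\text{N} & \dfrac{1}{1 + v} & \dfrac{v}{1 + v}
\end{array}
\qquad [2]
\]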


where $\alpha$ and $v$ are parameters relating to the 
separation between SN and N means, and 
the criterion level, respectively (Broadbent, 
1967; Luce, 1959). The pure guessing 
model is essentially a generalization of high- 
threshold theory, and Broadbent uses 
Luce's parameters in developing his cri- 
terion bias model. 

In this case, Nakatani's argument should 
proceed as follows. By applying the 
substitutions 


\[
v = \frac{1 - g}{g}, \qquad
\alpha = \frac{p + g - pg}{g(1 - p)}
\qquad [3]
\]


or their converses, any matrix in the set 
predicted by high-threshold theory can be 
shown to be contained also in the set 
predicted by the response bias model, and 
vice versa. This renders the models 
"formally equivalent." 

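The mapping between the two parameterizations can be verified numerically. The sketch below is illustrative: the function names are invented here, the substitutions are $v = (1 - g)/g$ and $\alpha = (p + g - pg)/[g(1 - p)]$ as they follow from equating the cells of the two matrices, and the values $p = .6$, $g = .2$ are arbitrary. It assumes $0 < g < 1$ and $p < 1$.

```python
def high_threshold_matrix(p, g):
    """Rows SN, N; columns Yes, No, for high-threshold theory."""
    return [[p + g - p * g, (1 - p) * (1 - g)],
            [g, 1 - g]]


def choice_matrix(alpha, v):
    """Rows SN, N; columns Yes, No, for the choice-theory approximation."""
    return [[alpha / (alpha + v), v / (alpha + v)],
            [1 / (1 + v), v / (1 + v)]]


def to_choice_parameters(p, g):
    """Choice-theory parameters that reproduce a given high-threshold matrix."""
    v = (1 - g) / g
    alpha = (p + g - p * g) / (g * (1 - p))
    return alpha, v


m1 = high_threshold_matrix(0.6, 0.2)
alpha, v = to_choice_parameters(0.6, 0.2)
m2 = choice_matrix(alpha, v)
print(m1)
print(m2)
```

A single matrix fitted by one model can thus always be reproduced by the other; what the check does not show is that the two models agree once a whole family of matrices, generated by varying the bias parameter alone, is considered.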
From this application of Nakatani’s 
argument, it is evident that if it is correct 
to accept that Broadbent's distinction 
between the pure guessing model and his 
“apparently new” criterion bias model is 
“misleading” because the models are ‘‘form- 
ally identical” (Nakatani, 1968), then we 
are obliged also to accept that Tanner and 
Swets’ (1954) distinction between threshold 
theory and signal detection theory is 
equally vacuous. This will not be generally 
acceptable, nor is it difficult to find reasons 
for questioning this argument. First, we 
note that in the yes-no case, each matrix 
represents a possible data point on or above 
the positive diagonal in the unit square. 
What Nakatani's equivalence establishes, 
and all that it establishes, is that any point 
which can be fitted by the one model, in 
the sense that that point could be derived 
from allowable parameter values, can also 
be fitted by the other. That is, each model 
can accommodate the same range of 
potential data. However, being applicable 
to the same data is not a sufficient criterion 
for establishing the identity of alternative 
models. Indeed, on this argument, any 
further theory, for example, Luce's (1963) 
version of threshold theory, which is 
capable of placing a receiver operating 
characteristic curve at any specified point 
on or above the positive diagonal would 
also be "formally identical" with the two 
models of the yes-no task discussed above. 

It is generally accepted that models 
which generate the same predictions can 
be regarded as equivalent. But the mini- 
mum requirement for this conclusion is 
not that a given prediction, or class of 
predictions, should be identical, but that 
all possible predictions of both models 
should be the same. In the present case, 
the models are clearly capable of producing 
contrasting consequences. Thus, the re- 
ceiver operating characteristic curve, which 
can be derived from $M_1$ by eliminating $g$, 
is linear, whereas $M_2$ gives a nonlinear 
curve symmetric about the negative di- 
agonal. Similar differences can be derived 
from the pure guessing and criterion bias 
word frequency effect models. 
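
The contrast between the two receiver operating characteristic curves can be made concrete. In this sketch (function names and parameter values are illustrative assumptions), the hit rate is expressed as a function of the false-alarm rate for each model: eliminating $g$ from $M_1$ gives a straight line, and eliminating $v$ from $M_2$ gives a curve that is unchanged when a point (false alarm, hit) is reflected to (1 - hit, 1 - false alarm).

```python
def high_threshold_hit_rate(p, false_alarm):
    """M1 with g eliminated: P(Yes|SN) = p + (1 - p) * P(Yes|N)."""
    return p + (1 - p) * false_alarm


def choice_hit_rate(alpha, false_alarm):
    """M2 with v eliminated, using P(Yes|N) = 1/(1 + v)."""
    v = (1 - false_alarm) / false_alarm
    return alpha / (alpha + v)


fa_low, fa_mid, fa_high = 0.2, 0.5, 0.8

# Linearity: the midpoint of the curve lies on the chord for M1 but not M2.
ht_gap = high_threshold_hit_rate(0.6, fa_mid) - 0.5 * (
    high_threshold_hit_rate(0.6, fa_low) + high_threshold_hit_rate(0.6, fa_high))
choice_gap = choice_hit_rate(4.0, fa_mid) - 0.5 * (
    choice_hit_rate(4.0, fa_low) + choice_hit_rate(4.0, fa_high))

# Symmetry about the negative diagonal for M2.
hit = choice_hit_rate(4.0, fa_low)
mirrored_hit = choice_hit_rate(4.0, 1 - hit)
print(ht_gap, choice_gap, hit, mirrored_hit)
```

Data collected at several bias settings could therefore in principle separate the two models, even though any single point can be fitted by either.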

Nakatani presents a further argument for 
rejecting the criterion bias model. Applied 
to matrix $M_2$ it would run as follows. 
If $P(\mathrm{No}|\mathrm{SN}) = v/(\alpha + v) = 0$, then we 
must have $v = 0$. But then we should 
get $P(\mathrm{No}|\mathrm{N}) = v/(1 + v) = 0$. However, 
data may give $P(\mathrm{No}|\mathrm{SN}) = 0$, but $P(\mathrm{No}|\mathrm{N}) 
> 0$. Thus, the signal detectability 
model (or the choice theory approximation 
to it) is not reasonable and cannot be 
maintained. Extended to larger matrices, 
the argument is that if one cell in a column 
has a zero value, so should all. Since 
confusion matrix data that contradict this 
prediction are available, the criterion bias 
model “is clearly contrary to intuition and 
data” and must be rejected. 

Unfortunately this rests on an assumed 
infallibility of data. It does not allow for 


MICHEL TREISMAN 


random error. In practice, it is never 
possible to say with complete certainty that 
a zero response rate represents a true zero 
probability, rather than a small nonzero 
value. For this reason, it has never been 
suggested that experimental findings of the 
form P(No|SN) = 0, P(No|N) > 0, re- 
quire the rejection of signal detectability 
theory (or its "choice theory" approxi- 
mation). For the same reason, this sort of 
finding constitutes unimpressive evidence 
against the criterion bias model for word 
recognition. 
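
The point about random error can be illustrated with a simple binomial calculation. The figures below are hypothetical and chosen only for illustration: if the true probability of a response is small but not zero, an observed count of zero over a modest number of trials is far from unlikely.

```python
def probability_of_zero_count(true_probability, n_trials):
    """Chance of observing no responses at all in n independent trials."""
    return (1.0 - true_probability) ** n_trials


# Hypothetical case: true P(No|SN) = .02, with 50 signal-plus-noise trials.
chance_of_empty_cell = probability_of_zero_count(0.02, 50)
print(round(chance_of_empty_cell, 3))
```

Under these assumed values an empty cell would be observed in roughly a third of such experiments, so an observed zero carries little weight against a model that predicts a small positive probability.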

Finally, both Catlin (1969) and Nakatani 
(1970) emphasize that the criterion bias 
model imposes no restriction on the words 
which may manifest as errors, whereas 
sophisticated guessing theory suggests, and 
Savin (1963) has found, that errors tend to 
be acoustically similar to the stimulus 
word. Newbigging (1961) has reported 
similar findings for visual stimuli. How- 
ever, these findings pose no great difficulty 
for a signal detection theory of word 
recognition. It is entirely possible to 
suppose that when partial cues compatible 
with a limited number of words are received 
from a stimulus, the perceptual system will 
produce a rise in the mean value of the 
central distribution on the decision axis 
corresponding to each of those words. 
Erroneous responses would then manifest 
the joint effects of the lowered criteria 
applying to high-frequency words and the 
rise in the means of the corresponding 
central distributions suffered by words 
acoustically similar to the stimulus word, 
and this would explain the clustering of 
errors found by Savin (1963) and others. 
PSYCHOPHYSICAL MEASUREMENT 


Two classes of ratio-scaling procedures are outlined—magnitude matching 
and ratio matching—and their assets and liabilities are noted. Partition- 
scaling procedures, which are supposedly designed for interval scaling, pro- 
duce results that can be described by a power function with a virtual or 
"as if" exponent. Since the virtual exponent is smaller than the actual 
exponent of the continuum, the category scale is nonlinear. The virtual 
exponent provides a convenient descriptor of several kinds of partition 
operations. Other topics discussed include individual differences among 
subjects’ exponents, procedures of averaging, and the effects of stimulus 
range on exponents. It is suggested that the power law asserts a nomothetic 
imperative. 


The task here is to review the matching 
procedures used to determine the power func- 
tions that govern the growth of sensation 
magnitude and to consider some of the 
sources of deviation and perturbation that 
have raised questions concerning the nomo- 
thetic quality of the psychophysical power 
law. 

Since all procedures of measurement in- 
volve matching operations, the interesting 
differences among different scales and differ- 
ent kinds of measurement can often be re- 
duced to a basic question: What was matched 
to what, and how? In the domain of psy- 
chophysics, numerous scaling methods have 
been invented, many of them useful for the 
determination of ratio scales of apparent 
magnitude. The approaches of ratio scaling 
can be catalogued in different ways, but for 
present purposes they fall into two general 
classes: magnitude matching, which includes 
the subclasses (a) cross-modality matching, 
(b) magnitude estimation, and (c) magni- 
tude production; and ratio matching, which 
includes the subclasses (a) cross-modal ra- 
tio matching, (b) ratio estimation, and (c) 
ratio production. 

Since there are endless variations on psy- 
chophysical procedures, it is possible here to 
comment on only a few of the features that 
characterize the principal methods. Con- 
sideration is also given to some of the in- 
terval or partition methods, and to the dis- 
tinction between the virtual exponent and 
the actual exponent. Other problems dis- 
cussed concern individual differences, aver- 
aging, and range effects. 


MAGNITUDE MATCHING 


These procedures include all direct equa- 
tions between two continua. Three principal 
varieties of magnitude matching have been 
distinguished. 


Cross-Modality Matching 

When the stimulus for a continuum can be 
readily varied by means of a control of some 
kind, it becomes possible to match that con- 
tinuum to any other continuum. Figure 1 
gives examples of matching functions pro- 
duced when several different continua were 
matched to vibration on the fingertip. 

Ideally, the experiment comparing two 
continua should provide for a balanced de- 
sign in which each continuum is matched in 
turn to the other continuum. A balanced 
procedure may help to assess and correct the 
regression effects that are always present in 
the matching operation (Stevens & Green- 
baum, 1966). In the typical experiment, the 
observer tends to shorten the range of which- 
ever variable he controls. Even within the 
same modality the regression effect shows up 
in matching functions. Thus, two somewhat 
different functions were obtained, depending 
on which auditory stimulus the subject ad- 
justed in matching a tone to a noise. 

FIG. 1. Equal sensation functions obtained by cross-modality matches be- 
tween various continua and a 60-Hz. vibration on the fingertip. (The vi- 
bration amplitudes were set by the experimenter. The observer adjusted the 
other stimulus to produce an apparent match—from Stevens, 1968a.) 
[Axes: relative vibration amplitude (60 Hz) plotted against relative intensity 
of criterion stimulus.] 

Because it is often difficult to give the sub- 
ject control of the stimulus, many cross- 
modality comparisons have not yet been 
made. It is difficult, for example, for the 
subject to vary the heaviness of lifted weights 
in order to match heaviness to loudness. 

When a balanced design in the matching 
of two continua is impracticable, an evalua- 
tion of the regression effect may sometimes 
prove possible by way of a third continuum. 
Two principal paradigms can be distin- 
guished. 

1. Continuum A is adjusted to match each 
of two continua, B and C. The ratio of the 
exponents of the matching function A to B 
and A to C determines the exponent of the 
function relating B to C. The derived func- 
tion would presumably be free of the regres- 
sion effect, provided the regression oc- 
casioned by adjusting A remained constant 
when the criterion continuum was changed 
from B to C. The hoped-for constancy 
might be upset by such factors as disparities 
in difficulty or range. 

2. A second procedure for counterbalanc- 
ing the regression effect can be utilized 
whenever it is possible to match both A and 


B to a common continuum C. The two 
matching functions provide exponents whose 
ratio determines the exponent of the power 
function relating A to B. Whether the re- 
gression effects in the two matching func- 
tions A to C and B to C are exactly equal 
may not be known, of course, but the pro- 
cedure may still cancel a major portion of the 
regression bias. 
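
The arithmetic behind the second paradigm can be sketched as follows. The example is hypothetical throughout: the exponents .6 and .9 and the stimulus levels are invented, the matches are taken to be error-free power functions, and `loglog_slope` is simply an ordinary least-squares slope in log-log coordinates.

```python
import math


def loglog_slope(xs, ys):
    """Least-squares slope of log10(y) on log10(x): the power-function exponent."""
    lx = [math.log10(x) for x in xs]
    ly = [math.log10(y) for y in ys]
    mean_x = sum(lx) / len(lx)
    mean_y = sum(ly) / len(ly)
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(lx, ly))
    sxx = sum((a - mean_x) ** 2 for a in lx)
    return sxy / sxx


# Hypothetical matches of A and of B to the same levels of continuum C.
c_levels = [1.0, 10.0, 100.0, 1000.0]
a_matches = [c ** 0.6 for c in c_levels]
b_matches = [c ** 0.9 for c in c_levels]

slope_a_on_c = loglog_slope(c_levels, a_matches)
slope_b_on_c = loglog_slope(c_levels, b_matches)
derived_a_on_b = slope_a_on_c / slope_b_on_c      # ratio of the two exponents
direct_a_on_b = loglog_slope(b_matches, a_matches)
print(derived_a_on_b, direct_a_on_b)
```

In this idealized case the ratio of the two exponents reproduces the exponent relating A to B exactly; with real matches, any regression bias common to both functions would tend to cancel in the ratio, which is the rationale the text gives for the procedure.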

An instance of the second paradigm was 
provided by Moskowitz (1969), who asked 
observers to match both numbers and loud- 
ness to a wide variety of taste mixtures. 
From each of the 68 pairs of experiments 
that Moskowitz conducted we can derive an 
estimate of the power-function exponent re- 
lating number to loudness. The geometric 
mean of the 68 estimates was .67, which 
agrees with the value of the exponent 
adopted for loudness calculation (Stevens, in 
press). The standard deviation was .51 
decilog, or about 12%. That degree of scat- 
ter may be regarded as an empirical guide to 
the amount of variability to be expected in 
tests of transitivity among the exponents of 
the power functions. When the regression 
effect has been canceled, about two-thirds of 
the measured exponents may be expected to 
lie within plus or minus half a decilog of the 
predicted exponents (see Stevens, 1969). 
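
The conversion between decilogs and percentages used above can be made explicit. The sketch assumes that a decilog is one-tenth of a common logarithmic unit, which is consistent with the statement that .51 decilog is about 12%; the function names and the sample values passed to `geometric_mean` are illustrative.

```python
import math


def decilog_to_percent(decilogs):
    """Percentage change equivalent to a deviation expressed in decilogs."""
    return (10.0 ** (decilogs / 10.0) - 1.0) * 100.0


def geometric_mean(values):
    """Geometric mean, the average appropriate to ratio-scaled exponents."""
    return math.exp(sum(math.log(v) for v in values) / len(values))


scatter_percent = decilog_to_percent(0.51)
half_decilog_percent = decilog_to_percent(0.5)
example_mean = geometric_mean([1.0, 100.0])  # arbitrary illustrative values
print(round(scatter_percent, 1), round(half_decilog_percent, 1), example_mean)
```

On this reading, "plus or minus half a decilog" corresponds to a band of roughly 12% on either side of the predicted exponent.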


The regression effect, of course, is only 
one of the sources contributing to the sys- 
tematic errors that affect the outcome of ex- 
periments, but it is one of the most obstinate, 
and therefore perhaps the most important. 
And it may be composed of more than a 
single factor. 


Magnitude Estimation 


This procedure is actually a form of cross- 
modality matching in which numbers are 
matched to stimuli. When first used, the 
procedure was called absolute judgment 
(Stevens, 1953), later numerical estimation 
(Stevens, 1954), and still later magnitude 
estimation (Stevens, 1955a). That last 
name appears to have stuck. In this con- 
text, the number continuum can be regarded 
as another perceptual modality. Magnitude 
estimation or "number matching" has be- 
come a popular method, mainly because of its 
convenience. The subject brings the num- 
bers with him, so to speak, and the experi- 
menter needs only to provide the target 
stimuli to which the numbers are to be 
matched. The nature of the task can be 
portrayed in terms of a typical set of written 
instructions to the subject. 


You will be presented with a series of stimuli in 
irregular order. Your task is to tell how — they 
seem by assigning numbers to them. Call the first 
stimulus any number that seems to you appropriate. 
Then assign successive numbers in such a way that 
they reflect your subjective impression. For ex- 
ample, if a stimulus seems 20 times as —, assign 
a number 20 times as large as the first. If it 
seems one-fifth as —, assign a number one-fifth as 
large, and so forth. Use fractions, whole numbers, 
or decimals, but make each assignment proportional 
to the — as you perceive it. 


Experience has shown that it is usually 
better not to designate a standard. The sub- 
ject then remains free to choose his own 
modulus. If possible, stimuli should be pre- 
sented in a different irregular order to each 
subject, but the first stimulus is usually 
chosen from among those in the middle re- 
gion, rather than from one end of the range. 
Between 10 and 20 stimuli may be presented 
at a session. A good schedule provides for 
one judgment, or at the most two judgments, 
per stimulus by each subject. After the subject
has learned to recognize a particular
stimulus, little or no new information is ob- 
tained from subsequent judgments of its 
repeated presentation. Furthermore, biases 
due to range and spacing of stimuli seem to 
have less effect when the subject is limited 
to one judgment per stimulus. 


Untrained, inexperienced college subjects 
seem to do as well at the matching tasks as 
those who have had many years of practice. 
Hence, there is no need to "train" the sub- 
jects. Indeed, since there is no right or 
wrong to the subjects' responses, it is not 
clear what would be meant by training. 
Under some circumstances, the nature of the 
task may profitably be clarified by allowing 
the subjects to begin by matching numbers 
to an easier continuum, such as apparent 
length of lines, or apparent size of circles. 

Averaging can be done by computing geometric
means or medians. The log-log slope
(exponent) determined by the geometric
means is not affected by the fact that each
observer uses a different unit or modulus.
When it is desired to adjust the judgments
to a common modulus, a good method is to
minimize the squares of the individual subjects'
intercept differences. The procedure
is: convert all scores to logs, compute the grand
mean of the logs, and adjust each log score for
each observer by whatever additive constant
makes the observer's mean correspond to the
grand mean. That procedure of modulus
equalization permits each of an observer's
estimates to contribute to the correction factor
to be applied to that observer's modulus.
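
The three steps of modulus equalization can be sketched in a few lines. This is a minimal illustration, assuming that every observer judged the same set of stimuli once; the observers and the numbers are hypothetical, and the function name is not from the original.

```python
import math


def equalize_modulus(estimates):
    """Adjust each observer's estimates to a common modulus.

    estimates: dict mapping observer -> list of positive magnitude estimates,
    one per stimulus (same stimuli for every observer; an assumption).
    """
    # Step 1: convert all scores to logs.
    logs = {obs: [math.log10(x) for x in xs] for obs, xs in estimates.items()}
    # Step 2: compute the grand mean of the logs.
    all_logs = [v for vs in logs.values() for v in vs]
    grand_mean = sum(all_logs) / len(all_logs)
    # Step 3: shift each observer's logs so that his mean equals the grand mean.
    adjusted = {}
    for obs, vs in logs.items():
        shift = grand_mean - sum(vs) / len(vs)
        adjusted[obs] = [10 ** (v + shift) for v in vs]
    return adjusted


# Hypothetical data: two observers who differ only in modulus.
raw = {"A": [1.0, 2.0, 4.0], "B": [10.0, 20.0, 40.0]}
adjusted = equalize_modulus(raw)
print(adjusted)
```

Because the adjustment is an additive constant in logs, the ratios among an observer's own estimates, and hence his log-log slope, are left unchanged.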


Magnitude Production 


Here the experimenter presents the num- 
bers one at a time in irregular order, and the 
subject adjusts the stimulus to produce an 
apparent match. The numbers themselves 
should normally approximate a geometrical 
progression. For example, in an extensive
study of loudness and its inverse, softness,
the successive numbers presented were in the
ratio 2 to 1 and ranged from 1.25 to 80
(Stevens & Guirao, 1962). Sample results
are shown by the triangles in Figure 2.

Because of the regression effect, the power
functions obtained by magnitude production
are typically steeper (have larger exponents)
than those obtained by magnitude estima- 
tion. An unusually large regression angle is 
illustrated in Figure 2. It is often assumed 
that the unbiased function lies between the 
two functions obtained by estimation and 
production, and indeed it may. But where? 
Are the error sources in the one procedure 
exactly balanced by the error sources in the 
other? Although exact balance may be pos- 
sible, it seems hardly likely that no asym- 
metry exists. An average function may 
nevertheless be desired, in which case it may 
be well to compute the geometric mean of
the two exponents. The geometric mean is 
invariant under an interchange of the two 
coordinates (Indow & Stevens, 1966). 
When the results of magnitude estimation 
and production are combined in an appropri- 
ate way, the combined procedure offers ad- 
vantages over either procedure alone. 
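
One way to read the invariance claim is that interchanging the coordinates turns each slope into its reciprocal, so a satisfactory average should also turn into its reciprocal. The sketch below checks that property for the geometric and the arithmetic mean; the two exponents are hypothetical values chosen only for illustration.

```python
import math


def geometric_mean(a, b):
    return math.sqrt(a * b)


def arithmetic_mean(a, b):
    return (a + b) / 2.0


# Hypothetical exponents from estimation and production.
estimation, production = 0.5, 0.8

# Interchanging the two coordinates replaces each slope by its reciprocal.
gm = geometric_mean(estimation, production)
gm_swapped = geometric_mean(1.0 / estimation, 1.0 / production)
am = arithmetic_mean(estimation, production)
am_swapped = arithmetic_mean(1.0 / estimation, 1.0 / production)

# A product of 1 means the averaged line is the same line in both plots.
print(gm * gm_swapped, am * am_swapped)
```

The geometric mean describes the same line whichever variable is plotted on the abscissa; the arithmetic mean does not.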

A particular version of the combined pro- 
cedure designed to produce a balanced func- 
tion has been spelled out by Hellman and 
Zwislocki (1968) and applied to loudness 
functions. 

Another way of combining some of the
features of production and estimation is to
permit the subject to set stimulus levels at 
his own pleasure and to report the apparent 
magnitude. The experimenter in effect gives 
up all control over the stimuli. A radical 
procedure of that kind was used by J. C. 
Stevens and Guirao (1964) to show that in- 
dividual subjects produce power functions 
and that the power function is not, as some 
writers had suggested, an artifact of averag- 
ing. The key parts of the instructions were 
as follows: 
Your task is to set the tone to different levels of 
loudness and to assign numbers to each of the 
loudnesses. Make your numbers proportional to 
the loudness you hear. You may make as many 
settings as you want. Try to cover a wide range 
of loudness. 

FIG. 2. Magnitude estimation and magnitude
production of loudness. (Each point is the geometric
mean of two estimates or two productions
by each of 10 observers.) (Reprinted with permission
from an article by J. C. Stevens and M.
Guirao published in the Journal of the Acoustical
Society of America, 1964, Vol. 36. Copyrighted
by the Acoustical Society of America, 1964.)


Eleven subjects, chosen at random from
among students, staff, and secretaries, gave
the results shown in Figure 3. One subject
made as few as seven production estimations.
Another made five times that many. Two
subjects made settings that extended over
almost 100 decibels (db.) and made estimations
that ranged over about 10,000 to 1,
which implies an exponent near .8. The experiment
was repeated some months later.
The geometric mean of the 22 exponents for
the 11 subjects was .7, which is fairly close
to the value 2/3, which has been proposed as
the standard value (Stevens, in press).
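
The exponent "near .8" follows from dividing the logarithmic range of the estimates by the logarithmic range of the stimuli. A minimal sketch, assuming that the decibels refer to sound pressure (20 db. per log unit); the function name is illustrative.

```python
import math


def implied_exponent(estimate_ratio, stimulus_range_db):
    """Slope of the log-log function implied by the two ranges.

    Assumes decibels of sound pressure, so 20 db. equals one log10 unit.
    """
    log_estimate_range = math.log10(estimate_ratio)
    log_pressure_range = stimulus_range_db / 20.0
    return log_estimate_range / log_pressure_range


# Estimates spanning 10,000 to 1 over settings spanning about 100 db.
exponent = implied_exponent(10_000, 100)
print(exponent)
```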


RATIO MATCHING


The earliest form of ratio matching ap- 
pears to have been devised by Merkel (1888) 
in order to determine what he called the 
“doubled stimulus.” It was a direct ratio- 
scaling method, but its potentialities were 
not effectively exploited. The method would 
now be classed as one of the varieties of ratio 
production. It is convenient to distinguish 
three subclasses of ratio matching, as follows:


Cross-Modal Ratio Matching 


J. C. Stevens set two different bright- 
nesses in front of the subject and asked him 
to adjust one of two noises to make the ratio 
of the noises match the apparent ratio of the 
brightnesses (see S. S. Stevens, 1961a, pp.
17-18). The results confirmed the relative
values of the exponents for loudness and 
brightness, showing, in fact, that the two 
exponents are approximately equal. 


Ratio Estimation 


Here the subject matches numerical ratios 
to apparent stimulus ratios. In the “complete”
version of the procedure, stimuli are
presented in all possible pairs and the ap- 
parent ratios are estimated (Ekman, 1958). 
Other versions use fewer stimulus pairings, 
and some versions provide for reporting in 
terms of fractions or percentages. For ex- 
ample, in an early experiment, Ham and 
Parkinson (1932) presented a sound at one 
level followed by a sound at a lower level, 
and asked the subjects to estimate what per- 
centage of the loudness remained. 


Ratio Production 


In this once-popular scaling procedure, the 
subject is required to find or produce the 
stimulus that seems to stand in a prescribed 
relation to a standard stimulus. As we have 
seen, Merkel invented that kind of task with 
his method of doubled stimulus. Fractionation
is the name commonly used for procedures
that require the subject to set a
stimulus to one-half (or some other fraction)
of the standard. (For a tabulation of many
of the numerous ratio productions that have
been made with acoustic stimuli, see Stevens,
1955a.)

FIG. 3. [Caption incomplete in source] ... when each of the 11 observers set the stimulus level and esti[mated] ... (Each point represents a judgment. There was no averaging. From J. C. Stevens ...)

Ratio production has fallen into disuse 
mainly because magnitude matching seems to 
be a superior procedure. The biases in ratio 
production are such that the method often 
fails to produce a clean power function. 


INTERVAL MATCHING AND VIRTUAL 
EXPONENTS 


Although the judgment of intervals or differences,
as required in various kinds of partitioning
operations, may produce satisfactory
results on metathetic continua, systematic
biases afflict partitions carried out on prothetic
continua. Furthermore, the partitioning
operations can produce at best an interval
scale, not a ratio scale. Nevertheless, one or
another form of interval matching has produced
data that have played a role in the
establishment of the psychophysical power
law (Stevens, 1953).

In order to describe the procedures used
and the results obtained in the various kinds
of partition operations, it is convenient to 
distinguish two exponents: a virtual or func- 
tional exponent and the actual exponent of 
the continuum in question. The virtual ex- 
ponent is the one the observer appears to be 
using when he makes his partition judgments. 
It is an “as if” exponent. The value of the 
virtual exponent $\alpha$ turns out to be lower
than that of the actual exponent $\beta$. Since
$\alpha < \beta$, the scales created by partitioning are
nonlinear relative to the corresponding mag- 
nitude scales created by magnitude or ratio 
matching. 

Perhaps the best known example of a 
partition scale is the Munsell scale for the 
lightness of grays. That scale has been 
determined and redetermined by several 
kinds of partition operations. A series of 
gray papers may also be scaled by magnitude 
estimation, as was shown by Stevens and 
Galanter (1957). The virtual exponent of 
the Munsell scale is approximately .33 and 
is decidedly lower than the actual exponent, 
approximately 1.2, obtained by magnitude 
estimation. 
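
The nonlinearity between a partition scale and a magnitude scale can be made concrete. If both are taken to be exact power functions of the same stimulus variable (an idealization), the partition value equals the magnitude value raised to the ratio of the two exponents. The sketch uses the Munsell exponents quoted above; the function name and the sample magnitudes are illustrative.

```python
def partition_from_magnitude(magnitude, virtual=0.33, actual=1.2):
    """Partition-scale value corresponding to a magnitude-scale value.

    Assumes magnitude = stimulus**actual and partition = stimulus**virtual,
    so that partition = magnitude**(virtual / actual).
    """
    return magnitude ** (virtual / actual)


# Equal steps on the magnitude scale...
magnitudes = [1.0, 2.0, 3.0, 4.0, 5.0]
partition = [partition_from_magnitude(m) for m in magnitudes]
# ...become progressively smaller steps on the partition scale.
steps = [b - a for a, b in zip(partition, partition[1:])]

print([round(s, 3) for s in steps])
```

The shrinking steps correspond to a curve that is concave downward when the partition scale is plotted against the magnitude scale.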

Let us now consider four varieties of in- 
terval scaling procedures. 


Cross-Modal Interval Matching 


This procedure seemed to work well in 
a 1953 experiment when subjects adjusted 
markers along a line (position, a metathetic 
continuum) in order to match the apparent 
spacing of a series of loudnesses. Subjects 
also matched marker position to the apparent 
spacing of the heaviness in a series of lifted 
weights. The same principle is involved, of 
course, in numerous rating scales: the sub- 
ject expresses his opinion by marking a posi- 
tion on a line. Newhall (1950) used a some- 
what similar method, involving markers on 
a two-dimensional grid, in order to deter- 
mine spacings among the apparent light- 
nesses of gray papers. His results agreed 
with the Munsell scale. As a general 
method, however, interval matching suffers 
from a basic ambiguity, especially when 
metathetic position is not one of the continua 
used. If two prothetic continua are involved, 
the subject may find it easier to match ratios 
than differences, and without intending it, he
may actually produce ratio matches between 
the pairs of stimuli on the two continua. 

Efforts to judge intervals often encounter 
a dramatic hysteresis effect, which makes the 
judgment highly contingent on the order in 
which the stimuli are presented (Stevens, 
1957b). 


Interval Estimation 


Here the subject may be asked to assign 
numbers to represent the sizes of apparent 
differences. For example, Dawson (1968) 
presented pairs of loudnesses and asked the 
observers to make a magnitude estimation of
the apparent difference in each pair. He
also presented pairs of visual areas. The
typical biases that emerge under partitioning
procedures were apparent in the results,
especially in the judgments of loudness differences.
A constant loudness difference (in
sones) is not judged to be constant; rather a 
given difference is judged smaller when it is 
moved up the stimulus scale. 

Another demonstration of the bias in in- 
terval judgments—the operation of a virtual 
exponent—is contained in the results of Beck 
and Shaw (1967) who asked 28 subjects to 
judge four loudness intervals, 5, 10, 15, and 
20 sones in width, each located at four stim- 
ulus levels. The median estimations for 
three of the interval sizes are shown in 
Figure 4 as a function of the sound pressure 
level of the tone at the lower end of the in- 
terval. The curved lines show the general 
trend of the data. 

If the world were so constructed that equal 
prothetic intervals appeared equal to the perceiving
subject, the lines in Figure 4 would
be straight and horizontal. The downward 
trend of the data in Figure 4 illustrates the 
typical result obtained in partition judgments 
of whatever variety: equal intervals are not 
judged equal at different locations on a prothetic
continuum. As an interval of a constant
size moves up the scale of the continuum,
the constant interval is judged to be
smaller and smaller.

FIG. 4. Showing how the judgment of an interval
of a constant size depends on the location of the
interval. (Observers made magnitude estimations
of sets of intervals 5, 10, 20 sones wide. In another
set of experiments (triangles) the intervals were
approximately 30 sones wide. The stimulus level
at the bottom end of the interval is shown by the
abscissa. The ordinate gives relative values only.
As a constant interval moves upward in sound pressure
level, the perceived size of the interval decreases.
The family of three curves was generated
by assuming that instead of the actual exponent of
the sone scale .6, the observers used a virtual or “as
if” exponent equal to .3. Triangles from Dawson,
1968; other data from Beck & Shaw, 1967.)


The curves in Figure 4 were generated by
a partition model in which it was postulated
that the observer's judgments are governed
by a power law that does not have the actual
exponent of the continuum, but rather has a
virtual or functional exponent equal to .3.
The fit of the curves is only fair, for the data
do not provide enough information to distinguish
between the family of functions generated
by the virtual exponent value .3 and
the family given by some other nearby exponent.
If the observer’s virtual exponent
were .6, it would correspond to the actual
exponent, and the lines in Figure 4 would
then become straight and horizontal. As the
virtual exponent becomes smaller, the family
of curves tilts more steeply downward, and
the distance between the curves decreases.
To a first approximation, then, it appears
that the observer judges loudness intervals
as if his power function had a virtual exponent
about half as large as the actual
exponent of the continuum. That principle
was rather nicely confirmed in a second
experiment by Beck and Shaw (1967) in
which 29 observers made magnitude estimations
of another set of loudness intervals,
intervals that were constructed to be constant 
in size as determined by the lambda scale, a 
scale that was constructed so as to agree with 
a particular set of bisection data (Garner, 
1954). Over the stimulus range of interest 
here, the lambda scale has an effective ex- 
ponent of approximately .3. In other words, 
the "constant" intervals provided by the ex- 
perimenters were generated by a function 
whose exponent coincided with that of the 
virtual exponent. As we should expect, 
therefore, the judged size of the intervals 
did not show a downward drift with increas- 
ing stimulus level. In fact, when the judg- 
ments are plotted as in Figure 4, but with 
lambda interval rather than sone interval as 
the parameter, the data describe functions 
that are very nearly horizontal. Thus the 
principle is clear: When the generating func- 
tion used to set up the equal intervals has 
the same exponent as the virtual operating 
function employed by the observers in their 
partition judgments, then the intervals all 
appear equal. 
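
The partition model described above can be sketched numerically. The code below is an illustration under stated assumptions, not the computation used for Figure 4: loudness is taken as 1 sone at 40 db. and is assumed to double with every 10 db. (approximately what an exponent of .6 on sound pressure gives), and the judged size of an interval is taken as the difference between its end points after the sone values are raised to the ratio of the virtual to the actual exponent. The function names are hypothetical.

```python
def sones_from_db(level_db):
    """Loudness in sones, assuming 1 sone at 40 db. and a doubling per 10 db."""
    return 2.0 ** ((level_db - 40.0) / 10.0)


def judged_interval(lower_db, width_sones, virtual=0.3, actual=0.6):
    """Relative judged size of an interval width_sones wide starting at lower_db."""
    lower = sones_from_db(lower_db)
    upper = lower + width_sones
    k = virtual / actual  # exponent applied to the sone scale
    return upper ** k - lower ** k


levels = [40, 60, 80, 100]
for width in (5, 10, 20):
    # Each row falls as the lower end of the interval moves up in level.
    print(width, [round(judged_interval(level, width), 3) for level in levels])

# With the virtual exponent equal to the actual exponent the row is flat.
print([round(judged_interval(level, 10, virtual=0.6), 3) for level in levels])
```

Under these assumptions an interval of constant size in sones is judged smaller at higher levels whenever the virtual exponent is below the actual one, and is judged constant when the two exponents coincide.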

It is of interest next to consider the other 
extreme and to ask what happens when the 
equal intervals are generated by a function 
with an exponent that is lower than the vir- 
tual exponent. Since a power function with 
a very low exponent resembles a logarithmic 
function (see Figure 5), we can examine the 
problem by setting up equal logarithmic or 
equal decibel intervals. Again the results 
turn out as expected. Decibel intervals appear
to grow larger as their absolute level is
raised. Thus a series of successive 10-db.
intervals beginning at 40 db. produced the
following magnitude estimations: 1.51, 1.89,
2.65, 4.14, and 9.10 (Dawson, 1968). The
values represent averages over four different
experiments. They show that the 10-db. interval
80-90 db. was judged to be about six
times larger than the 10-db. interval 40-50
db. If plotted in Figure 4, the judgments of
10-db. intervals would describe a curve that
sweeps upward rather than downward. In
other words, observers’ judgments demonstrate
that the virtual exponent is decidedly
greater than zero, and that a logarithmic
function does not accord with interval judgments.
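
A rough calculation shows why the quoted estimations rule out a logarithmic function. Under a logarithmic law every 10-db. interval would be judged the same size, so successive estimations would stand in the ratio 1 to 1. Under a power law with virtual exponent $\alpha$ on sound pressure, each 10-db. step would multiply the judged interval by $10^{\alpha/2}$. The sketch below applies that idealized model to the five values quoted from Dawson (1968); the calculation is an illustration and does not appear in the source.

```python
import math

# Magnitude estimations of successive 10-db. intervals beginning at 40 db.,
# as quoted in the text (Dawson, 1968).
estimates = [1.51, 1.89, 2.65, 4.14, 9.10]

step_ratios = [b / a for a, b in zip(estimates, estimates[1:])]
overall_ratio = estimates[-1] / estimates[0]

# Geometric-mean growth per 10-db. step, and the exponent it would imply
# if judged intervals grew by 10**(alpha / 2) per step (an assumed model).
mean_step = overall_ratio ** (1.0 / (len(estimates) - 1))
implied_alpha = 2.0 * math.log10(mean_step)

print([round(r, 2) for r in step_ratios], round(overall_ratio, 2))
print(round(implied_alpha, 2))
```

Every step ratio exceeds 1, so the implied exponent is positive rather than zero; the exact value obtained this way should be taken only as a crude indication.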

As a matter of fact, although Dawson did 
not use intervals that were exactly constant 
in sones, his data agree fairly well with func- 
tions having the form of those generated by 
a virtual exponent of about .3. The triangles
in Figure 4 show the magnitude estimations
of the largest sone intervals employed,
ranging from 16 to 31 sones. The
magnitude estimations of those intervals were 
multiplied by a factor in order to bring each 
of them to the value it would have had if all 
the intervals had been 30 sones. That multi- 
plicative correction appears to do little or no 
violence to the data. The path depicted by 
the triangles in Figure 4 shows how the ap- 
parent size of a 30-sone interval grows 
smaller as the stimulus level used to define 
the bottom end of the interval increases. 


Interval Production 


The production of prescribed intervals was 
the method invented by Plateau (1872). He 
asked eight artists to paint a gray such that 
the intervals from gray to black and gray to 
white would appear equal. That type of 
procedure is often called bisection. When 
more than two equal intervals are involved, 
it is called equisection (Garner, 1954). 

The intervals to be produced need not be 
equal. For example, in one of my own ex- 
periments, observers produced loudness in- 
tervals corresponding to markers spaced 
unevenly along a line. The results from that 
procedure of multisection were consistent 
with the results obtained by equisection. 

Under conditions designed to make the 
judgment maximally easy, the results of bi- 
section experiments have sometimes been 
found to agree fairly closely with the psy- 
chophysical functions produced by magnitude 
and ratio matching. In other words, although
the inevitable systematic difference
was clearly evident, the size of the difference
proved to be fairly small (see Stevens, 1955a
for loudness, 1961b for brightness). To a
limited extent, then, the bisection experi- 
ments have confirmed the exponents of the 
magnitude scale obtained by ratio-scaling
procedures. The magnitude-scale exponents
serve as an upper bound and the bisection-scale
exponents normally lie below the bound.


FIG. 5. [Beginning of caption missing in source] ... logarithmic coordinates. (As the exponent of the
power function decreases, the graph of the function
becomes straighter. It thereby approaches the form
of a logarithmic function, which is described by a
straight line in these coordinates.)

In a bisection experiment in which $\phi_2$ is
set midway between $\phi_3$ and $\phi_1$, the virtual exponent
of the power function can be determined
by the bisection equation (Stevens,
1955a), which may be written

$$\phi_3^{\alpha} - \phi_2^{\alpha} = \phi_2^{\alpha} - \phi_1^{\alpha}.$$

The exponent $\alpha$ may be found by iteration.
It seems to be an invariant rule that, as with 
other partitioning procedures, the value of 
the virtual exponent determined from bisec- 
tion has a lower value than the exponent 
determined by procedures that call for ratio 
judgments. 
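
One possible iteration for the bisection equation is sketched below. It rewrites the equation as a statement that the bisecting stimulus equals the power mean of the two end points, which rises steadily with the exponent, and then halves a bracketing interval until the exponent is located. The bracketing limits, the tolerance, and the function names are assumptions made for the illustration; the source does not say which iterative scheme was used.

```python
import math


def power_mean(phi1, phi3, alpha):
    """Stimulus value that bisects phi1..phi3 under a power law with exponent alpha."""
    return ((phi1 ** alpha + phi3 ** alpha) / 2.0) ** (1.0 / alpha)


def virtual_exponent(phi1, phi2, phi3, lo=1e-4, hi=10.0, tol=1e-10):
    """Solve phi3**a - phi2**a == phi2**a - phi1**a for a by interval halving."""
    if not (0 < phi1 < phi2 < phi3):
        raise ValueError("stimuli must satisfy 0 < phi1 < phi2 < phi3")
    if phi2 <= math.sqrt(phi1 * phi3):
        # At or below the geometric mean no positive exponent fits.
        raise ValueError("bisection point must lie above the geometric mean")
    if power_mean(phi1, phi3, hi) < phi2:
        raise ValueError("exponent lies above the search range")
    while hi - lo > tol:
        mid = (lo + hi) / 2.0
        if power_mean(phi1, phi3, mid) < phi2:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2.0


# Hypothetical stimuli in arbitrary physical units.
phi2 = power_mean(1.0, 100.0, 0.3)           # bisection point for exponent .3
print(round(phi2, 2), round(virtual_exponent(1.0, phi2, 100.0), 4))
print(round(virtual_exponent(1.0, 50.5, 100.0), 4))  # arithmetic midpoint
```

A bisection point at the arithmetic midpoint of the stimuli corresponds to an exponent of 1, and points closer to the geometric mean correspond to smaller exponents.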

The family of curves in Figure 4 tells us 
that a true bisection on a prothetic con- 
tinuum will not appear correct to the ob- 
server. The lower half of the interval will 
appear larger than the upper half. Conse- 
quently, when the observer himself makes 
the bisection, he lowers the bisecting point. 
In some bisection experiments, the bisection 
point has fallen low enough to agree with the 
exponent used to generate the curves in 
Figure 4 (Garner, 1954). In other experi- 
ments, the virtual exponent has been found 
to lie closer to the actual exponent (Stevens, 
1955a). 


Sum Production


The virtual exponent describes the con- 
vexity in the partition function under still 
another procedure. The bisection problem can 
be turned around, so to speak, and instead of 
asking the observer to produce segments, the 
experimenter can present the segments and 
ask the observer to produce the whole, or 
the sum. If the virtual function is convex, 
we should expect that the perceived whole 
would prove to be less than the sum of the 
perceived parts. When observers were 
shown two or more line segments and asked 
to produce a line that appeared equal to the 
sum, the line produced was longer than the 
sum of the separate lengths (Krueger, 1970). 
In other words, a line equal to the sum of 
the separate lengths would have appeared 
shorter than the apparent sum. The experi- 
mental results demonstrate that the exponent 
1.0, which observers ordinarily use in judg- 
ing apparent length, has been replaced by a 
virtual exponent having a lower value. 
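
The direction of the effect can be checked with a small calculation. Suppose the observer sets a line whose apparent length, computed with the virtual exponent, equals the sum of the apparent lengths of the segments. The exponent .9 used below is hypothetical (the text says only that the virtual exponent is lower than 1.0), as are the segment lengths and the function name.

```python
def produced_sum(lengths, virtual=0.9):
    """Length L satisfying L**virtual == sum(l**virtual for l in lengths)."""
    return sum(length ** virtual for length in lengths) ** (1.0 / virtual)


# Hypothetical segments, in centimeters.
segments = [10.0, 10.0]
physical_sum = sum(segments)
produced = produced_sum(segments)

print(physical_sum, round(produced, 2))
```

With any virtual exponent below 1.0 the produced line exceeds the physical sum, which is the direction of the result reported by Krueger (1970); with an exponent of exactly 1.0 the two coincide.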


RELATION TO POWER LAW


Perhaps the most important outcome of 
the work on partition scales lies in the dem- 
onstration, by means of interval methods, 
that the psychophysical function is a power 
law. The argument rests on the fact that the
apparent bisection point remains approxi- 
mately constant when all the stimuli are 
increased or decreased by the same factor. 
For example, Plateau’s eight artists each 
worked in a different atelier, under a differ- 
ent level of illumination, and yet they all 
produced bisecting grays that, when viewed 
in the same setting, were “presque identiques”
(almost identical). When that happens, the sensory
magnitude must follow either a logarithmic 
law or a power law. If the bisection point 
falls at the geometric mean between the end 
points used to define the bisected interval, 
then the log function is indicated. If the bi- 
section point falls above the geometric mean, 
as it seems to have done in Plateau's experi- 
ment, then a power function is indicated. 
And the virtual exponent $\alpha$ increases as a
function of the distance of the bisecting point 
above the geometric mean. Some of those 
basic relations are illustrated in Figure 6. 


Under optimal experimental conditions, 
the virtual exponent $\alpha$, determined by bisection,
may approach the value of the actual
exponent of the continuum as an upper
bound. Normally, as we have seen, the bisection
exponent lies below the continuum
exponent. Nevertheless, the invariance of
the bisection exponent under multiplicative 
variations in the stimulus levels has provided 
evidence that equal stimulus ratios produce 
equal sensation ratios—which is the invari- 
ance principle that underlies the power law. 


CATEGORY SCALES AND THRESHOLDS


The category scale calls for special con- 
sideration because it is by far the most com- 
mon and yet perhaps the least satisfactory 
form of partition scale. Most of the many 
users of category scales throughout science, 
education, engineering, and commerce do not 
intend a partitioning operation. Their hope 
is to grade or assess some variable, but they 
proceed to prescribe (and to limit!) the subjects’
response scale. Commerce does much
of its buying and selling with the aid of the 
grading of goods by subjective assessment on 
simple category scales, a crude procedure 
perhaps, but it serves its limited purpose. 
The business of scaling and measurement is 
not a primary aim in industry as it is in psy- 
chophysics. Nevertheless, category scaling 
is occasionally used in psychophysical experi- 
ments, despite the demonstrated inefficacy 
of the procedure. Stevens and Galanter 
(1957) examined some 70 different category 
scales on a dozen different perceptual con- 
tinua and found little to justify the use of 
category scaling in quantitative studies. 
Their hope was that category scaling would
thereafter fall into disuse. They hoped in 
vain. 

The categories may be designated by a
limited set of adjectives, such as large,
medium, and small. Or the categories may
be designated by a finite set of numbers,
such as 1 to 6. Those were the numbers
used for history's first recorded category
scale, the scale of stellar magnitude, which
dates from about 150 B.C. and which in a
much revised form still serves the astronomer
(see Stevens, 1960).




On prothetic continua, the category scale 
is invariably nonlinear relative to the magni- 
tude scale. When plotted against the scale 
obtained by magnitude estimation, for ex- 
ample, the data from category judgments 
produce a curve that is concave downward. 
Whenever the subject is asked to categorize, 
he is forced to divide the continuum into 
parts or segments in order to make it con- 
form to the limited, finite set of numbers or 
adjectives that he is required to use. In 
other words, he is obliged to attend to differ- 
ences or distances. Under those circum- 
stances, the subject is forced out of ratioing 
and into partitioning. 

What we learn from category experiments 
is that the human being, despite his great 
versatility, has a limited capacity to effect 
linear partitions on prothetic continua. He 
may do quite well, to be sure, if the con- 
tinuum happens to be metathetic, but, since 
most scaling problems involve prothetic con- 
tinua, it seems that category and other forms 
of partition scaling ought generally to be 
avoided for the purposes of scaling. If, for 
some reason, an unbiased interval scale is 
needed, it can be obtained from a ratio scale, 
for the ratio scale contains the interval scale 
(Stevens, 1946). 

The reverse is not possible, however. The 
ratio scale cannot be recovered from the in- 
terval scale when only interval information is 
available. 

Is there a use for category scaling? Al- 
though essentially useless for ratio scaling, 
category methods play an indispensable role 
in threshold measurements, where the prob- 
lem reduces to the determination of bound- 
aries between classes. In fact, all threshold 
determinations involve category procedures, 
because the problem is to sort stimuli into 
classes, for example, those that are detectable 
and those that are not, those that are dis- 
turbing and those that are not, or those that 
are acceptable and those that are not, and 
so on. 

Psychophysical thresholds are boundaries 
between classes. Although the boundary 
that we call a sensory threshold may be 
sharp at a given point in time, in a living 
organism the boundary behaves as though it 
were jittering about. Consequently, the
measurement of a threshold becomes a statistical
process: On the basis of repeated
samples, we make a statistical decision regarding
the location of the boundary. Still,
the underlying experimental operation for
determining any kind of threshold always involves
a procedure of matching either stimulus
to category or category to stimulus.


FIG. 6. Schematic diagram showing a family of
power functions with exponents ranging from .1 to
1.2. (As shown by the circles, the position of the
bisection point moves upward along the stimulus
scale (abscissa) as the exponent increases. Bisection
at the geometric mean would correspond to a
logarithmic function, or a vanishingly small exponent.)


POWER-GROUP TRANSFORMATIONS


As we have seen, partition scales can often 
be usefully described by power functions. 
Consequently, the nonlinearity of the parti- 
tion scale can be conveniently described by 
the difference between the virtual exponent 
of the partition scale and the actual exponent 
of the magnitude ratio scale. In other words,
the operation of partitioning causes the observer
to behave as though a power-group
transformation had been performed, that is,
as though an exponent had been altered.
Whenever two power functions differ in exponent,
they are nonlinearly related. Since
the virtual exponent is always less than the
actual exponent, the curvature of the parti- [...]

[...] loudness, measured in sones, grows very
large as loudness increases.

In order to illustrate how unlikely, indeed 
impossible, it would be for an observer to 
make a correct judgment of a loudness differ- 
ence regardless of stimulus level, let us con- 
sider the following paradigm. We will as- 
sume that the loudness exponent is .6 and 
that the interval from 40 to 50 db. corre- 
sponds to the subjective interval between a 
loudness of 1 sone and a loudness of 2 sones, 
or a difference of 1 sone. That happens to 
be a clear and obvious difference. Now consider
the same 1-sone difference at 100 db.
where the loudness is 64 sones. An increase 
from 100 to 100.2 db. makes a 1-sone differ- 
ence, raising the loudness from 64 to 65 
sones. But a change as small as .2 db. is so 
small that it would be detected less than half 
the time. It seems beyond possibility, there- 
fore, that a constant difference of 1 sone 
could be judged to be the same size regard- 
less of where it occurred on the continuum. 
That illustrative example may help to sug- 
gest why it is that every variety of partition- 
ing must give a distorted result on a pro- 
thetic continuum. A constant difference 
transforms itself from obvious to undetect- 
able as we go from weak to strong stimuli. 


Therein lies the essence of the prothetic 
principle. 
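
The arithmetic of the paradigm can be reproduced directly. The sketch assumes, as the example does, that loudness is 1 sone at 40 db. and doubles with each added 10 db.; the function names are illustrative.

```python
import math


def sones_from_db(level_db):
    """Loudness in sones, assuming 1 sone at 40 db. and a doubling per 10 db."""
    return 2.0 ** ((level_db - 40.0) / 10.0)


def db_step_for_one_sone(level_db):
    """Increase in decibels needed to add 1 sone at the given level."""
    loudness = sones_from_db(level_db)
    return 10.0 * math.log2((loudness + 1.0) / loudness)


for level in (40, 70, 100, 110):
    print(level, round(db_step_for_one_sone(level), 2))
```

The required increment shrinks from 10 db. at the 40-db. level to roughly .2 db. at 100 db. and roughly .1 db. at 110 db., in line with the values given in the text.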


FIG. 7. Showing how the difference in decibels
required to produce a 1-sone loudness difference
falls off with increasing sound pressure level. (At
40 db. it requires an added 10 db. to add 1 sone. At
110 db. it requires an added .1 db. to add 1 sone.
The dashed line shows the approximate value of the
jnd, the increase that would be detected about half
the time.)


A graphic illustration of the underlying
principle is shown in Figure 7. The ordinate
shows the stimulus change in decibels
that corresponds to a change of 1 sone. As 
in the example above, at a level of 40 db. a 
change of 1 sone corresponds to 10 db. At 
100 db., the required change is only .2 db., 
and it becomes still smaller at higher levels. 
The jnd (Weber fraction) is plotted in Fig- 
ure 7 as a horizontal line with an ordinate 
value of about .5 db. (see Miller, 1947). A 
plot similar to Figure 7 can be constructed 
for any prothetic continuum. For visually 
presented lengths, for example, such a plot 
could be made to show how, in visual per- 
ception, a centimeter added to a centimeter 
makes a clearly perceived difference, whereas 
a centimeter added to a meter becomes only 
marginally detectable. Under successive 
visual presentation, length behaves like loud- 
ness, and a constant added difference be- 
comes lost to view as the stimulus increases. 
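
The arithmetic behind Figure 7 can be sketched in a few lines. The sketch assumes the sone scale implied by the values quoted in the text (1 sone at 40 db., 64 sones at 100 db., hence a doubling of loudness for every added 10 db.). The function names are illustrative only and do not come from the original.

```python
import math

def sones(level_db):
    """Loudness in sones, assuming 1 sone at 40 db. and a doubling per 10 db."""
    return 2.0 ** ((level_db - 40.0) / 10.0)

def db_step_for_one_sone(level_db):
    """Decibel increase needed to add exactly 1 sone at the given level."""
    s = sones(level_db)
    return 10.0 * math.log2((s + 1.0) / s)

# Tabulate level, loudness in sones, and the decibel step that adds 1 sone.
for level in (40, 70, 100, 110):
    print(level, round(sones(level), 1), round(db_step_for_one_sone(level), 2))
```

Under that assumption the step is 10 db. at 40 db. and a little over .2 db. at 100 db., in line with the values read from the figure.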

The smooth continuity of the curve in Fig- 
ure 7 suggests that the underlying process 
that forces a given constant difference to be- 
come less and less apparent as the level in- 
creases is a process that operates all up and 
down the prothetic continuum. There are 
no discontinuities, no sudden transitions. 
When the observer attempts a comparison of 
differences at any place on the continuum, 
it is as though his perception undergoes an 
asymmetrical distortion; a constant differ- 
ence seems larger toward the lower than to- 
ward the higher part of the continuum. 
From that basic asymmetry, it follows that 
the operations of partitioning on prothetic 
continua will fail to produce a linear, un- 
biased interval scale. Where the asymmetry 
does not exist, as on a metathetic continuum, 
it becomes possible for partitioning to pro- 
duce a linear interval scale. 


INDIVIDUAL DIFFERENCES 


Let us turn at this point to a problem that 
was especially well formulated by Jones and 
Marcus (1961). Do different individuals 
have different operating characteristics in 
their sensory systems? If not, then why do 
the slopes (exponents) differ when observ- 
ers match numbers to stimuli, as, for exam- 
ple, in Figure 3? 

If we were to take some of the measured
values literally, it would be incumbent on us
to examine the implications. Consider in
Figure 3 the functions for observers EG 
(exponent .4) and PK (exponent 1.1). 
They both happen to be “normal” observers, 
having similar audiometric thresholds. If, 
as seems reasonable, the two observers ex- 
perience the same loudness when a tone is 
near threshold, what about a tone 100 db. 
above threshold? The two exponents sug- 
gest that the loudness at 100 db. would be 
about 3,000 times greater for PK than EG. 
Yet both observers react very similarly to 
acoustic stimuli. For example, both observers
report that 100 db. sounds very loud;
they both call 120 db. slightly painful; and 
they both jump when it is first turned on. 
Both observers set the intensity control of 
the radio to roughly the same level for com- 
fortable listening. Both carry on conversa- 
tions in about the same level of voice. All 
those bits of evidence make it hard to credit 
a difference in exponent that entails a 3,000- 
fold difference in loudness at 100 db. 
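
The 3,000-fold figure can be checked with a short calculation. The sketch assumes, as the next paragraph states, that the exponents refer to sound pressure, and that the two observers' loudness values coincide at threshold. The function name is hypothetical.

```python
def loudness_ratio(level_db, exponent_a, exponent_b):
    """Ratio of loudness for observer A to observer B at level_db above
    threshold, assuming both follow a power function of sound pressure
    and agree at threshold."""
    pressure_ratio = 10.0 ** (level_db / 20.0)   # 100 db. -> 100,000 to 1
    return pressure_ratio ** (exponent_a - exponent_b)

# PK (exponent 1.1) against EG (exponent .4) at 100 db. above threshold.
print(round(loudness_ratio(100.0, 1.1, 0.4)))
```

With these inputs the ratio is 10 raised to the power 3.5, or roughly 3,000.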

Another consideration is this. The value 
of an exponent reflects the curvature in the 
operating characteristic of the sensory sys- 
tem. As a function of sound pressure, the 
exponent .4 suggests that the loudness func- 
tion for EG is a decelerating function, 
sharply concave downward. The exponent 
1.1 suggests that the function for PK is con- 
cave upward—an accelerating function of 
sound pressure. In order to achieve such 
an unlikely difference in curvature, nature 
would have had to produce two radically dif- 
ferent kinds of operating characteristics in 
the two auditory systems. 

An alternative hypothesis is that all hu- 
man auditory systems operate on much the 
same design, with very nearly the same ex- 
ponents, but that there are wide individual 
differences in what observers take to be the 
numerical value of a loudness ratio. We 
encounter that kind of individual difference 
especially clearly when an observer is asked 
to adjust one sound to make it appear half 
as loud, or twice as loud, as another sound. 
In an experiment with 22 observers, the
scatter of eight distributions of settings
covered a median range of 14 db. (For plots
of the distributions, see Stevens, 1957a.)
That range of settings would correspond to
a range of exponents from about .3 to 1.5.
The distributions of the ratio settings tended
to be roughly log normal, however, and there
appeared to be no reason not to average the
data, which is equivalent to averaging
individual exponents.

Consider this question: What would happen
if an experimenter were to report that
each of 22 observers in a group made an
identical setting when asked to produce a
two-to-one loudness ratio? It is an interesting
question whether the scientific community
would regard such a result as a new
marvel, or whether scientists would simply
reject it as an implausible outcome. We
expect variability and we usually find it. The
substantive question concerns whether and
how to average the data.

Although the operating characteristics of
normal sensory systems may have the same
or closely similar exponents, there are
circumstances in which an abnormal exponent
can be demonstrated and in which the
abnormality calls for careful measurement.
The otologist, contemplating middle-ear
surgery, needs to know whether so-called
recruitment, with its attendant large exponent,
characterizes the lower part of the patient's
loudness function. If the exponent is
abnormally large, middle-ear surgery is
contraindicated. But if the exponent is normal,
the otologist may feel free to try to correct
a middle-ear difficulty. (For a model
depicting the manner in which Ménière's
disease may produce power-group
transformations on the loudness function, see
Stevens & Guirao, 1967.)

Except for clinical cases, it seems fair to 
say that seldom, if ever, have investigators 
undertaken the serious effort that is needed 
to establish the existence of individual dif- 
ferences in sensory power functions. Mostly, 
we are shown the variability that happens to 
be found in a particular matching response, 
usually magnitude estimation. The obvious 
next step, usually not undertaken, would be 
to try magnitude production. I have found
that many individual differences vanish as
soon as the matching task is inverted. For
example, by magnitude estimation, the lowest
loudness exponent in a group was .4, but
by magnitude production, that particular
observer produced the exponent .9; the
geometric mean of the two exponents is .6 (see
graphs in Stevens & Greenbaum, 1966).
Five observers in those same experiments
(Stevens & Guirao, 1962) made both
magnitude estimations and magnitude productions
of the loudness of noise. The ratio of the
largest to the smallest exponent for
estimation was 1.42; for production, it was 1.27.
When the geometric means of the expon- 
ents for estimation and production were ex- 
amined, the range ratio fell to 1.14. In other 
words, the individual differences were less 
pronounced when both estimation and pro- 
duction were used in a balanced design. 
The balancing of estimation by production 
may be good for a start, but if we are seri- 
ously interested in the power function for a 
particular individual, we will not stop with 
magnitude estimation and production. We 
will want to know the results from a bal- 
anced array of additional cross-modality 
matching tasks, the more the better. 
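
The balancing of estimation by production amounts to taking a geometric mean of the two exponents obtained from one observer. A minimal sketch follows, using only the pair of values quoted above (.4 and .9); the function name is illustrative.

```python
import math

def balanced_exponent(estimation_exponent, production_exponent):
    """Geometric mean of the exponents from the two inverse matching tasks."""
    return math.sqrt(estimation_exponent * production_exponent)

# The observer with the lowest estimation exponent in the group.
print(round(balanced_exponent(0.4, 0.9), 3))
```

The geometric mean is used here rather than the arithmetic mean because exponents are slopes in log-log coordinates, and the two tasks tend to bias the slope by reciprocal factors.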


VARIABILITY AND AVERAGING 


Criticism of the power law has sometimes 
centered on one or another aspect of varia- 
bility. It would be good, of course, if varia- 
bility could be reduced, so that the psycho- 
physical functions could be determined with 
higher precision. But empirical functions 
always suffer from variability, and the cen- 
tral question is not so much whether a mea- 
surement is variable, or whether subjects 
disagree, but whether averaging is appro- 
priate. If the data can be appropriately 
averaged, it does not matter how widely the 
variability may range, provided the number 
of independent measurements can be in- 
creased. In principle, the standard error can 
then be brought down to any desired level. 

In electrophysiology, for example, mira- 
cles of averaging are performed routinely by 
computers programmed to dig a particular 
waveform out of the myriad variations in the 
ongoing neural activity of the brain. A re- 
peated click delivered to the ear can then be 
seen as an evoked potential at the scalp. 
The response to the click emerges remark- 
ably clear and unencumbered by the noise of 
the brain, for the noise has been suppressed 
by being averaged out. An analogous strategy
of error cancellation by averaging finds
usefulness in psychophysics and in all the
rest of science.
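
A small simulation may make the averaging argument concrete. It is only a sketch under stated assumptions: the waveform, the noise level, and the number of trials are hypothetical, and the noise is taken to be independent and Gaussian from trial to trial. Under those assumptions the residual noise in the average should fall roughly as one over the square root of the number of trials.

```python
import math
import random
import statistics

def averaged_trace(signal, noise_sd, n_trials, rng):
    """Average n_trials noisy copies of `signal`, sample by sample."""
    sums = [0.0] * len(signal)
    for _ in range(n_trials):
        for i, s in enumerate(signal):
            sums[i] += s + rng.gauss(0.0, noise_sd)
    return [total / n_trials for total in sums]

def residual_sd(trace, signal):
    """Standard deviation of what remains after the true signal is removed."""
    return statistics.pstdev([t - s for t, s in zip(trace, signal)])

rng = random.Random(0)  # fixed seed so the run is repeatable
# Hypothetical waveform: a unit sine wave buried in noise ten times larger.
signal = [math.sin(2.0 * math.pi * i / 50.0) for i in range(200)]
single = residual_sd(averaged_trace(signal, 10.0, 1, rng), signal)
many = residual_sd(averaged_trace(signal, 10.0, 400, rng), signal)
print(round(single, 2), round(many, 2))
```

With 400 trials the residual noise should be about one-twentieth of its single-trial value, which is the sense in which the standard error can be brought down to any desired level.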

In the long run, since scientists tend to be- 
lieve only those results that they can repro- 
duce, there appears to be no better option 
than to await the outcome of replications. It 
is probably fair to say that statistical tests of 
significance, as they are so often miscalled, 
have never convinced a scientist of anything. 
By contrast, a tabulation of 178 determina- 
tions of the loudness exponent, based on 25 
years of accumulated results from several 
different laboratories, produced a median re- 
sult .6, which became the exponent recom- 
mended by the International Standards Or- 
ganization (see Stevens, 1955a). Since 
then, a further accumulation of experimental 
determinations has begun to fix the second 
decimal place, and it now appears that the 
value 2/3 may be more representative (Stev- 
ens, in press). The value 2/3 happened to 
correspond to the modal value of the 1955 
distribution, but at that time, the median 
seemed a better choice than the mode. 

How to average data presents serious and 
interesting questions. The median is per- 
haps the single most unbiased measure of 
location, and it has often been used in psy- 
chophysics. A popular rule is: when in 
doubt, use the median. On the other hand, 
a more efficient average is often wanted, and 
the choice of an efficient measure can usually 
be made to rest on the form of the distribu- 
tion. Thus, two different averages have 
proved appropriate in psychophysical scal- 
ing, each under a particular circumstance. 

In an equisection experiment, 45 subjects
divided a 40-db. segment of the loudness
continuum into four equal-appearing intervals.
Because the decibel measures of the subjects'
settings gave skewed distributions, it did not
seem proper to average the decibel values,
which would have been equivalent to
computing geometric means. When the decibel
measures were converted into sones (a linear
loudness value), the settings showed the
desired symmetries. It was concluded that
averaging should be done by computing the
arithmetic means of the loudness values, and
an iteration procedure for determining those
values was outlined (Stevens, 1955b). The
arithmetic mean is also an appropriate
average for other kinds of partition scales, such
as the category scale.
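
The equivalence between averaging decibels and computing a geometric mean of loudness can be illustrated as follows. The sketch assumes the sone scale implied earlier in the text (1 sone at 40 db., doubling per 10 db.), and the three settings are invented purely for illustration. It does not reproduce the iteration procedure of Stevens (1955b).

```python
import math

def db_to_sones(level_db):
    """Sones from decibels, assuming 1 sone at 40 db. and doubling per 10 db."""
    return 2.0 ** ((level_db - 40.0) / 10.0)

def sones_to_db(loudness):
    """Inverse of db_to_sones."""
    return 40.0 + 10.0 * math.log2(loudness)

def mean_of_decibels(settings_db):
    """Arithmetic mean taken directly on the decibel values."""
    return sum(settings_db) / len(settings_db)

def mean_of_sones(settings_db):
    """Arithmetic mean taken in sones, reported back in decibels."""
    loudness = [db_to_sones(x) for x in settings_db]
    return sones_to_db(sum(loudness) / len(loudness))

def geometric_mean_of_sones(settings_db):
    """Geometric mean taken in sones, reported back in decibels."""
    logs = [math.log(db_to_sones(x)) for x in settings_db]
    return sones_to_db(math.exp(sum(logs) / len(logs)))

settings = [50.0, 60.0, 70.0]  # hypothetical settings, not data from the study
print(mean_of_decibels(settings), geometric_mean_of_sones(settings),
      round(mean_of_sones(settings), 2))
```

The first two values agree, which is the point made in the text; the arithmetic mean in sones lands at a somewhat higher level.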

In experiments involving magnitude esti- 
mation and other forms of cross-modality 
matching, it is the geometric mean, not the 
arithmetic mean, that appears to be the ap- 
propriate average. Against a linear scale, 
the response distributions are skewed, as in- 
deed they must be when error is proportional 
to magnitude. When error grows in pro- 
portion to magnitude, so that the relative 
error stays constant, a logarithmic trans- 
formation tends to undo the skewness. The 
applicable principle states that when error is 
relative, the error distribution is log normal. 
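
The principle can be illustrated by simulation. The sketch below assumes a purely multiplicative error whose logarithm is Gaussian; the true value, the spread, and the number of judgments are hypothetical. Under that assumption the geometric mean recovers the central value, whereas the arithmetic mean is pulled upward by the skewed tail.

```python
import math
import random
import statistics

def simulate_estimates(true_value, log_sd, n, rng):
    """Judgments with relative (multiplicative) error, log-normally distributed."""
    return [true_value * math.exp(rng.gauss(0.0, log_sd)) for _ in range(n)]

rng = random.Random(1)  # fixed seed so the run is repeatable
estimates = simulate_estimates(10.0, 0.5, 20000, rng)

geo = statistics.geometric_mean(estimates)   # average taken on the log scale
arith = statistics.fmean(estimates)          # average taken on the linear scale
print(round(geo, 2), round(arith, 2))
```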

The log-normal model for magnitude esti- 
mation was tested by J. C. Stevens who 
plotted the results obtained when 70 naive 
observers estimated the loudness of a white 
noise presented by a loudspeaker in a class- 
room (J. C. Stevens & Tulving, 1957). In 
the first of two experiments, the observer 
chose his own modulus by assigning to the 
first stimulus (85 db.) whatever number 
seemed appropriate. In the second experi- 
ment, the stimuli were presented in pairs, a 
standard at 85 db. called 10 followed by a 
variable. When the cumulative frequencies 
for this second experiment were plotted on 
probability paper against the logarithm of 
the magnitude estimations, the result was a 
family of straight lines. In other words, the 
distributions were log normal. 

Some five years later J. C. Stevens and 
Miguelina Guirao applied the modulus equal- 
ization procedure to the data of the first ex- 
periment, the one in which each of the 70 
subjects had chosen his own modulus. (The 
deviation of each subject’s scores from the 
average function was minimized by the pro- 
cedure for modulus equalization described 
above.) The cumulative frequencies of the 
judgments subjected to modulus equalization 
are shown in Figure 8. Again the straight 
lines demonstrate that the distributions are 
approximately log normal. 

Fig. 8. Cumulative frequency distributions of
magnitude estimations. (Each of 70 subjects, making
their first judgments ever, assigned whatever
number seemed appropriate to eight levels of white
noise presented in the order shown. The data of
three subjects who used negative numbers or zeros
were not tabulated. In these coordinates, a straight
line signifies a log-normal distribution. Data from
J. C. Stevens & Tulving, 1957.)

Other features of the results in Figure 8
are also of interest, especially in view of the
fact that those were the first magnitude
estimations made by naive listeners. Since it was
a classroom
experiment, the order of the stimuli could 
not be made different for the different listen- 
ers. Consequently, the order of the stimuli 
is reflected in the slopes of the cumulative 
frequency lines, or, in other words, in the 
standard deviations. The first stimulus has 
the smallest standard deviation (1.0 deci- 
log), the second stimulus the next smallest 
(1.4 decilog), and so forth. If the stimulus 
order had been made different for each ob- 
server, the lines in Figure 8 would be more 
nearly parallel. But a tendency has been ob- 
served in numerous experiments for the vari- 
ability to increase at the low end of the scale, 
and to a lesser extent, at the high end. That 
same tendency is apparent in Figure 8. 
Hence, it can be seen that the effect due to 
stimulus order cuts across the normal tend- 
ency for the variability to be slightly lower in 
the middle range. 

The lines in Figure 8 make it clear that 
the geometric mean is an appropriate aver- 
age for the data. In this instance, the geo- 
metric means determine a power function 
with the exponent .55. 

A similar treatment was applied to the
data from the original "no standard" experi-

Fig. 9. Showing how Munsell grays, extending
from black to white, are judged when they are
presented entirely alone with no other visible
luminance in the field of view. (The luminance then
covers a range of about 16 db., which seems
subjectively rather short. On the three successive
presentations of the set of stimuli, the exponent
became larger (unfilled symbols). The filled symbols
show the results for an extended luminance range.
The exponent is .35, which is much lower than the
value 1.2 obtained when the Munsell grays are
placed one by one on a table in front of the
observer.)


COUNTERBALANCING FOR RANGE 


The general problem of the interaction of 
range and exponent may be thought of in 
terms of the matching of two continua, A 
and B, each of which may serve in turn as 
the adjusted and as the criterion continuum.
What does the observer tend to do to the
adjusted stimulus as a function of the range
(logarithmic difference) between two criterion
stimuli set by the experimenter? For
purposes of discussion let us assume that the 
true exponent determined by the slope A/B 
in log-log coordinates is 1.0. 

If the experimenter were to set the criterion
range to a very small value, say, to a
small fraction of a decilog, on the average the
observers would necessarily respond with
adjusted ranges that average larger than the 
criterion range. That value of the adjusted 
range would then determine an exponent 
greater than 1.0. On the other hand, if the 
criterion range was set at a very large value, 
the observers would tend to match it with 
shorter ranges, determining thereby an ex- 
ponent smaller than 1.0. Between those two 
extremes, the exponent would decrease 
monotonically as the range increased. 

Since the two continua A and B can be 
interchanged in their roles of criterion and 
adjusted variable, in a balanced experiment 
two paths would be traced by the exponent 
as the criterion range was varied from very 
small to very large. Those two paths would 
cross, and the crossing point would presuma- 
bly determine the unbiased exponent—un- 
biased by the effect of range, but not neces- 
sarily free of other possible biases. 

A concrete example of the two paths fol- 
lowed by the exponent when range is varied 
in a fairly well-balanced experiment can be 
constructed from the data of Stevens and 
Poulton (1956). The loudness function for 
a 1000-Hz. tone was studied by allowing 
groups of 8 to 11 unpracticed observers to 
make only a single judgment, either a ratio 
estimation or a ratio production. A standard
stimulus was sounded first and called
100, and the observer expressed the apparent
ratio by assigning a number to a second
stimulus at a lower level. The exponents
corresponding to the median estimations of
each group are shown by the triangles in 
Figure 10. Each triangle represents a dif- 
ferent group of listeners. 

For the ratio productions, the observers 
adjusted a sone potentiometer to produce a 
prescribed fractional loudness relative to a 
standard. The exponent corresponding to 
the ratio productions (sone average) for 
each group of subjects is shown by a circle 
in Figure 10. Again each circle represents 
a different group of listeners. The data 
show an approximate symmetry and suggest
that for those naive observers making their
first loudness judgments, the exponent lies
between .6 and .7. The range effect emerges
as a dramatic but orderly variable, and since
it shows an approximate symmetry, it can in
principle lead to a unique exponent
determined by the crossing point in Figure 10.

Fig. 10. The range effect in a partially balanced
experiment. (Each point represents a separate
group of subjects who estimated a loudness ratio
(triangles) or produced a loudness ratio (circles).
When the range effect produces symmetrical
functions, as is approximately true here, the exponent
may be uniquely determined. It is the value that
makes the exponent corresponding to the crossover
point equal to the exponent implied by the relation
of the scales at the top and bottom of the figure.
The exponent so determined is .65. Data from
Stevens & Poulton, 1956.)

The actual crossing point depends, of 
course, on the relation between the two scales 
(top and bottom in Figure 10) against which 
the two sets of data are plotted. But the re- 
lation between the values on the two scales 
(both logarithmic) also expresses an ex- 
ponent. The problem, then, is to adjust the 
relation between the two scales (ratios to be 
produced and ratios to be estimated) so that 
the exponent determined by the scale relation 
coincides with the exponent determined by 
the crossing point. The adjustment can be 
carried out by iteration. 

In plotting Figure 10, I first assumed that 
the exponent was .60. Accordingly, the 
ratio 1/2 at the top of the graph was set 
directly above 10 db. at the bottom of the 
graph. The other ratios were then set ac- 
cording to a logarithmic spacing. When the 
data were plotted, the crossover was found 
to correspond to .64 on the ordinate—a value 
that was larger than my assumed exponent. 
I next assumed a larger exponent, .67, and
changed the upper scale and the location of
the circles accordingly. The resulting cross- 
over then fell at .65—a value smaller than 
my assumed exponent. Thus the two as- 
sumptions, one too high and one too low, 
had succeeded in bracketing the exponent 
between .64 and .65, but closer to .65. With 
fallible empirical data, the exact value of the 
exponent cannot be taken too literally, but it 
is nevertheless interesting that the crossover 
value accords approximately with the con- 
sensus of other measurements. 
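
The two steps of the iteration can be sketched numerically. The first function gives the decibel distance that corresponds to a loudness ratio under an assumed exponent, taking loudness as a power function of sound pressure. The second is my own shortcut, not the procedure used for Figure 10: it draws a straight line through the two trials reported above and finds the exponent at which the crossover would equal the assumed value. Both function names are hypothetical.

```python
import math

def db_for_loudness_ratio(ratio, exponent):
    """Decibel change that yields `ratio` in loudness, if loudness is
    proportional to sound pressure raised to `exponent`."""
    return 20.0 * math.log10(ratio) / exponent

def fixed_point_by_interpolation(assumed_1, crossover_1, assumed_2, crossover_2):
    """Exponent at which a straight line through the two trials gives
    crossover == assumed (a linear-interpolation assumption)."""
    slope = (crossover_2 - crossover_1) / (assumed_2 - assumed_1)
    return (crossover_1 - slope * assumed_1) / (1.0 - slope)

# Scale alignment: the ratio 1/2 (or 2) under the two assumed exponents.
print(round(abs(db_for_loudness_ratio(0.5, 0.60)), 1))
print(round(db_for_loudness_ratio(2.0, 0.67), 1))

# Trials from the text: assumed .60 gave .64; assumed .67 gave .65.
print(round(fixed_point_by_interpolation(0.60, 0.64, 0.67, 0.65), 3))
```

The alignment comes out near 10 db. for the exponent .60 and near 9 db. for .67, and the interpolated exponent falls between .64 and .65, nearer .65, which agrees with the bracketing described above.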

The search for other instances in which a 
pure range effect could be studied has been 
only partially successful. Experiments with 
the required balanced design seem to be rare, 
but we can compare two separate experi- 
ments that add up to a partly balanced de- 
sign. In Figure 11, the triangles show the 
exponents corresponding to the median ratio 
estimations made by groups of about 30 sub- 
jects (Poulton, 1969). Each group judged 
one noise level set at 5, 20, or 35 db. above a 
standard noise (600 to 1200 Hz.) at 65 db. 
The circles in Figure 11 represent the
exponents determined from the ratios produced
by 31 listeners who made multiple and
fractional settings of noise consisting of a
40-tone complex (Geiger & Firestone, 1933).
Both the multiple and the fractional
productions were averaged to determine the
exponents. The same group of listeners heard
all the stimuli, and the order of the ratio
productions was 1/2, 1/4, 1/10, 1/100, 2, 4,
and 10.

Fig. 11. Range effect shown in two noncomparable
experiments. (Separate groups of subjects
estimated loudness ratios (triangles) of a band of
noise, 600-1200 Hz. (Poulton, 1969). A single
group of subjects produced fractional and multiple
ratios (circles) of a 40-tone complex (Geiger &
Firestone, 1933).)

In order to plot the circles in Figure 11,
the exponent was assumed to be .67, and
the ratio 2 on the upper scale was set
directly above 9 db. on the lower scale. The
crossover point then fell at .77. If the
exponent is assumed to be .8, and if the scale
values and the circles are moved accordingly,
we find that the crossover point becomes
about .78. Thus the two assumed exponents
have bracketed the crossover exponents.
Consequently, the exponents that would be
determined by the data in Figure 11, if they
were homogeneous and suitable for such a
purpose, would lie between .77 and .78.
That value is on the high side, even though
the measured exponents for noise are often
larger than those for tones. The mean
exponent for white noise in an earlier
compilation was approximately .72 (Stevens, 1955).

The groups of subjects who gave the results
that determined the triangles in Figure
11 subsequently judged all the other stimuli.
The pooled results determine a power function
with an exponent of about .73, provided
no attempt is made to force the best fitting
line to pass through the standard stimulus.
Although experimenters have sometimes
assumed that the function must pass through
the standard, there is in fact no such
requirement. The observer does not respond
to any stimulus, whether standard or variable,
with zero error.


The foregoing examples of the sources of 
error and distortion that may plague a psy- 
chophysical measurement are more illustra- 
tive than exhaustive. As has often been 
said, there are many ways to perform a bad 
experiment. Not even the expert in statis- 
tical design can tell exactly how to perform 
a good one. In principle, however, we can 
study each experimental ailment and utilize 
the rules of its behavior in order better to 
diagnose the nomothetic substrate. 
THE NOMOTHETIC IMPERATIVE


The scientist’s contest with nature has 
prospered to the degree that simplicities and 
uniformities have been detected amid the 
complexities that afflict observation and
experiment. The simple invariances have often
proved hard to find, however, because no
experiment can be performed without its
“context,” and no measurement can be made
without error. In psychophysics, each
experimenter records results that disagree to
some extent with those of his colleagues,
and a penumbra of uncertainty surrounds
even our best determinations. Consequently,
there is room for many hypotheses and for
many views regarding the structure of the
psychophysical domain.

First there are those whose working
hypothesis states that there exist laws to be
discovered. Heeding the nomothetic
imperative, those investigators refuse to be put
off by the apparent chaos in the organism's
reactions to stimuli, and they try to order,
classify, and systematize the behavioral facts.
Some people seem willing to gamble that the
sense organs operate in beautifully simple
ways and that what we take to be complexity
lies more in our inept descriptions than in
nature's comportment. As regards the
question of reactions to stimulus intensity, the
nomothetic outlook assumes that there
exists an orderly input-output relation, a
simple law of some sort, however difficult it
might be to pin it down in its exact form.
Perhaps it was that same nomothetic outlook
that drove Kepler to search for simple
invariant laws in the baffling paths of the
wandering planets, which were thought in
early times to crisscross the skies impelled
by their own volition.

The nomothetic viewpoint is consonant
with an objective, operational approach, but
it does not necessarily entail a particular
philosophy of reality. In the fitting of
schematics to empirical fact (what I have called
the schemapiric endeavor) there is no
necessity that we take a stand regarding any
ultimate concern. We can simply note, for
example, that when the intensity of the visual
stimulus increases, the observer's response
changes in an orderly fashion. If he is
instructed to squeeze a hand dynamometer to
match the apparent intensity of what he sees, 
then the force of his squeeze increases as a 
power function of the physical luminance (J. 
C. Stevens, Mack, & Stevens, 1960). That 
is the first-order fact. But what shall we say 
about it? We have obviously measured 
something, but have we measured sensation?
As soon as that word appears, a discordant 
chatter sets in, because many authors con- 
tend that sensation is not a thing that can 
be measured. 

If I thought it would help, I should hap- 
pily give up the word sensation in favor of 
some other term, such as behavioral response 
or apparent effect, but the philosopher would 
shortly discover that my new term is only a 
euphemism for what I regard as a straight- 
forward construct, a construct that can per- 
haps best be communicated to other people 
by my labeling it sensation. 

A prime source of confusion in this seman- 
tic issue rests with the fact that each of us 
experiences sensations, and to each of us our 
sensations seem personal and private and in- 
accessible to measurement. As scientists, we 
should try to ignore that fact. To help free 
us of narcissistic introspection, there is the 
example of Plateau, the blind physicist, who 
gave us our first interval scale for the light- 
ness of grays. He himself did not need to 
see the grays that the eight artists painted, 
for he could record the reports of other ob- 
servers. Plateau's achievement drives home 
a crucial point: the blind could develop the 
psychophysics of vision; the deaf could de- 
velop psychoacoustics. 

Sensation then is a construct, a name 
given to a constellation of behaviors. The 
justification for saying that sensation is sub- 
jective is that a human subject exhibits the 
behaviors. The heating coil in an electric 
stove also exhibits interesting behaviors, but 
quite arbitrarily we refrain from using the 
term subjective for those behaviors or for 
the constructs built on them. The simple,
operational dichotomy into subjective
(meaning people) and objective (meaning not
people) seems eminently convenient, but words
and slogans sometimes cause clash and
conflict. In the schemapiric view of science,
words and symbols serve only the neutral
purpose of implementing a schematic structure
which may be related by operational
rules to an empirical structure (see Stevens,
1968b).

As an experimenter, I feel free to use 
terms like sensation and subjective, because 
they can be defined operationally in the con- 
text of psychophysical research. For some 
writers, however, the use of such words puts 
the user in the camp of Subjectivism, as op- 
posed to Behaviorism (e.g., see Baird, 1970). 
I have long thought of myself as a behavior- 
ist, but it has not seemed defensible to assert 
that verbal taboos provide a tool for ad- 
vancing science, as some behaviorists have 
seemed to believe. A viable injunction is 
this: Use any words you care to, but let it 
be plain what operations lie behind your ver- 
bal forms. Otherwise said, construct the 
schema as you will, but tell us precisely how 
the empirics articulate with the schematic 
terms in order to produce the schemapiric 
substance. 

Such protestations, I realize, will avail 
nothing with those who want to maintain a 
philosophical distinction between certain 
traditional views and to preserve a dualism 
that to an operationist is devoid of meaning. 
Since I do not believe in the usefulness of 
the distinction, I am happy to read that 
It is impossible to locate Stevens' view precisely, 
since he constantly shifts back and forth between a
behaviorist and some other way of treating the 
question . . . [Savage, 1970, p. 390]. 

More damning than that, however, the whole 
psychophysical enterprise is said by Savage 
to be wrongly conceived, so that 

However we reconstrue Stevens' law, it cannot be
construed as one relating sensations to stimuli,
since the former are incapable of measurement
[p. 541].

More than half a thousand pages, it should
be said, are devoted by Savage to the thesis 
that we cannot measure what psychophysi- 
cists seem so delightedly to be measuring. 
Philosophical problems must not detain us, 
however, for philosophical issues are eternal 
and do not go away. More tractable, per- 
haps, are some of the specific issues that 
strike directly at the nomothetic imperative.
A cluster of questions has concerned the
problem of picking the best sensory scale.
When three different scales have been created
on the same continuum by three separate 
sets of operations—jnd's, category estima- 
tion, and magnitude matching—on what 
grounds do we presume to ascribe to one of 
those scales a superior position? Helson
(1964) seemed to deny that we can make 
any such judgment, for he said that 


no one scale, however carefully established, can be 
considered better than other scales obtained under 
different conditions of judging [p. 179]. 


Other authors have also lamented the ab- 
sence of criteria for determining the validity 
of a choice of one kind of scale over another. 
Validity is indeed the issue here. Which 
scale best measures what it is that we want 
to measure? Since it is that kind of ques- 
tion, the answer becomes a matter of opinion 
—a value judgment. It appears that all 
problems related to validity must seek their 
ultimate solution in the pragmatic domain. 
If a scale does the job we want done, we 
usually accept it. 

But opinions differ about problems and 
solutions. How else can we understand the 
decision of an experimenter to limit the ob- 
server's responses to a finite set of numbers, 
such as 1 to 7 or 1 to 20? That curious 
maneuver of constraining the observer's re- 
sponses is a tactic that seems somewhat 
mulish to those who have allowed the ob- 
server a full range of responses and have 
witnessed the greater usefulness of the re- 
sulting ratio scales. It may be true that in 
the long run, superior procedures tend to 
replace inferior procedures, but in almost 
two decades of practice, magnitude estima- 
tion has not displaced category estimation— 
nor does it seem likely to do so any time 
soon. 

Even among those who have given up 
category scaling in favor of ratio or magni- 
tude matching, there remains a matter of 
taste and opinion that divides the practition- 
ers. One view holds that the construct we 
call sensory magnitude follows simple laws, 
and in particular that under proper circum- 
stances, the sensation magnitude experienced 
by the typical (median) observer grows as a 
power function of the stimulus magnitude. 
The power law is thought to set constraints 
on our expectations regarding the outcome
of experiments, so that a discordant result 
becomes suspect until verified by adequate 
replication. In other words, the sensory 
power law takes precedence over the results 
of any particular experiment, just as the 
power law governing gravitational attraction 
usually remains unquestioned despite a par-
ticular experimenter’s inability to confirm it
by dropping objects in a laboratory. Both 
kinds of power laws make possible many 
kinds of predictions; yet they both can be 
shown to fail to some degree in particular 
contextual circumstances. 
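
The power-function view described above is commonly written as
psi = k * phi ** beta, where phi is stimulus magnitude and psi is
judged sensory magnitude. The sketch below is only an illustration of
how an exponent could be estimated from magnitude estimates by a
straight-line fit in log-log coordinates. The symbols k and beta, the
function name fit_power_law, and the data are assumptions of this
example and are not taken from the text.

```python
import math


def fit_power_law(stimuli, estimates):
    """Least-squares fit of log(psi) = log(k) + beta * log(phi).

    Returns (k, beta). Both inputs must be positive numbers.
    """
    xs = [math.log(s) for s in stimuli]
    ys = [math.log(e) for e in estimates]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Slope of the regression line in log-log coordinates is the exponent.
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    beta = sxy / sxx
    k = math.exp(mean_y - beta * mean_x)
    return k, beta


# Hypothetical, noise-free "median observer" data generated from
# psi = 2 * phi ** 0.6; real magnitude estimates would scatter around the line.
stimuli = [1.0, 2.0, 4.0, 8.0, 16.0]
estimates = [2.0 * s ** 0.6 for s in stimuli]

k, beta = fit_power_law(stimuli, estimates)
print(round(k, 3), round(beta, 3))
```

With real data, a discordant exponent from a single experiment would show up as a slope differing from that found in replications, which is the situation the paragraph above describes.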

The other view holds that the nomothetic 
imperative has no compelling jurisdiction in 
psychophysics, because departures from the 
power law are too numerous to be ignored. 
It is claimed that the ways in which the 
human observer responds to stimulus in- 
tensity depend on prior learning, adaptation, 
range of stimuli, nature of the matching task, 
and so on; and that until those many factors
and contextual influences can be discovered, 
explored, and understood, it is premature to 
speak of a psychophysical law. 

The two views sketched above may prove 
more extreme than the attitude of any par- 
ticular scientist, but they illustrate two poles 
of opinion. Perhaps the least nomothetic 
view yet expressed was that of Poulton 
(1968) who concluded his review of "the 
new psychophysics” by saying, “The mechan- 
ism of response learning and response bias 
must be included in any adequate descrip- 
tion. To this reviewer," he added, "they 
present the more interesting and challenging 
problems." In my own view, the problems 
of response bias rate no better than a nui- 
sance, an interesting nuisance, perhaps, as 
some of the foregoing sections have tried to 
show, but nevertheless a diversion from the 
basic business of sorting out the fundamental 
principles. Fortunately for science, how-
ever, its practitioners are motivated by a
diversity of values and interests. It would
stifle the enterprise if we all tried to crowd
in on one single problem.

This paper hypothesizes that in early infantile autism, fetuses with an in-
herited high potential for intellectual superiority are developmentally ready
for imprinting with human behaviors much before birth. Instead of being
imprinted postnatally at critical periods in cerebral development as are
most children, these neurologically advanced fetuses are imprinted on the
restricted uterine environment. Such atypical imprinting leads to the syn-
drome of early infantile autism. Suggestions for research are offered.

Since Kanner (1943) wrote his paper in
which he described a group of children who
were characterized by "extreme autistic
aloneness," the construct of early infantile
autism has become engulfed in a morass of
semantic confusion. Although Kanner be-
lieved that these children were special cases
included in a larger group of children with
childhood schizophrenia, subsequent investi-
gations have used the terms schizophrenia,
psychosis, atypical child, brain injured, and
many others, to describe Kanner's group.
The semantic problem has become further
confounded by the fact that experimenters
have purported to be investigating autism
when their subjects were not at all like those
Kanner discussed. For example, a study
may report that a particular drug is useful
with autistics when in fact the investigation
failed to include a single child in the study
who met Kanner's criteria.

The remainder of this paper, then, will dis-
cuss children designated by Kanner as being
characterized by (a) inability to relate to
people from the beginning of life, (b) fail-
ure to use language to communicate, (c) an
obsession with maintaining sameness, (d)
preoccupation with objects, and (e) evidence
of good cognitive potentialities.


PARENTAL INTELLIGENCE


Based largely on his observation of their
educational and social status, Kanner (1965)
inferred that the parents of autistics are often
remarkably intelligent. His statements have
been questioned on the basis that wealthier,
better educated, and presumably more intelli-
gent parents are more likely to seek psychia-
try than those of lower status (Bettelheim,
1967; Creak & Ini, 1960).


1 Requests for reprints should be sent to Dewey
J. Moore, Porter Evaluation Center, Indiana State
University, Terre Haute, Indiana 47809.

In spite of the doubts regarding Kanner's 
beliefs concerning the intelligence of parents 
of autistic children, evidence is accumulating 
to support his contention. For example, 
Rimland’s (1964) careful analysis of the 
American-European literature containing de- 
scriptions of the parents of autistic children 
led him to state that the evidence over- 
whelmingly supports Kanner's early findings. 
Rutter (1969) points out that there are sig- 
nificant differences between the social class 
status of autistics and schizophrenics. In an 
epidemiological study, Lotter (1967) found 
that a high proportion of parents of autistic 
children are above average in intelligence and 
of superior socioeconomic status. Lotter is 
careful to show that this finding is not an 
artifact of referral policies, and is supported 
by Treffert (1970) who has found that 
highly educated parents do not identify their 
children as disturbed more often than less- 
well-educated parents. 

Recent data from Treffert (1970) further
support the inference of high parental in-
telligence based on educational achievement.
In his longitudinal study of 280 cases of 
childhood schizophrenia and infantile autism, 
Treffert found the educational level of par- 
ents with autistic children to be significantly 
higher than that of parents with schizo- 
phrenic children. Treffert states that his 
verification of this persistent finding 


could well point toward a genetic view of the ill- 
ness with the autistic child as seen as the mutant of 
especially intelligent progenitors [p. 434]. 



Not all studies support the suggestion of 
high parental intelligence (Levine & Olson,
1968; Lowe, 1966; Schain & Yannet, 1960),
but these, as well as similar investigations, 
are difficult to evaluate because of their 
failure to adhere to the definition of autism 
as proposed by Kanner. Included in some 
of these studies are chronic undifferenti- 
ated schizophrenics and autistic/symbiotics 
(Lowe, 1966), autistic "mental defectives" 
(Schain & Yannet, 1960), and undifferenti- 
ated psychotics (Creak & Ini, 1960). 

When investigators are careful to differ- 
entiate autism from other conditions that 
may simulate it, and when referral policies 
to clinics are properly accounted for, evi- 
dence is emerging that the inference of pa- 
rental intellectual superiority is justified. 
Treffert’s (1970) hypothesis that autistic 
children are mutants of highly intelligent 
parents appears entirely plausible. Interest- 
ingly, such a hypothesis lends credence to 
Kanner's (1943) observation that autistic 
children show evidence of good cognitive 
potentialities, and Rimland's (1964) review 
of the literature in which the finding of very 
special memory and manipulative abilities in
autistics was commonplace.


NEURAL DYSFUNCTION 


There is substantial growing support for 
the suggestion that early infantile autism 
is associated with neurophysiological dys- 
function of the reticular activating system. 
Working with autistic children who were 
selected on the basis of Kanner’s criteria, 
Hutt and Hutt (1969) found that electro- 
encephalogram (EEG) records of their sub- 
jects showed desynchronized activity. These 
findings, together with behavioral observa- 
tions, led the investigators to hypothesize 
that the children were in a condition of high 
physiological and behavioral arousal. Ornitz 
and Ritvo (1969) postulate that the basic 
condition causing autism rests on faulty 
homeostatic regulation of perceptual input.
Indirect evidence by Holzman (1969) shows 
that dysfunction in the final or propriocep- 
tive-autonomic response portion of the total 
perceptual process may be a significant factor 
in the etiology of the schizophrenic syn-
drome. Finally, Mirsky (1969) found 13
recent studies supporting the assumption that
all schizophrenia begins with a basic fault in
the ascending reticular activating system.


BRAIN DAMAGE


While many investigators agree that au- 
tism has a neurophysiological basis, probably 
in the reticular activating system, and that a 
potentially superior organism is involved, the 
interaction of these two findings needs ex- 
ploration. 

Rimland (1964), who subscribes to both
of the above concepts, neurophysiological
basis and organismic superiority, states,


If a potentially very intelligent child whose brain
was continuing to develop at a rapid rate, were ex-
posed to atmospheric oxygen (or worse yet, to rela-
tively high concentrations of medical oxygen) while
the vasculature was still immature, damage to the
vasculature of his brain could result [p. 129].


Rimland’s (1964) theory of oxygen insult 
crumbles under the simple logic that if the 
child is genetically superior and developing 
at a rapid rate, it, even more than a normally 
growing organism, should be able to with- 
stand normal stresses. The circumstances of 
a superior, rapidly developing brain that is 
more sensitive to normal environmental in- 
sults seems contradictory and denies all we 
would logically hypothesize about the ability 
of superior biological organisms to survive 
physical trauma. 

A number of investigations have suggested
that autistic children are more likely to evi-
dence various kinds of brain pathology than
are other children. Even when it is indicated 
that the researchers have taken great care to 
differentiate autistics from other behaviorally 
impaired children, the findings are inconclu- 
sive. As Rutter (1969) points out, 


There are no published histopathological studies;
the EEG studies are inadequate and contradictory;
and metabolic investigations are still in their in-
fancy [p. 397].


Experimenters who purport to find brain
damage in autistic children may actually be
observing dysfunction of the reticular ac-
tivating system. As will be shown later in
this paper, such atypical functioning is not
necessarily a result of brain damage but can
result from sensory deprivation.

A review of "brain damage" as an etiologic 
factor in the production of autism is of little 
value. Rutter (1969) states, 


However, even if it is accepted that autism may
sometimes, or perhaps often, develop on the basis
of brain damage, the explanation is too general to
be of much help in understanding how autism de-
velops. The term “brain damage” includes a wide
variety of disturbances. The vast majority of chil-
dren with brain damage are not autistic, and it re-
mains to be explained why the minority who become
autistic do so. More specific biological hypotheses
are required [p. 397].


SENSORY RESTRICTION


Until very recently, psychologists have 
largely ignored the prenatal psychological 
environment. The assumption seemed to 
have been in the past that all children start 
from a zero point at the moment of birth 
and few significant environmental influences, 
other than those affecting physical growth 
and development, have made an impression 
up until that time. Such an assumption has 
remarkable commonsense validity in view of 
the limited sensory experiences available to 
the fetus, but the question remains as to 
whether this restricted environment is the 
same for all fetuses. Does the prenatal psy- 
chological environment have a different effect 
on the cerebral structures of superior, rapidly 
developing fetuses than it does on those who 
develop more normally?

Part of the answer to this question is sup-
plied by experiments concerned with the ef-
fects of sensory restriction. Schultz (1965)
comments,


restriction of early environmental stimulation in 
animals has produced drastic and enduring effects 
on emotionality, learning ability, activity level, so-
cial behavior, and perception. The optimal level of
sensory variation of the individual is influenced by
early postnatal levels of stimulation and the result-
ing level of cortical arousal as mediated by the
reticular activating system. In other words, the
level of stimulus variation to which the neonate is
exposed functions to influence the optimal range of
cortical arousal appropriate for adaptive behavior
of the adult organism. The adaptation level of the
ascending reticular activating system . . . is thus
determined to some degree by the amount of early
stimulation [pp. 25-26].


Schultz suggests that an impoverished 
sensory environment would prevent differen- 
tiation of cue functions, sensory modalities, 
and of events within modalities. Such a fail- 
ure to differentiate leads to defects in the 
perceptual process. 

Evidence that autism may result from 
sensory deprivation is supported by Rim- 
land’s (1964) review of the literature in 
which he concludes, 


there is obviously a good deal of behavioral simi- 
larity between the victims of early infantile autism 
and the victims of sensory deprivation [p. 104]. 


He notes that subjects who experience long 
periods of sensory deprivation are "detached 
from the external environment,” and that 
they “stare right at you but never see you 
and just sit and look into space and not talk 
[p. 104]." The vacant “long eye" stare usu- 
ally observed in these sensorially deprived 
subjects closely resembles the unfocused 
gaze of autistic children. Furthermore, Rim- 
land's (1964) search of the literature has 
shown that the reticular formation is the 
most likely site in the brain to be associated 
with the phenomenon of sensory deprivation. 
He explains that there is sufficient reason for 
believing that autistic children are so per- 
ceptually impaired that sensory deprivation 
psychosis is entirely within the realm of 
possibility. 


DEVELOPMENTAL READINESS 


An additional concept needs to be consid-
ered in order to answer the question: Does
prenatal psychological environment have a
different effect on the cerebral structures of
superior, rapidly developing fetuses than it
does on those who develop more normally?
This additional concept has to do with the 
developmental readiness of the fetal brain. 
Differences in the maturational levels of 
full-term neonates as evidenced by variation 
in physical size, activity level, and the pres- 
ence of developmental landmarks have been 
noted for decades. These same maturational
differences exist prior to birth. Some fetuses
have cerebral structures that are develop-
mentally ready for sensory stimulation long
before others. The neurophysiology of a
fetus that is the product of intellectually su-
perior parents can reasonably be expected
to show advanced maturation. As Ames 
(1969) has suggested, intellectual superior- 
ity tends to go along with advanced maturity. 

The quantity and quality of early stimula- 
tion have enduring effects on the functioning 
of the brain. Profound environmental depri-
vation produces organic dysfunction (Rutter,
1969) and may simulate “brain damage.”
Significantly, the negative effects of such en-
vironmental deprivation are relatively greater
for genetically brighter organisms (Schultz,
1965).

Given a fetus with a brain in an advanced 
state of developmental readiness for stimula- 
tion but residing within a restricted uterine 
environment, what would be the probable 
results? What are the effects of sensory re-
striction and failure to experience primary
socialization when these two circumstances
occur simultaneously?


PRIMARY SOCIALIZATION AND IMPRINTING 


The striking similarities between the be- 
havior of autistic children and that of non- 
imprinted or artificially imprinted animals 
are too great to ignore. The autistic alone- 
ness of the child and his lack of interest in 
his own species has many parallels in studies 
demonstrating the species-ignoring behavior 
of animals that are not imprinted with their 
own kind. Writing about his experiments 
with geese, Lorenz (1957) says, 


Once their social relations are transposed to a hu- 
man being their behavior does not change even if 
they are kept for years with other members of their 
species... [p. 105]. 


Moltz (1960) points out that once imprint- 
ing occurs, its effect cannot be reversed. 
The stimulus the animal experiences during 
the critical imprinting period becomes either 
the preferred stimulus or the only stimulus 
toward which the animal will move. Scott
(1969) states in his discussion of the pri-
mary socialization in canine and human in-
fants that an individual at the proper period
in life will become attached to anything in
the surrounding environment, both living
and nonliving.

DEWEY J. MOORE AND DAVID A. SHIEK

Addressing himself to the importance of
the critical period concepts, Money (1969)
writes,


The essential elements of the concept are: 1) that 
a developmental readiness of the organism must be 
met by a suitable stimulus environment before the 
given phase of development can take place; 2) there 
is a time limit, after which the developmental op- 
portunity is lost, often forever; 3) the events of 
the developmental phase leave a permanent, or at 
least a very durable, residue; 4) defective or im- 
paired development may leave a permanent residual 
effect; and 5) as applied to social science, there is 
no clear distinction between body and mind for the 
events of the social life as assimilated by the brain 
which is affected by what it assimilates as well as 
effecting in turn, the events outside of itself in 
social life [p. 145]. 

Does the autistic child’s insistence on 
sameness in the environment, his repetitive 
and stereotyped behaviors, his hand regard- 
ing, and his failure to develop speech sug- 
gest that he was imprinted by an atypical en- 
vironment at a critical period in his primary 
socialization? In this essentially unchanging
environment, the fetus has only the heart-
beats of himself and his mother, and perhaps
the other rhythms of their bodies, to affect
him. Is it possible that the autistic child is
so firmly imprinted with his relatively fea-
tureless, secluded, and limited world during
critical developmental periods, when he
should be vividly stimulated and experienc-
ing primary socialization, that he in fact
seems to be living "in his own world" after
birth? Does it explain his apparent need to
maintain environmental sameness?

What are the effects of rhythmic heart- 
beats and body movements on a develop- 
mentally superior fetus? Do these rhythms 
account for the repetitive and stereotyped 
behaviors observed later in life? Does such
“rhythmic imprinting” explain Rimland's
(1964) findings that astonishing musical
ability has been reported in at least nine
studies of autistic children?

Is the fetus “kinesthetically imprinted” on
his own hands? They constitute the major
movable stimuli in his dark prenatal world.
Does this account for the frequently observed
exceptional manipulative skills shown after
birth?

Rutter (1969) notes that at 16 weeks the
normal infant is definitely responding mean-
ingfully to human speech. What happens
when a neurologically advanced, develop-
mentally ready fetus does not receive early
speech stimulation because it is unavailable
to him?

Psychological literature contains many re- 
ports of feral children who remained wild 
after many years of association with humans. 
Was their inability to assume human char- 
acteristics due to a failure to experience pri- 
mary socialization with humans at a critical 
period in their development? 

Spitz (1965), reporting on the work of 
M. Von Senden, notes that several congeni- 
tally blind persons whose sight was restored 
between the ages of 3 to 40 never learned to 
"see" although they had vision. Had the 
critical period for imprinting for meaningful 
visual perceptions passed? Was their be- 
havior similar to autistic children who “look 
through” persons but never "at" them? Do 
humans as humans have any meaning for 
autistic children, or are they similar to lower 
animals who have been prevented from being 
imprinted on their own species? 

Are autistic children, by their frequent 
obsessions with inanimate objects, demon- 
strating Scott's (1969) finding that an in- 
dividual at the proper time in life will be- 
come attached to both living and nonliving 
things in the surrounding environment?

The period for primary socialization in the
human infant extends from approximately 
six weeks to six months after birth. It is 
during this time that the infant forms his 
first meaningful responses to his environ- 
ment (Scott, 1969). This is the critical 
period for much of his important species 
imprinting. What happens when a fetus, 
who has inherited superior neurological po- 
tential from his parents, approaches his ninth 
month? He is hardly an “average” fetus. 
If his eventual IQ is projected to be 170 or 
more, his unstimulated brain has already 
“passed through” the first six months of 
what would ordinarily be his postnatal life. 
Consequently the optimal period for his pri- 
mary socialization has passed. 

Are autistic children “mutants of espe-
cially intelligent progenitors” as Treffert
(1970) suggests they may be? Are autistics
not just superior organisms, but do they
have potential intellectual endowment so rare
that it might occur in only the upper extreme
limits of the distribution of intelligence? In- 
terestingly, the prevalence of infantile au- 
tism is approximately 2 per 10,000 (Treffert, 
1970). This prevalence is almost identical to 
the number of persons thought to have rates 
of intellectual development which might be 
expressed as quotients of more than 170 if 
tests were available with which we could 
measure them. 


SUGGESTIONS FOR RESEARCH 


The most obvious research need for test- 
ing the hypotheses stated in this paper re- 
volves around the necessity for determi- 
ning the intellectual levels of parents who 
have autistic children. These investigations 
should follow Kanner's definition of autism 
carefully to avoid contaminating the results 
with scores of parents whose children have 
"autistic symptoms" but are not truly early 
infantile autistics. It may be necessary for 
researchers to pay particular attention to the 
quality of parental intellectual activity as 
well as the quantity. This seems especially 
advisable in view of Kanner's (1965) ob- 
servation that the parents of autistic children, 
while they may have meager incomes, are 
preoccupied with abstractions of a scientific, 
literary, artistic, or mathematical nature. 

More research is needed to test the neuro- 
physiological aspects of autism. In this re- 
spect, the work by Hutt and Hutt (1969) 
mentioned earlier seems an especially appro-
priate area of investigation. Their experi-
ments led them to hypothesize that autistic
children are in a high state of physiological 
and behavioral arousal. This paper has sug- 
gested that in autism it is believed that the 
fetus becomes habituated to a low level of 
stimulus variations. Reference was made 
previously to Schultz's (1965) statement
that the level of stimulus variation the or- 
ganism experiences early in its development 
is an important determinant of the optimal 
cortical arousal for adaptive behavior. Be- 
cause there is little stimulation prenatally, 
the child's optimal cortical arousal is low. 
After birth, normal levels of environmental 
stimulation constitute an unaccustomed over-
load with consequent cortical hyperarousal
1968; Shepard, 1964; Thurstone & Jones, 1957)
has been based on the assumption that the 
numerical values of the dependent variable can 
be regarded as interval-scale measures of the 
relevant psychological attribute.” The subject 
of the sentence is not Anderson's work; his 
paper was mentioned, along with other reviews, 
because it cites many studies in which the 
interval-scale assumption was utilized. 

Anderson has indeed discussed the possibility 
of monotone transformations of the dependent 
variable. However, his treatment of this ques- 
tion is inadequate: He presents no general 
methods for testing whether appropriate trans- 
formations exist, and consequently can reach 
wrong conclusions (e.g., Sidowski & Anderson, 
1967). 

We agree that numerical responses often do 
contain useful “metric” information. When and 
why they do so is itself a subject for qualitative 
investigation. Stevens’ methods of magnitude 
estimation and cross-modality matching, for ex- 
ample, lend themselves to analysis using con- 
joint-measurement methods. 

One final remark: Anderson describes his 
work by inserting adjectives and adverbs with 
positive affective loadings but little precise con-
tent, for example, “organically” related and
“functional” measurement. He declares it to
be “superior on epistemological grounds,” “an
organic part of behavior theory,” and “a clear 
rationale and a basis for the general study of 
judgment.” Such assertions are very poor sub- 
stitutes for adequate reasoning about the foun- 
dations of psychological measurement. 
THE EDITOR


Krantz and Tversky apparently agree with 
the main point of my letter, that functional 
measurement does not assume an equal-interval 
response scale. Now, however, they claim that 
my treatment of response scaling is “inade- 
quate,” which I should say is incorrect. The 
power series method (e.g., Anderson, 1970, p. 
159) has limitations, of course, but it has some 
definite advantages over nonmetric methods,
especially for “fallible,” that is, real data. 

Krantz and Tversky are seriously in error in 
stating that I ignore qualitative laws. As one 
example, qualitative analysis has provided 
critical support for the averaging hypothesis in 
repeated experiments (see Anderson, 1971, p. 
181). This finding has important consequences 
for measurement theory. 

Krantz and Tversky also claim that I quoted 
them out of context. The cogency of the argu- 
ment in their letter (that their reference to 
Anderson, 1970, should not have been taken as 
a reference to my work) is perhaps better 
judged by others. For my own part, I wrote 
them two or three times concerning this same 
statement in their earlier technical report; they 
could easily have straightened out the matter 
had they chosen to do so.

Our remarks about the advantages of qualita-
tive analysis have been tacitly accepted by
Anderson. The systematic study of qualitative
laws is a major part of the subject matter of
conjoint measurement. We hope, therefore,
that conjoint-measurement methods will hence-
forth be used more systematically by Anderson
in the analysis of composition rules and also of
response scales.
Fig. 1. Probability of perceiving a word as a
function of repetition trials (1 through 25) and 
duration. (The base duration, T, was found in- 
dividually for each subject and was typically about 
15 milliseconds.) (From Haber, 1969; originally 
in Haber and Hershenson, 1965. Reprinted from 
an article by R. N. Haber and M. Hershenson 
published in the January 1965 Journal of Experi- 
mental Psychology. Copyrighted by the American 
Psychological Association, Inc.) 


principle is not broad enough to account for 
integration of information over changes 
which are not a direct result of self-produced 
movement of the receptors. In what fol- 
lows, a model is proposed which goes some 
way toward suggesting a basis for visual 
storage and sequential processing in the 
sense of integrating stored visual information 
with new inputs, and hence a basis for per- 
ceptual stability and clarity. 


REPETITION AND PERCEPTION 

Perhaps the most intriguing recent finding 
on visual storage is Haber’s clarity effect: 
an observer who views a briefly presented 
stimulus array repeatedly, and at constant 
luminance and duration for each repetition, 
reports that successive presentations increase 
in perceptual clarity. Obviously, the effect 
is obtainable only over a restricted range of 
conditions; the presentations must not be 
strong enough to yield complete clarity on 
the first exposure, nor so weak as to yield no 
perception at all after repeated exposures. 
Nevertheless, it is remarkable that repetition 
per se can have such a powerful effect. The 
finding has been well documented under a 
variety of conditions, recently summarized
by Haber (1969).


The major results are shown in Figures
1 and 2. The experiments have all been con-
cerned with seven-letter arrays that spell
words. The first figure shows how exposure 
duration affects clarity, the second that both 
familiarity and prior knowledge of what the 
word is increase the probability of its being
seen, but not the change in clarity from trial 
to trial. It is very unlikely that guessing is 
an important factor in the experiments, since 
prior knowledge of the word does not elimi- 
nate the repetition effect. Taking the two 
figures together, it appears that the various 
experimental manipulations (exposure dura- 
tion, familiarity, prior knowledge) determine 
both the initial and final levels of perform- 
ance, but not the rate at which clarity grows 
with repetition, a point Haber himself makes 
with some force (Haber, 1969, p. 277).

How may such a finding be explained?
Haber favors an explanation in terms of the 
arousal of Hebbian cell assemblies. In a 
general way, this seems reasonable, since 
Hebb is one of the few theorists who has 
considered the role of repetition in percep-
tual organization in any systematic way.

Fig. 2. Probability of perceiving a word as a
function of repetition trials, duration (low: 15
milliseconds; middle: 20 milliseconds; [...]
milliseconds), and language (English and Turk-
ish). (From Haber, 1969; originally [...]
published in the Canadian Journal of Psychology
[...] 19. Copyrighted by the Canadian
Psychological Association, 1965.)

The problem with the cell assembly, how-
ever, is that it is too vague and general a
concept; it will oblige by performing almost 
any job assigned to it. In this particular 
case, it can be used both to explain the 
growth in clarity (successive arousals of an 
assembly lead to its firing more strongly) 
and also the different levels reached for different
sorts of words (better organized assemblies
fire more strongly than less-well-organized
ones). However, one should then
expect rate of growth and final clarity level 
to be correlated, since both would be func- 
tions of prior strength of the relevant cell 
assembly. The data, per contra, suggest 
something different. It seems that level of 
clarity is affected by differences in familiar- 
ity and in signal strength or duration, but 
rate of growth of clarity is not. The differ- 
ences between the sets of curves in Figures 
1 and 2 can be accounted for by variation in 
a single parameter which specifies asymp- 
totic performance. I propose that a second, 
constant, growth parameter has to do with 
signal processing at a lower level of informa- 
tion extraction than that at which word 
familiarity has its effect. Notice that the 
units of all the displays are similar—printed 
letters—so that a constant parameter which 
has to do with initial processing of the elements
for spatial-structural information
would hardly be surprising. The hypothe- 
sis is that the signal processing involved is 
a form of autocorrelation which determines 
the growth rate parameter.
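The two-parameter description above can be made concrete with a small numerical sketch. The functional form used here (clarity approaching an asymptote by a fixed proportion on each trial) is an assumption chosen for illustration, not a form given in the text, and the names `clarity`, `RATE`, and the asymptote values are hypothetical.

```python
# Illustrative sketch only: the exponential-approach form is an assumption,
# as are the parameter values. It shows how curves that differ only in their
# asymptote can share one constant growth-rate parameter.

def clarity(trial, asymptote, rate):
    """Clarity after `trial` presentations, rising toward `asymptote`."""
    return asymptote * (1.0 - (1.0 - rate) ** trial)

RATE = 0.3  # hypothetical constant growth parameter, same for all conditions

# Hypothetical asymptotes standing in for exposure duration, familiarity, etc.
conditions = {"short exposure": 0.4, "long exposure": 0.9}

curves = {
    name: [clarity(n, asymptote, RATE) for n in range(1, 11)]
    for name, asymptote in conditions.items()
}

def fraction_of_asymptote(curve, asymptote):
    """Rescale a curve by its asymptote so that only the growth rate remains."""
    return [c / asymptote for c in curve]

rescaled = {
    name: fraction_of_asymptote(curves[name], asymptote)
    for name, asymptote in conditions.items()
}
```

Under this assumed form the conditions differ in both their initial and final levels, yet once each curve is divided by its own asymptote the rescaled curves coincide, which is the pattern the text attributes to Figures 1 and 2.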

This hypothesis may seem to be a gratui- 
tous leap beyond the evidence; I shall justify 
it by showing that there are several other 
features of detail vision and perceptual clar- 
ity which support the notion, which indeed 
lend it quite strong confirmation. Two have 
to do with major puzzles in pattern vision: 
the first of these is the disappearance of 
retinally stabilized images, and their regen- 
eration in nonrandom ways; the acuity of 
the eyes (particularly binocular acuity) in 
the presence of relatively large nystagmic
movements and fixation errors is the other.
Neither phenomenon has heretofore been explained
adequately. Third, I shall show
that neurophysiological findings on contour
processing in the mammalian visual system 
suggest a very plausible basis for the lim- 
ited form of autocorrelation function re- 
quired by the model. Fourth, I shall show 
how an autocorrelation model solves rather 
elegantly the problem of the detection of sig- 
nals in noise, given the physical and physio- 
logical constraints under which the visual 
system normally operates, and the evolution- 
ary pressures to which it has been subjected. 
The biological utility of the proposed form 
of signal processing will thus be demon- 
strated. Finally, a correlational model will 
be shown to be a possible basis for the forms 
of information storage, integration, and inter- 
ference discussed in the previous section. 


CONVOLUTION THEORY AND PATTERN 
SIMILARITY 


Several proposals have been made for em- 
ploying autocorrelation functions, which are 
a particular type of convolution, in pattern 
recognition. MacLachlan (1962) suggested 
useful functions for them in optical and me- 
chanical devices for information retrieval and 
pattern recognition, and they play a major 
role in recent work on picture processing by 
computer (see Rosenfeld, 1969). Engel 
(1969) has very successfully employed them 
in modeling the phenomena of binocular 
brightness mixing; indeed, it was his work 
which suggested to me the more general ap- 
plications described in this paper. 

The mathematics of convolution theory required
for the present purpose are straightforward.
Given a pair of continuous two-dimensional
patterns p(xy) and p'(xy),
their cross-correlation is defined by the functional

$$A(uv) = \iint p(x, y)\, p'(x + u, y + v)\, dx\, dy, \tag{1a}$$

where u and v are new parameters which
can be thought of as internal coordinates
originating on one of the patterns. The complete
expression of the cross-correlation is
then:

$$\iint A(uv)\, du\, dv \tag{2a}$$

over the appropriate values of u and v.

For a discrete, or digitalized, pattern the
analogous expressions are:

$$A(uv) = \sum_{x} \sum_{y} p(x, y)\, p'(x + u, y + v) \tag{1b}$$

and

$$\sum_{u} \sum_{v} A(uv). \tag{2b}$$


In either case, Expressions 1a and 1b state
that the brightness values at all points (x,
y) are multiplied by the brightness values
at (x + u, y + v) and the cross-products
summed. Expressions 2a and 2b state that
these quantities are summed for all values
of u and v. In practice, some limitation of
the range of values for u and v is likely, so
that the operations become local, position-invariant
transformations of the original pattern
(Rosenfeld, 1969, Ch. 4).
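The discrete expressions can be sketched in a few lines of code. This is a minimal illustration, assuming that patterns are small rectangular grids of brightness values and that points outside a grid count as zero; the function names and the `max_shift` limit on u and v are hypothetical choices, not part of the original formulation.

```python
# Minimal sketch of the discrete cross-correlation (Expression 1b) and its
# sum over a limited range of shifts (Expression 2b). Zero padding outside
# the grid and the helper names are assumptions made for illustration.

def value(pattern, x, y):
    """Brightness at (x, y); points outside the grid count as 0."""
    if 0 <= y < len(pattern) and 0 <= x < len(pattern[0]):
        return pattern[y][x]
    return 0

def cross_correlation(p, q, u, v):
    """Sum over x and y of p(x, y) * q(x + u, y + v)."""
    return sum(
        value(p, x, y) * value(q, x + u, y + v)
        for y in range(len(p))
        for x in range(len(p[0]))
    )

def total_correlation(p, q, max_shift):
    """Sum of A(uv) over all shifts with |u| and |v| at most max_shift."""
    shifts = range(-max_shift, max_shift + 1)
    return sum(cross_correlation(p, q, u, v) for u in shifts for v in shifts)

# A vertical bar, and the same bar displaced one column to the right.
bar = [[0, 1, 0],
       [0, 1, 0],
       [0, 1, 0]]
shifted_bar = [[0, 0, 1],
               [0, 0, 1],
               [0, 0, 1]]

overlap_unshifted = cross_correlation(bar, shifted_bar, 0, 0)  # no overlap
overlap_aligned = cross_correlation(bar, shifted_bar, 1, 0)    # full overlap
```

With u = 1 and v = 0 the two bars coincide and all three cross-products are nonzero; with no shift they do not overlap at all. Restricting `max_shift` corresponds to the local limitation on u and v mentioned in the text.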

The autocorrelation of a pattern with itself
is simply obtained by substituting p(xy)
for p'(xy); that is, it is obtained from

$$A(uv) = \iint p(x, y)\, p(x + u, y + v)\, dx\, dy \tag{3a}$$

over the appropriate values of u and v. The
corresponding expressions for discrete
patterns are obvious.
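For a discrete pattern the autocorrelation can be sketched directly. The example below assumes a small binary grid with zero padding outside it; the L-shaped test pattern and the function names are hypothetical. It illustrates two general properties of an autocorrelation: the value at zero shift is the largest, and the function is symmetric under reversal of the shift.

```python
# Sketch of the discrete analogue of Expression 3a. The pattern, the zero
# padding, and the names are illustrative assumptions.

def value(pattern, x, y):
    """Brightness at (x, y); points outside the grid count as 0."""
    if 0 <= y < len(pattern) and 0 <= x < len(pattern[0]):
        return pattern[y][x]
    return 0

def autocorrelation(p, u, v):
    """Sum over x and y of p(x, y) * p(x + u, y + v)."""
    return sum(
        value(p, x, y) * value(p, x + u, y + v)
        for y in range(len(p))
        for x in range(len(p[0]))
    )

# A hypothetical L-shaped pattern.
pattern = [[1, 0, 0],
           [1, 0, 0],
           [1, 1, 1]]

shifts = [(u, v) for u in range(-2, 3) for v in range(-2, 3)]
table = {(u, v): autocorrelation(pattern, u, v) for (u, v) in shifts}
peak = table[(0, 0)]
```

Here `peak` is simply the sum of squared brightness values, and every nonzero shift gives a smaller overlap.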

Fig. 3. The letter A in the domain ab. The vectors
aa', bb', and cc' all have the same length and
orientation, so that u and v are the same for all.
(The products of the densities, or brightnesses,
at the termini, are the quantities of interest.)
(Adapted from MacLachlan, 1962.)

Fig. 4. When identical patterns, as in (a), are
brought into coincidence (u = v = 0), the common
area (shown hatched) is greater than that obtained
with unlike patterns at maximum coincidence, as
in (b).

For two-dimensional spatial patterns,
Expressions 1a, 1b, 2a, and 2b have a
simple interpretation, which is demonstrated
in Figure 3. For given values
of u and v, A(uv) is a measure of the overlap
of two patterns; in the case of autocorrelation,
it is the degree of overlap of a pattern
with itself, at a given shift u along the horizontal
(x) axis, and v along the vertical (y)
axis. Integrating (or summing, in the
case of discrete representation) the values of
A(uv) over the appropriate ranges of u and
v will then give the measure $\iint A(uv)\, du\, dv$,
which expresses overall the cross-correlation
or autocorrelation as the case
may be. Clearly the autocorrelation of a
pattern with itself has some characteristics
not shared by a cross-correlation. In
particular, there is an exceptionally high
value of the function when u = v = 0, that
is, where the pattern completely overlaps
itself. This is illustrated in Figure 4,
adapted from MacLachlan (1962). MacLachlan
shows that $\iint A(uv)\, du\, dv$ is not
an appropriate measure of pattern similarity
under all circumstances, but that
$\iint A^2(uv)\, du\, dv$ is (the discrete analogue
being $\sum_u \sum_v A^2(uv)$). The second expression
he calls the similarity function, which
may be written S(pp') and is the same as
Engel's variance function V(pp) in the case
where the patterns are identical (i.e., when
A(uv) is an autocorrelation). Engel points
out that V(pp) is a good measure of the
spatial-structural information in a pattern.
S(pp') similarly is a measure of shared
structure between two patterns.
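The difference between the summed overlap and the similarity function can be illustrated with a toy computation. The sketch assumes 3 x 3 binary grids, zero padding, and shifts of up to two cells in each direction; the two bar patterns and all function names are hypothetical.

```python
# Sketch comparing the sum of A(uv) with the sum of its squares (the
# discrete similarity function). Patterns and names are illustrative
# assumptions, not taken from MacLachlan or Engel.

def value(pattern, x, y):
    """Brightness at (x, y); points outside the grid count as 0."""
    if 0 <= y < len(pattern) and 0 <= x < len(pattern[0]):
        return pattern[y][x]
    return 0

def correlation(p, q, u, v):
    """A(uv): sum over x and y of p(x, y) * q(x + u, y + v)."""
    return sum(
        value(p, x, y) * value(q, x + u, y + v)
        for y in range(len(p))
        for x in range(len(p[0]))
    )

def summed_overlap(p, q, max_shift):
    """Sum of A(uv) over the allowed shifts."""
    r = range(-max_shift, max_shift + 1)
    return sum(correlation(p, q, u, v) for u in r for v in r)

def similarity(p, q, max_shift):
    """Sum of A(uv) squared over the allowed shifts."""
    r = range(-max_shift, max_shift + 1)
    return sum(correlation(p, q, u, v) ** 2 for u in r for v in r)

vertical = [[0, 1, 0],
            [0, 1, 0],
            [0, 1, 0]]
horizontal = [[0, 0, 0],
              [1, 1, 1],
              [0, 0, 0]]

same_sum = summed_overlap(vertical, vertical, 2)
unlike_sum = summed_overlap(vertical, horizontal, 2)
same_similarity = similarity(vertical, vertical, 2)
unlike_similarity = similarity(vertical, horizontal, 2)
```

In this example the plain sum cannot tell the identical pair from the unlike pair, because over the full range of shifts it depends only on the total brightness of each pattern. The squared form is larger for the identical pair, since it weights the concentrated overlap near zero shift more heavily.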

So much for the formalism of convolu- 
tions: further information on autocorrelation 
and its application to picture processing may 
be found in Rosenfeld (1969, esp. Ch. 4, 5,
and 6). If the ideas are to be applied to
pattern processing in the visual system, it
will be necessary to consider a few practical 
constraints which are suggested by the neu- 
rophysiology of the system, and also to pro- 
pose a mechanism for the generation of the 
autocorrelation functions. This topic is 
taken up in detail in a later section; for the 
moment it may be noted that in his computer 
simulations of binocular brightness mixing, 
Engel was able to obtain remarkably good 
agreement with the human experimental 
data by using a relatively coarse approximation
to A(uv) and V(pp). Perhaps, then,
an indication of how fairly crude versions 
of these functions might be computed in the 
visual system will also be sufficient for our 
purposes. 

Autocorrelations of two-dimensional spa- 
tial patterns are obtained by shifting a pat- 
tern with respect to itself and evaluating the 
amount of overlap for different degrees of 
shift. The similarity function S(pp) is 
computed from the same basic data, and 
most of the useful information can be ob- 
tained by relatively small shifts, provided 
that contour information is extracted before 
the autocorrelation is performed. We know 
from the work of Hubel and Wiesel (1962, 
1968) and others that contour information is 
abstracted at least at the first cortical proces- 
sing level in mammals, and also that small 
shifts of the retinal pattern of stimulation 
with respect to itself are ensured under nor- 
mal circumstances by elaborate patterns of 
rapid and involuntary eye movements 
(Krauskopf, Cornsweet, & Riggs, 1960; 
Ratliff & Riggs, 1950). Prima facie, then, 
there is evidence that mechanisms for computing
some form of autocorrelation are
present at least in higher visual systems.


IMAGE CLARITY AND AUTOCORRELATION 
Repetition 


To return to Haber's repetition effect, the
specific hypothesis is that a necessary condition
for the clear perception of a visual
image is that the visual system perform an
autocorrelation on the input array, a process
which takes a finite amount of time, and
that the similarity function S(pp), or some
close analogue of it, is also computed. The
perceived clarity of the image is determined
by S(pp); it is at least a function that increases
monotonically with S(pp). The reason
why brief tachistoscopic presentations of
patterns do not lead to the perception of
clear images is that insufficient time is available
for the complete computation of S(pp).
If, however, values of S(pp) (including incomplete
ones) are stored for a short time,
subsequent inputs of the same pattern can
summate with the stored version and, where
the stored version is incomplete, can enter
into its more complete
specification.
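One way to picture the storage-and-summation idea is the toy simulation below. It rests on strong assumptions that are not in the text: each brief presentation is taken to allow only a fixed number of shifts (`budget`) to be evaluated, small shifts are evaluated first, and the stored partial value of S(pp) persists undiminished between presentations. All names and values are hypothetical.

```python
# Toy sketch: a partially computed S(pp) is stored and completed by later
# presentations of the same pattern. The processing budget, the shift
# ordering, and the lossless store are illustrative assumptions.

def value(pattern, x, y):
    """Brightness at (x, y); points outside the grid count as 0."""
    if 0 <= y < len(pattern) and 0 <= x < len(pattern[0]):
        return pattern[y][x]
    return 0

def autocorrelation(p, u, v):
    """A(uv) for a pattern correlated with itself."""
    return sum(
        value(p, x, y) * value(p, x + u, y + v)
        for y in range(len(p))
        for x in range(len(p[0]))
    )

def stored_similarity_by_presentation(p, max_shift, budget):
    """Stored partial S(pp) after each presentation."""
    r = range(-max_shift, max_shift + 1)
    # Evaluate small shifts first (an assumption about processing order).
    shifts = sorted(
        ((u, v) for u in r for v in r),
        key=lambda s: (abs(s[0]) + abs(s[1]), s),
    )
    stored = 0
    history = []
    for start in range(0, len(shifts), budget):
        for u, v in shifts[start:start + budget]:
            stored += autocorrelation(p, u, v) ** 2
        history.append(stored)
    return history

bar = [[0, 1, 0],
       [0, 1, 0],
       [0, 1, 0]]

history = stored_similarity_by_presentation(bar, max_shift=2, budget=3)
```

In this sketch the stored value never decreases from one presentation to the next and levels off once every shift has been evaluated, so any clarity measure that increases monotonically with S(pp) would grow with repetition and then reach a ceiling.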

Lack of clarity on first presentation of a 
letter array is, as Haber (1969) points out, 
the single best indicator of the final level of 
clarity that will be achieved under the given 
conditions. The autocorrelation model ex- 
plains this by saying that lack of clarity on 
first presentation is an index of failure to 
complete the formation of S(pp); the less 
S(pp) can be specified on Trial 1, the less 
well can it ever be specified under the given 
stimulating conditions. If the model is on 
the right track, one would predict that the 
spatial-structural information in the image 
would affect the rate of growth of S(pp) 
and hence of clarity, whereas signal strength
(e.g., brightness) might be relatively ineffective.
So far, Haber's experiments have
all been with letter arrays, so that the type 
of spatial-structural information has been 
maintained relatively constant. Specific pre- 
dictions about the effects to be expected in 
the repetition paradigm when stimulus in- 
formation is varied systematically will be 
given later. 


Image Stabilization and Disappearance 


If the image of a target is stabilized on the 
retina, so that eye movements cause no rela- 
tive shift between image and retina, the per- 
ceived target rapidly loses clarity and fades 
away (Ditchburn & Ginsborg, 1952; Riggs,
Ratliff, Cornsweet, & Cornsweet, 1953). If
the stabilization is complete, image fading is 
also complete and permanent (Barlow, 
1963). This surprising phenomenon has 
never been adequately explained; Barlow 
proposed that there is a rapidly adapting 


information channel for detail vision which 
requires constant change of stimulus infor- 
mation to remain active, and several writers 
have remarked on the fact that the contour 
coding system based on the retinal receptive 
field organization of Hubel and Wiesel re- 
sponds best to moving contours, and at 
higher levels perhaps only to moving con- 
tours. Heckenmueller (1965) points out 
that stabilization could lead to selective loss 
of function in one or more of the types of 
processing achieved at different levels of re- 
ceptive field organization. 

The specific hypothesis here is that image
fading is fundamentally the obverse of the 
repetition clarity effect. In the latter, in- 
creases in clarity occur because successive 
inputs add increments of information leading 
to better specification of the autocorrelation 
function and S(pp); in the former, there is
successive loss of information on which the 
autocorrelation function can be computed. 
If the clear perception of a contoured pat- 
tern depends on the concurrent formation 
and maintenance of S(pp), and if the gen- 
eration and maintenance of S(pp) depends 
on shifts of the pattern with respect to itself 
which are mediated by eye movements, the 
result follows obviously. The only problem 
then is to explain why the target is ever seen 
when the retinal image is stabilized, since 
by definition no shift of the pattern with 
respect to itself is possible. The problem is
not insurmountable, however; a plausible 
extension of the notion of correlation on the 
input can handle it. This is presented in a 
later section. 


Image Regeneration 


The most complete information on image 
regeneration under stabilized—or near sta- 
bilized—conditions is contained in papers by 
Pritchard, Heron, and Hebb (1960) and by 
Evans (1965). The authors of both papers 
stress the nonrandom character of regeneration,
and interpret their findings in terms of
Gestalt-like processes in pattern perception 
and, in the former case, in terms of the cell- 
assembly model. In summarizing this work 
elsewhere (Dodwell, 1970, Ch. 9), it was
pointed out that this nonrandomness of regeneration
is in many respects similar to the
nonrandomness found in rivalry between 
pattern elements during binocular stimula- 
tion of the eyes with noncongruent stereo- 
grams, and should be able to tell us some- 
thing about the processing of patterns in 
the visual system. More definitely, one may 
postulate that at least some of the effects are 
caused by the way in which the autocorrela- 
tions and S(pp) are computed. Both Prit- 
chard et al. and Evans reported that "struc- 
tured" disappearances are much more fre- 
quent than "random" or “unitary” ones; 
straight lines tend to fade and regenerate as 
units, pairs of parallel lines tend to fade 
together, and horizontal lines are more 
dominant than others. Patterns composed 
of curved contours tend to be less "active" 
than patterns consisting of collections of 
straight lines; the latter fade and regenerate 
more readily. These findings are consistent
with the hypothesis that perceptual clarity 
is mediated by signal processing which is 
based on a form of autocorrelation, particu- 
larly if the computations are based on Hubel- 
and-Wiesel-type straight-line detectors. The 
detailed exposition of this hypothesis is pre- 
sented later; here it should be noted that 
just as for the Haber clarity effect, a signal- 
processing model cannot explain all the find- 
ings because there are "higher level" effects 
which depend on attention and meaningful- 
ness. Pritchard et al. report, for example, 
that meaningful targets (such as a face) 
tend to fade less readily than meaningless tar- 
gets, and the fading seems to be determined 
to some degree by meaningful connections 
between parts of a target; an element fades 
less readily when the observer's attention is 
focused on it, and so on. Rather similar 
findings have also been reported by McKin- 
ney (1966). My argument here is the same 
as before; in criticizing the use of the cell-assembly
model to explain the repetition effect,
attention was drawn to the overinclusive
nature of that model. Information
processing is surely occurring at several
levels, and the fading and regeneration effects
in simple targets can be explained by
signal processing at a level prior to that at
which expectations, meaningfulness, and so
on, have their impact.


Visual Acuity


It has been proposed (e.g., Andersen &
Weymouth, 1923) that one function of "in- 
voluntary" eye movements is to increase 
acuity by stimulating large numbers of reti- 
nal points and thereby increase the amount 
of data on which a “mean retinal local sign” 
can be computed. Evidence on acuity for 
stabilized targets seems to contradict this 
notion. Riggs et al. (1953) showed that 
acuity for a stabilized target (before it 
fades) is actually somewhat better than in 
the nonstabilized condition. Further evi- 
dence reported by Keesey (1960) demon- 
strates that there are no consistent effects of 
eye movements on acuity, and that increase 
in acuity over time (typically between about 
.01 and 1.0 second) is similar for stabilized 
and nonstabilized targets. Shortess and 
Krauskopf (1961) report a similar finding 
for stereoacuity. Although eye movements 
are necessary for the maintenance of visual 
clarity, they do not themselves contribute to 
increasing the sharpness of detail vision. 

Perhaps the most striking fact about the 
relations between acuity and eye movements 
is that reported by Krauskopf et al. (1960). 
Physiological nystagmus in the two eyes is 
uncorrelated; saccades tend to be yoked, but
are not direct vergence adjusters. This 
means that even an experienced observer 
attempting to maintain steady binocular fix- 
ation is not able to do so with great accu- 
racy. Krauskopf et al. measured the stan- 
dard deviation of convergence error for two 
experienced observers at greater than two 
minutes of arc for sample durations of 60
seconds, and well over one minute of arc for 
one observer over a period of only 2 seconds. 
Yet, stereoacuity has been reported to be of 
the order of five seconds of arc. It seems 
clear that in order to extract information ef- 
ficiently from the “noisy” analogues of the 
visual field provided by eyes subjected to 
constant tremor, drift, and saccades (Ratliff 
& Riggs, 1950), some form of statistical 
averaging, or smoothing, must occur. The 
present hypothesis is that a sufficient opera- 
tion for this extraction can be based on the 
autocorrelation of a visual image with itself, 
and in the case of binocular vision, the cross-correlation
of the images in the two eyes
(Engel, 1969). 
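The size of the gap between fixation error and stereoacuity can be put in rough numerical terms. The calculation below is only an illustration: it assumes independent samples combined by simple averaging, so that the standard error falls as the square root of the number of samples, and it treats the reported acuity as if it were a standard error. Neither assumption is made in the text, which proposes a correlational operation rather than plain averaging; the function name is hypothetical.

```python
import math

# Rough illustration, assuming independent samples and simple averaging.
# This is not the correlational mechanism proposed in the text; it only
# indicates how much smoothing the reported figures would call for.

def samples_needed(sd_single_arcsec, sd_target_arcsec):
    """Number of independent samples whose mean has the target standard error."""
    return math.ceil((sd_single_arcsec / sd_target_arcsec) ** 2)

convergence_sd_arcsec = 2 * 60  # "greater than two minutes of arc", in seconds of arc
stereoacuity_arcsec = 5         # "of the order of five seconds of arc"

n = samples_needed(convergence_sd_arcsec, stereoacuity_arcsec)
```

Under these assumptions several hundred independent samples would be required, which gives some sense of why a form of statistical smoothing over space and time is taken to be necessary.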

Specifically, the hypothesis accounts for 
the fact that both monocular and binocular 
acuity are very much finer than the vagaries 
of eye movements would lead one to expect. 
If the eye movements are thought of sim- 
ply as a necessary condition for the computa- 
tion of the autocorrelation and S(pp), these 
functions need not themselves be related di- 
rectly to the extent of the eye movements, 
and more probably depend principally on the 
grain of the retinal mosaic and the charac- 
teristics of the contour-coding units in the 
primary visual areas of the brain cortex. 

Monocular acuity is initially at least as 
good for a stabilized as for an unstabilized 
target: let us postulate for the moment that 
there is a correlational process (detailed in 
the next section) equivalent to the auto- 
correlation on the initial input to account 
for that fact. It was argued earlier in con- 
nection with repetition and clarity that the 
signal processing must take a finite amount 
of time, and that maximum clarity (acuity) 
will only be achieved when the processing 
is complete. Although Haber claims that 
signal duration is not itself an important fac- 
tor in the growth of perceived clarity, there 
is a suggestion in his data that duration may 
be traded off against repetition. This is 
illustrated in Figures 1 and 2; it could be 
argued that under his conditions, a total 
exposure of between 100 and 150 millisec- 
onds is needed to attain full clarity. It 
would hardly be surprising if processing for 
fine details (e.g., in acuity determination)
were to take several times as long as this, 
as in Keesey’s experiment. However, this 
is straining the evidence; we can at least 
say that there is nothing grossly inconsistent 
about the time course of repetition clarity, 
of increases in acuity with time, and of 
stopped image fading. 


A NEUROPHYSIOLOGICAL BASIS FOR 
AUTOCORRELATION 


In keeping with the general idea that the 
basis for perceptual clarity effects should 
be sought "early on" in the visual processing
system, it is next demonstrated that the
known neurophysiology of contour coding
provides a very plausible basis for the com- 
putation of autocorrelations. It has already 
been suggested that populations of Hubel- 
and-Wiesel-type detector units for edges and 
straight-line segments may fulfill the requi- 
site conditions. There appears to be a sur- 
prising degree of redundancy in the coding 
arrangements, not all of which can be ac- 
counted for by the fact that some cortical 
units are binocular disparity detectors (Bar- 
low, Blakemore, & Pettigrew, 1967; Petti- 
grew, Nikara, & Bishop, 1968). Cells are 
arranged in columns normal to the surface
of the cortex in Areas 17, 18, and 19 in both 
cats and monkeys, nearly all cells within a 
column being selective for a contour in a 
specific orientation, but also within a fairly 
circumscribed region of the retina (Hubel &
Wiesel, 1962, 1968). Most of the cells re- 
spond particularly well when the line seg- 
ment to whose orientation they are tuned 
moves in a direction perpendicular to its ori- 
entation. Consider the somewhat idealized 
situation in which the external pattern and 
the head are both stationary, but the eyes 
move; consider also one pattern element, a 
straight line, which is in the right position 
and orientation to stimulate the cells of a 
particular column. That component of the 
eye movement which is orthogonal to the 
preferred orientation for the given column 
will in effect shift the line with respect to 
itself and cause brisk outputs from a subset 
of units within the column. As the ampli- 
tude of nystagmus varies, or as the eyes 
drift, the subset of strongly stimulated units 
will also change. But, given that within a 
column there are units with different types 
of receptive field (“on” center, “off” center, 
simple and complex, of different sizes, etc.), 
the pattern of firing would specify a stimulus 
element more uniquely than would any one
unit and its associated receptive field alone. 
Here, then, we have a mechanism for signal 
processing of the proposed type. Stimulus 
detection is based on the statistical evalua- 
tion of the outputs from a population of units 
over space and time, an evaluation which 
can transcend the acuity or reliability of any 
one unit on its own. The process is one of 
shifting an element with respect to itself
and continuously evaluating the result; one
might say that, initially at least, this is a 
cross-correlation between the pattern ele- 
ment (line segment) and the set of receptive 
field units within a column. The output 
from this column constitutes “matched fil- 
tering” for the pattern element in question 
(Rosenfeld, 1969, pp. 71-78). The problem 
for the visual system now is to decide 
whether this filtered output arises from a 
stimulus pattern that is constant in time, or 
whether the input is changing. This is 
where autocorrelation becomes important. 
It is reasonable to talk of the autocorrelation 
of an image with itself where the input pat- 
tern is known to remain the same, but it 
seems probable that the visual system cor- 
relates successive time samples on the input 
to estimate the degree of congruence. We 
can talk of autocorrelation only when the 
successive time samples in fact contribute to 
the growth of S(pp) for a particular pat- 
tern—and hence to the pattern’s perceived 
clarity, according to the model.
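The idea of correlating successive time samples to estimate their congruence can be sketched as follows. The sketch assumes that the filtered output of a column at one moment can be summarized as a list of firing rates, and it uses a normalized correlation (the cosine of the angle between two such lists) as the congruence estimate; the threshold value and all names are hypothetical.

```python
import math

# Sketch of a congruence estimate between successive samples of a column's
# output. The vector representation, the cosine measure, and the threshold
# are illustrative assumptions.

def congruence(sample_a, sample_b):
    """Normalized correlation between two output vectors (0.0 if either is silent)."""
    dot = sum(a * b for a, b in zip(sample_a, sample_b))
    norm = math.sqrt(sum(a * a for a in sample_a)) * math.sqrt(sum(b * b for b in sample_b))
    return dot / norm if norm else 0.0

def input_is_constant(samples, threshold=0.9):
    """True if every successive pair of samples is highly congruent."""
    return all(
        congruence(earlier, later) >= threshold
        for earlier, later in zip(samples, samples[1:])
    )

steady = [[1.0, 2.0, 0.0], [1.0, 2.0, 0.0], [1.1, 1.9, 0.0]]
changing = [[1.0, 2.0, 0.0], [0.0, 0.0, 3.0]]

steady_result = input_is_constant(steady)
changing_result = input_is_constant(changing)
```

On this reading, only a run of congruent samples would be allowed to contribute to the growth of S(pp) for a given pattern.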

If one postulates that the initial correlation
of pattern and receptive field units is for
filtering or feature extraction, and the subsequent
correlation of the units’ outputs with themselves
over time is for image maintenance, the
initial clarity of the stabilized image can be
accounted for. Fading occurs because units 
adapt rapidly when there is no variation in 
their input, and hence no successive samples 
on which to form correlations. This sort of 
argument is supported by the findings that 
(a) acuity improves in a stabilized image 
for times up to about one second, and in the 
same way as for a nonstabilized target (Kee- 
sey, 1960), and (b) for a well-stabilized 
image, fading starts typically at between one 
and two seconds (Barlow, 1963). This 
suggests that the "instantaneous" correlation
on the static input in fact takes some
time to be computed and starts to be degraded
after about a second. It would be
of some interest to know whether the repetition
effect can be obtained with stabilized images
or whether image motion is a necessary
condition of its occurrence.

From considering the processing of one
element, one should proceed to the problem
of processing for a whole pattern, which
may consist of a concatenation of straight 
lines in various orientations, or of other 
features such as curved lines, surfaces, points 
where two or more lines intersect, etc. An 
immediate difficulty here is that virtually 
nothing is known about the coding of pattern 
elements other than straight lines. It may 
be, as I have suggested elsewhere (Dodwell, 
1970, Ch. 4), that for other pattern elements, 
perceptual learning is a factor, in which case 
a more complicated account of the signal 
processing would be necessary. At least we 
can say that the present model is consistent 
with the stabilized image data in postulating 
a rather basic role for the processing of 
straight-line segments. 


Processing of Horizontal and Vertical Elements


There is a great deal of evidence, both 
physiological and psychophysical, to show 
that in human pattern vision, horizontal- 
and vertical-line elements play special roles. 
Much of it is summarized by Taylor (1963) 
and by Dodwell (1970). It is perhaps surprising
that Hubel and Wiesel have consistently
reported that the relative frequencies
of their oriented line detectors are
essentially random with respect to orienta- 
tion when mapped for static displays (Hubel 
& Wiesel, 1962, 1968). Given the greater 
acuity for lines in the horizontal and verti- 
cal orientations, the higher evoked cortical 
potentials for horizontal and vertical as 
opposed to diagonally oriented gratings 
(Maffei & Campbell, 1970), and similar 
findings, one might expect there to be a
preponderance of horizontal and vertical detector
elements.² An alternative possibility
is that the correlations and autocorrelations
are performed primarily on the horizontal
and vertical components of image shift. This
would be in harmony with the known high
acuity in these orientations, with Andrews’
(1967) demonstration of more selective
tuning for horizontal and vertical detectors
in human vision, and with the finding of
Pettigrew et al. (1968) that at the more
complex level of processing for moving slits
in binocular correspondence (in the cat),
there is indeed the expected anisotropy with
respect to orientation in cortical detector
units. Maffei and Campbell’s data on
evoked potentials, referred to previously,
also fit well with this hypothesis.

² Admittedly there is a jump across species in
this inference; but it is not unreasonable to suppose
that a binocular system such as the monkey's
will have similar mappings to those in the
human contour coding system, hence to infer randomness
with respect to orientation for the latter
also.


Retinal Receptive Field Size and Amplitude
of Eye Movements 


If the suggested method of computing 
autocorrelations is to be plausible, one would 
need to show that receptive field sizes and 
the amplitude of eye movements are of the 
right order of magnitude for the generation 
of such functions. There is no firm evidence 
on receptive field size in the human visual 
system, but Andrews (1967) estimates that 
fields for line detector units with foveal 
representation are about nine minutes of arc 
long. No one knows how closely packed 
the units may be, or how elongated, but a 
reasonable guess would be that such fields 
would be about one minute of arc across,
so that the central region of a line detector
might be between 10 seconds and 30 seconds of arc
across. The average amplitude of nystagmus
is around 15 seconds to 20 seconds of arc, a
value which is large enough to ensure that 
pattern elements would be shifted from “on” 
to “off” regions of single receptive fields, 
and since fields almost certainly overlap a 
good deal, to shift maximum firing rates 
from unit to unit within a column. In this 
connection, the fact that there is a sudden 
reduction in fading when image motions on 
the retina with amplitudes over about 20 
seconds are allowed to occur (Ditchburn, 
Fender, & Mayne, 1959) looks significant. 
Also, Krauskopf (1957) showed that the 
lower frequency components of eye move- 
ments (under 10 per second) are most 
effective in maintaining target visibility.
Thus it seems that, as far as it goes, the
evidence is consistent with the possibility of
autocorrelation generation within the receptive
field contour coding system.
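The order-of-magnitude argument can be checked with simple arithmetic. The sketch assumes, purely for illustration, that a line element starts at the middle of the central region of a receptive field and is displaced at right angles to its orientation; it then leaves that region whenever the displacement is at least half the region's width. The function name is hypothetical, and the figures are the estimates quoted in the text.

```python
# Arithmetic sketch, assuming a line centred in the central region and a
# displacement perpendicular to it. Figures are in seconds of arc.

def leaves_central_region(region_width_arcsec, shift_arcsec):
    """True if a centred line is carried to or past the edge of the central region."""
    return shift_arcsec >= region_width_arcsec / 2

central_widths = (10, 30)      # estimated range for the central region
nystagmus_amplitudes = (15, 20)  # average amplitude range quoted in the text

results = {
    (width, shift): leaves_central_region(width, shift)
    for width in central_widths
    for shift in nystagmus_amplitudes
}
```

On these assumptions even the smallest quoted amplitude carries a centred line to the edge of the widest estimated central region, which is consistent with the claim that nystagmus is large enough to move pattern elements between "on" and "off" regions.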


EVOLUTION OF THE SYSTEM 


The proposed type of organization can be 
viewed as the result of developments in a 
structure subjected to two conflicting evolu- 
tionary pressures. The biological utility to 
an organism of improved acuity and capacity 
for detail vision is clear enough; so are the 
advantages of having eyes which can be 
moved rapidly and precisely to orient toward 
and fixate on visual features in widely sepa- 
rated directions from the present line of 
regard. The conflict arises because the for- 
mer property would lead one to expect 
structures with large receptor surfaces to 
evolve, which could be maintained in a 
fixed and steady orientation; the second 
should lead to the development of small 
structures controlled by powerful, but pre- 
cisely gradable, muscles. In the former case, 
large structures with a low power-to-weight 
ratio of extrinsic muscles to eye would be 
suitable. In the latter, a high power-to- 
weight ratio is desirable, but with several 
strong muscle systems controlling the eye 
simultaneously, and under antagonistic ten- 
sions, fine tremors of the structure are in- 
evitable. The tremors could be damped out 
by increasing the moment of inertia of the 
eye, but this would defeat the whole purpose 
of the system. Development of processes 
whereby the muscle tremors themselves be- 
came an integral part of the signal extraction 
system for contour details presents an elegant 
compromise between the two conflicting 
specifications for an efficient visual appara- 
tus described above. From this point of 
view, the proposed model makes good sense 
biologically. 

Although it would be difficult to ar- 
gue that the human visual system evolved 
precisely to handle the sorts of information 
processing to which experimental psycholo- 
gists subject it, one can argue that the limi- 
tations on storage and sequential processing 
(other than sequential spatial scanning which 
appears to develop from reading habits) are 
indeed results of evolutionary pressures very 
similar to those just discussed. A time- 
integration system is efficient for detection 
of weak visual signals, but time lag is also 
inefficient from the point of view of processing 


P. C. DODWELL 


of rapidly changing inputs (the prob- 
lem presented at the beginning of this 
paper). In the following section it is shown 
that a correlational model solves the prob- 
lem of these conflicting desiderata too. 


CLARITY, STORAGE, AND INFORMATION 
PROCESSING 

Convincing evidence for a short-term 
visual store was provided by Sperling 
(1960) using a partial-report technique for 
arrays of letters; other investigators have 
since found evidence for such a process 
under a wide variety of conditions (e.g., 
Averbach & Coriell, 1961; Dodwell & Engel, 
1963; Keele & Chase, 1967; Eriksen & Col- 
lins, 1967; Standing & Dodwell, in press; 
Standing, Haber, Cataldo, & Sales, 1969). 
Most of the evidence is consistent with the 
idea that even for very brief stimulus dura- 
tions, short-term visual storage lasts for 200 
to 300 milliseconds, the amount of persist- 
ence depending to some extent on other ex- 
posure conditions and the observer's state 
of adaptation. As far as the autocorrelational 
interpretation is concerned, two points 
should be made: first, it has been shown by 
direct measures of short-term visual storage 
that the actual duration of the stimulus ex- 
posure may be relatively unimportant; as 
long as the blank interval between repeated 
supraliminal presentations of a display is less 
than about 250 milliseconds, observers re- 
port that it does not fade completely (Haber 
& Standing, 1969). Second, summative 
effects between repetitions of subthreshold 
displays, that is to say, displays which are 
below recognition threshold for the informa- 
tion they contain, can occur over intervals 
as long as 900 milliseconds, when the image 
certainly fades completely between presenta- 
tions (Standing et al., 1969). Taken to- 
gether, these findings show that perceptual 
persistence is not simply a function of stimu- 
lus energy, and that useful information can 
be stored well beyond the time for phenome- 
nal persistence (cf. also Daw, 1962). It is 
reasonable to propose that whatever is [text illegible in source] 
the spatial pattern information 
in the display is stored [text illegible in source] 
input display, and that 
S(pp) or some close 
analogue of it would 
be a plausible candidate. There are some 
problems with this idea, however: S(pp) 
is simply a number or quantity which ex- 
presses a property of a pattern, but does not 
reflect any of the specific pattern features. 
The autocorrelation function itself contains 
at least a record of the spatial frequencies 
in a pattern, so that it or its Fourier trans- 
form could be the storage medium. Such a 
notion is reinforced by the fact that there is 
accumulating evidence to show that at least 
under certain conditions the visual system 
operates as a spatial frequency analyzer 
(e.g., Campbell & Kulikowski, 1966; Julesz 
& Stromeyer, 1970). 

There is evidence that random noise fields 
and other masks interfere with information 
processing in the visual system by stopping 
processing of earlier presented material 
rather than by summating with a given input 
(Averbach & Coriell, 1961; Haber & Stand- 
ing, 1968); a good criterion for deciding 
whether or not successive rapid inputs to 
the visual system are spatial transformations 
of the same pattern, to be processed together 
as equivalent displays, would be a criterion 
which preserves some informational features 
which are invariant under translation on the 
array; correlations between successive time 
samples on the visual input seem to fit the 
bill. Where the same pattern is present in 
successive time samples, an autocorrelation 
function would be built up; where different 
patterns follow each other in sequence, mu- 
tual interference or partial integration would 
occur, depending on the patterns' similarities, 
which in turn are estimated in terms of 
cross-correlations between successive time 
samples. Development of this notion might 
provide some new insights into the effective- 
ness of different sorts of visual mask, and 
resolution of the controversy about the prin- 
cipal mechanisms of masking (Kahneman, 
1968). It can be applied also to the recog- 
nition of characters in dynamic visual noise 
(Uttal, 1969a, 1969b). 
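
As a rough illustration of the translation-invariance argument above, the sketch below treats a display as a one-dimensional binary array that wraps around at its ends. Both simplifications, and all function names, are assumptions made for the example and are not part of the model as stated. It shows that the circular autocorrelation of a pattern is unchanged when the pattern is shifted, and that the peak cross-correlation between two time samples is one possible index of their similarity.

```python
def circular_autocorrelation(pattern):
    """Autocorrelation at every lag, with the array wrapping around at its ends."""
    n = len(pattern)
    return [sum(pattern[i] * pattern[(i + lag) % n] for i in range(n))
            for lag in range(n)]


def peak_cross_correlation(a, b):
    """Largest circular cross-correlation over all lags (a and b of equal length)."""
    n = len(a)
    return max(sum(a[i] * b[(i + lag) % n] for i in range(n))
               for lag in range(n))


def translate(pattern, k):
    """Shift a pattern k positions to the right, wrapping around."""
    k %= len(pattern)
    return list(pattern[-k:] + pattern[:-k]) if k else list(pattern)


# Hypothetical displays: 1 marks a contour element, 0 marks blank field.
target = [1, 1, 0, 1, 0, 0, 0, 0]
moved = translate(target, 3)          # same pattern at a new position
other = [1, 0, 1, 0, 1, 0, 0, 0]      # a different pattern

# The autocorrelation function ignores where the pattern sits on the array.
same_function = circular_autocorrelation(target) == circular_autocorrelation(moved)

# Cross-correlation is highest when successive samples contain the same pattern.
match_with_moved = peak_cross_correlation(target, moved)
match_with_other = peak_cross_correlation(target, other)
```

In this toy case the shifted copy matches the target perfectly at some lag, whereas the different pattern matches only partially, which is the sense in which cross-correlation could decide whether successive inputs are to be processed together.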

The idea of time sampling is supported by 
evidence for the quantization of perceptual 
time (Dodwell & Standing, 1970; Kristof- 
ferson, 1967), and measures of the size of 
the perceptual moment should tell us some- 
thing about the time required to form cross- 
correlations and autocorrelations. On the 
basis of simultaneity judgments and masking 
of single flash stimuli, 40-50 milliseconds 
appears to be the right order of magnitude 
for this. However, for more complex ar- 
rays, longer integration periods would be 
probable, so that 40 milliseconds is a lower 
bound for this type of processing. Evidence 
on integration of pattern information in 
letter displays, for example, provides esti- 
mates of at least 100 milliseconds (Eriksen 
& Collins, 1967). 

At a somewhat greater remove, it is worth 
noting that performing autocorrelations on 
time series to extract signal characteristics 
from noise has long been a standard tech- 
nique in communications research, and has 
obvious possibilities in audition, for example, 
in the detection of pure tones masked by 
noise. The finding that short-term memory 
for letters and numbers presented visually 
seems to be echoic in character (Conrad, 
1964) suggests again the possibility of low- 
level signal processing and storage in terms 
of the physical features of sensory input, in 
this case with some form of cross-modality 
coupling; perhaps "covert rehearsal" of audi- 
tory material is a form of autocorrelation. 
However, these notions are at best sugges- 
tive. To give substance to the model, one 
must show that some fairly specific predic- 
tions follow from it, predictions which do not 
follow from other possible models for per- 
ceptual clarity, information processing, and 
storage. 


SOME PREDICTIONS FROM THE MODEL 


The predictions to be made rely heavily 
on the assumption that autocorrelations are 
performed primarily on the horizontal and 
vertical components of visual inputs in the 
manner described earlier and that because 
of the physiological mechanism involved, 
finite amounts of time are needed to form 
the autocorrelation. 

1. The rate of growth of clarity should 
be a decreasing monotonic function of pat- 
tern complexity, where complexity is de- 
scribed by the amounts of contour in a dis- 
play and their horizontal and vertical spatial 
frequencies. Studies of the repetition clarity 
effect to date have all used letter displays, 
and complexity as here defined has not been 
manipulated explicitly. The prediction is 
that for very "simple" patterns such as a line 
grating (where a single horizontal or ver- 
tical spatial frequency is present), rate of 
growth in clarity should be very rapid, 
whereas for a random grid, or a random 
dot pattern such as a Julesz field, it should 
be slow. Some informal observations on 
Julesz fields (Psotka, personal communica- 
tion, 1970) indicate that growth of clarity 
is indeed very slow for such displays. 

2. For a given display, such as a letter 
or a sequence of letters, the rate of growth 
of clarity should depend on the particular 
type font used: a format which consisted 
mainly of horizontal and vertical contours 
should both show greater initial clarity and 
achieve an asymptotic level more rapidly 
than one which consisted of curves and 
oblique lines and contained numbers of ir- 
relevant features. As a corollary, rate of 
growth of clarity for handwritten script 
should be lower than for machine-typed (or 
hand-typed) letters. 

3. Variations in type font from presenta- 
tion to presentation should slow down the 
growth of clarity. There is one piece of 
evidence which goes against this prediction 
(Haber, Standing, & Boss, 1970); however, 
the curve published by these authors (Fig- 
ure 1 of their paper) suggests that when 
typeface and position of letters are varied, 
the rate of growth in clarity is slowed down, 
although the effect was not reliable in their 
experiment. Earhard and Fullerton (1969) 
have shown that under certain rather re- 
strictive conditions, variation in line width 
in printed letters does not become easier to 
detect over repeated trials, although in- 
dividual letters do become clearer. A more 
thorough test of the hypothesis seems war- 
ranted. 

4. Introduction of visual noise into a dis- 
play should slow down growth in clarity, as 
should introduction of irrelevant displays 
between presentations of the target display. 

5. In a discrimination paradigm, where 
two different displays have to be distin- 
guished from each other in repeated presen- 
tations, some interference with the growth 
of clarity might be expected. However, this 
prediction cannot be made with conviction, 
since one does not know to what extent in- 
formation about independent but concurrent 
displays can be stored simultaneously with- 
out mutual interference. This question 
would be worth investigating in its own 
right. 

6. Following 5, if different noise fields are 
used to mask successive presentations of the 
same target, the effectiveness of the mask 
should decline over trials. This could in 
fact constitute a sensitive test of the extent 
to which masking actually stops visual proc- 
essing. 

7. The effectiveness of a mask with re- 
spect to a given target should depend on 
their structural similarities. This is a very 
vague statement; in fact the systematic in- 
vestigation of mask-probe features should 
be a good way of determining what the di- 
mensions of structural similarity are. As a 
first guess, most 
effective masking should occur when probe 
and mask both contain several spatial 
frequencies, but share none in common. 

8. Stereoptic depth should grow in clarity 
on repeated presentations, and should be 
some function of the changes in clarity of 
the monocular displays. For example, the 
growth in clarity for stereo depth should be 
more rapid if either or both monocular dis- 
plays have reached full clarity (by repeated 
single or alternating monocular exposures) 
than if dichoptic stimulation is used from 
the start of a series. 

9. Reversals (up-down and left-right) of 
patterns—even complex ones containing 
nonhorizontal or vertical components— 
should interfere with growth in clarity much 
less than small rotations of the display. This 
follows from the fact that the postulated 
autocorrelation function extracts spatial fre- 
quency components along the horizontal and 
vertical axes, components which are not af- 
fected by reversals, but are changed by 
rotations. The prediction is obviously 
restricted to patterns which are not 
completely identical under rotation (as 
a set of concentric circles is). 

10. Some auditory analogues suggest 
themselves, for example, repetition 
effects for tones and spoken material pre- 
sented in noise. It would be of considerable 
interest to find out whether cross-modality 
effects on clarity and repetition could be 
obtained between vision and hearing. 
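
Prediction 9 can be illustrated with a small sketch. It assumes, purely for the example, a 4 x 4 binary display, circular (wrap-around) autocorrelation, and a "signature" made of the summed row-wise and column-wise autocorrelations; none of these choices or function names comes from the model itself. Because only quarter turns are exact on so coarse a grid, a 90-degree rotation stands in for the small rotations the prediction actually concerns.

```python
def line_autocorrelation(line):
    """Circular autocorrelation of one row or column at every lag."""
    n = len(line)
    return [sum(line[i] * line[(i + lag) % n] for i in range(n))
            for lag in range(n)]


def horizontal_autocorrelation(image):
    """Sum of the autocorrelations of all rows."""
    total = [0] * len(image[0])
    for row in image:
        for lag, value in enumerate(line_autocorrelation(row)):
            total[lag] += value
    return total


def vertical_autocorrelation(image):
    """Sum of the autocorrelations of all columns."""
    columns = [list(column) for column in zip(*image)]
    return horizontal_autocorrelation(columns)


def signature(image):
    """Horizontal and vertical components, kept separate as the model assumes."""
    return (horizontal_autocorrelation(image), vertical_autocorrelation(image))


def flip_left_right(image):
    return [row[::-1] for row in image]


def flip_up_down(image):
    return [list(row) for row in image[::-1]]


def rotate_90(image):
    """Quarter turn clockwise."""
    return [list(row) for row in zip(*image[::-1])]


# Hypothetical display: a short horizontal bar plus one isolated element.
display = [
    [1, 1, 1, 0],
    [0, 0, 0, 0],
    [1, 0, 0, 0],
    [0, 0, 0, 0],
]

unchanged_by_lr = signature(flip_left_right(display)) == signature(display)
unchanged_by_ud = signature(flip_up_down(display)) == signature(display)
changed_by_rotation = signature(rotate_90(display)) != signature(display)
```

Both reversals leave the horizontal and vertical components untouched, while the rotation exchanges them, so a system that accumulates these components across presentations would be disturbed by the rotation but not by the reversals.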

A problem exists for measuring clarity 
effects in situations where parts of a display 
are not easily named, as in a random dot 
field. Individual letters in an array can be 
named, but obviously not individual dots, or 
bars in a grating. Some other response in- 
dicator would be needed; probably a form 
of scaling such as magnitude estimation 
(Stevens, 1957) would be suitable, so that 
the problem is not insuperable. 


CONCLUSION 


The predictions are not all unique to a 
model which specifies a form of correlation 
between consecutive time samples as the 
basis for perceptual clarity, storage, and in- 
terference between rapid input processes. 
However, it is difficult to think of another 
model which would encompass them all. 
Neither a Hebbian cell-assembly model, nor 
one that postulates the extraction of spe- 
cific features from a display before synthesis 
of the to-be-recognized patterns, is incon- 
sistent with the predictions, although such 
models do not lead explicitly to them. But 
those models are not inconsistent with the 
idea of autocorrelation as a basic operation 
in signal processing either. As was main- 
tained earlier, the correlating of spatial pat- 
terns with themselves is to be thought of 
as a process which occurs "low down" in 
the system (although not peripherally, in 
the physiological sense) and, therefore, 
would be a means of determining what the 
distinctive features, or the inputs to a cell 
assembly, might be. 

No one of the lines of evidence or argu- 
ments presented—on the repetition clarity 
effect, fading and regeneration of stabilized 
images, visual acuity, short-term visual stor- 
age and masking, and biological utility—is by 
itself convincing support for the model, but 
taken together they add up to a consistent 
and quite appealing account of how percep- 
tual clarity can be explained. 

The notion that autocorrelation plays a 
fundamental role in image processing and 
storage has gained wide currency in recent 
years,³ the main emphasis having been on 
computer applications, as was earlier pointed 
out (e.g., Rosenfeld, 1969). Very recently, 
the idea has been extended in a general way 
to biological systems too (e.g., Anderson, 
1969; Barrett, 1969). In its strongest form, 
the postulation of an autocorrelational sys- 
tem claims that the brain operates like a 
hologram (van Heerden, 1968; Longuet- 
Higgins, 1968; Pribram, 1966), which may 
be reasonable considering the enormous 
long-term storage capacity—particularly for 
pictorial material—of the normal human be- 
ing. The present proposals are more modest 
in assigning to autocorrelations a specific 
function in visual processing. At the same 
time, they are probably more open to em- 
pirical test than the general holographic 
model. Tests on the specific model would, 
on the other hand, lend some support to the 
idea that autocorrelation plays a basic role 
in information processing in the central ner- 
vous system as a whole: in extracting sig- 
nals from noise, in matching an input to a 
degraded stored version, and in perceptual 
clarity. 

³ I am indebted to Joseph Psotka for pointing 
this out and for drawing the relevant literature to 
my attention. 


ORGANIZATION AND INTERFERENCE ¹ 


LEO POSTMAN 2 


University of California, Berkeley 


This paper presents evidence for the functional continuity of the products 
of subjective organization and of associative learning. A series of studies 
is reported in which the transfer effects of free recall on paired-associate 
learning were investigated. Under the paradigm used in these studies, the 
words in the successive lists remained the same for the experimental groups 
and were unrelated for the control groups. The results consistently showed 
negative transfer in the acquisition of the paired-associate list. Evidence 
was also obtained for long-term proactive inhibition in the recall of the 
transfer task. Thus, the higher order units developed in the free recall 
phase were an effective source of interference when the transfer task re- 
quired the establishment of new relations among the component items. The 
interitem dependencies established in free recall and in associative learning 
are viewed as commensurable phenomena that are likely to be governed by 
common principles. The implications of these results for the analysis of 
extraexperimental sources of interference are discussed. 


In contemporary analyses of learning and 
retention, the concepts of association and of 
organization appear to stand in op- 
position to each other. These concepts have 
their origins in different theoretical tradi- 
tions and are characteristically anchored to 
different experimental operations. The clas- 
sical paradigm for the analysis of associative 
processes has been, as the very name im- 
plies, the paired-associate task. The acqui- 
sition of the prescribed pairings signals the 
establishment of associative connections. The 
primary vehicle for the examination of or- 
ganizational processes has been the free re- 
call experiment. The development of higher 
order units is inferred from the increasing 
consistency in the order of output as learn- 
ing progresses (Tulving, 1962b). The dif- 
ferences in experimental operations and 
methods of measurement should not discour- 
age us, however, from exploring systemati- 
cally the continuity of the phenomena studied 
in these paradigmatic situations. To be successful, 

¹ This paper represents a modified version of the 
writer's Presidential Address to Division 3 of the 
American Psychological Association, Miami Beach, 
September 5, 1970. The research reported in the 
paper was supported by Grant MH 12006 from the 
National Institute of Mental Health. Thanks are 
due to Janat Parker and Sheila Burns for their 
assistance. 

such an exploration will undoubtedly 
require, as Tulving (1968) put it, a “liberal- 
ization of the orienting attitudes that lie at 
the root of traditional formulations [p. 6].” 
It is the thesis of this paper that the sub- 
jective units of free recall and the products 
of controlled associative learning are com- 
mensurable dispositions sharing common 
functional properties. To the extent that 
this thesis is valid, the question of what is 
learned should not be a source of basic dis- 
agreement between exponents of association 
and of organization. Rather, the two posi- 
tions will be seen as facing common 
ex[planatory] problems. This is not to [text illegible in source] 
principles of associa- 
tion or, for that matter, of organization. 
Given [agreement] on what is learned, the 
question remains open as to which 
formulation has greater analytic and predic- 
tive power. 


DEVELOPMENT OF INTERITEM DEPENDENCIES 

The products of free recall learning and 
of controlled associative practice can be 
readily specified within a common descrip- 
tive framework. In both cases, the results of 
training are reflected in the increased inter- 
dependence of nominal units: Given A, 
either B or C or D, etc., may be expected to 
follow with varying probabilities. This 
basic descriptive statement can be readily 
modified to accommodate extended sequences 
or even hierarchical trees of categorized 
items (Bower, Clark, Lesgold, & Winzenz, 
1969). It also encompasses the limiting case 
in which the probability is unity for (say) B 
and zero for all other alternatives. The 
measurable differences between subject-gen- 
erated and experimentally prescribed se- 
quences then reduce to characteristics of 
such probability distributions. In the former 
case, the groupings are optional and freely 
expandable; in the latter case, they are pre- 
scribed and fixed. Thus, in free recall the 
definition of the task for the subject permits 
the development of cumulative groupings 
that may be more or less rigid in their in- 
ternal ordering. The experimental arrange- 
ments and conditions of testing in paired- 
associate learning, on the other hand, im- 
pose a pairwise module on interitem de- 
pendencies. 
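
The descriptive framework of this section can be made concrete with a small sketch. The probabilities below are invented for illustration, and the use of entropy as the "characteristic" of the distribution is an assumption of the example rather than a measure proposed in the text. The point is only that a freely organized grouping and a prescribed pair can be written in the same form, with the paired associate appearing as the limiting case of unit probability.

```python
import math


def entropy_bits(distribution):
    """Uncertainty (in bits) about which item follows the cue."""
    return sum(-p * math.log2(p) for p in distribution.values() if p > 0)


# Hypothetical conditional distributions: given A, what follows?
free_recall_given_a = {"B": 0.5, "C": 0.25, "D": 0.25}   # optional, expandable grouping
paired_associate_given_a = {"B": 1.0}                     # prescribed, fixed pairing

free_recall_uncertainty = entropy_bits(free_recall_given_a)
paired_associate_uncertainty = entropy_bits(paired_associate_given_a)
```

On this description the two training procedures differ in the shape of the distribution they produce, not in the kind of thing that is learned.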

A difference in the conditions of practice 
and in the criteria of acceptable performance 
does not, of course, imply a fundamental dis- 
parity of underlying processes. For ex- 
ample, all the available evidence indicates 
that the processes of intentional and inci- 
dental learning are governed by the same 
principles (cf. Postman, 1964). There may, 
in fact, be a useful parallel here to the dis- 
tinction with which we are concerned: What 
is selected incidentally is optional, what is 
practiced intentionally is prescribed. Never- 
theless, the same functional relations appear 
to obtain for both. There is no compelling 
reason to believe that there is any greater 
discontinuity between the subjective units 
developed in free recall and the products of 
controlled associative learning. 

The apparent opposition between the con- 
cepts of organization and of association in 
the face of commensurable facts appears to 
stem in part from the divergent emphases 
that the two positions place on the factor of 
contiguity. Traditionally, association theory 
has regarded contiguity of stimulus and re- 
sponse as a necessary condition of learning. 
This assumption is reflected in the prefer- 
ence for such classical paradigms as paired- 
associate and serial learning where contigu- 
ity is built into the experimental arrange- 
ments. In modern theoretical analyses of 
association, however, contiguity has become 
a formal construct which is only precari- 
ously anchored to experimentally controlled 
conditions of physical or temporal proximity. 
We need to think only of the significance at- 
tached to mediators in the analysis of asso- 
ciative processes. Here the critical contigu- 
ities are between observed and implicit 
events. Yet the silent presence of the 
schema of literal contiguity has probably 
dampened the enthusiasm of association 
theorists for the intensive exploration of the 
phenomena of subjective organization. There 
have been notable exceptions such as Deese's 
(1959, 1960) work. His studies have the 
clear implication that interitem associations 
can generate organization. There is no con- 
ceptual barrier to an associationistic analy- 
sis of higher order groupings that develop 
among items whose contiguity relations 
change from trial to trial. 

Organization theorists have been preoccu- 
pied with another type of contiguity, namely, 
that between particular responses in succes- 
sive tests of free recall. In the interpreta- 
tion of output regularities that develop in 
spite of changing orders of input, concepts 
other than association, such as chunking and 
unitization, have been favored. Whatever 
the preferred label, output consistency is not 
likely to prove a sensitive gauge of organiza- 
tion in the broad sense of the term, that 
is, if organization is taken to denote the 
development of systematic interitem depend- 
encies. The relationship between organiza- 
tion and output consistency will be seen as 
uncertain and fallible if it is granted that 
interitem dependency need not be reflected 
in rigid response sequences. There is, in 
fact, a convincing argument to the contrary. 
As a multiplicity of interitem relations de- 
velops, an increasing number of equivalent 
pathways leading from one response to an- 
other becomes available. Thus, cohesive- 
ness of grouping may imply flexibility of 
access. It is only when organization is based 
on rigidly ordered categories, such as in the 
use of an alphabetical sequence (e.g., Tul- 
ving, 1962a), that intertrial repetition is 
likely to mirror faithfully the development 
of the schema. To speak of rote organiza- 
tion may not be a contradiction in terms! 

It is evidence of interitem dependencies, 
then, that constitutes the empirical basis for 
inferences about both associative and organ- 
izational processes. For purposes of asso- 
ciationistic analysis, controlled contiguity of 
input is not an essential requirement; the 
critical contiguities may be mediated by im- 
plicit events during the processing of the 
input. In the application of the concept of 
organization, the degree of contiguity of re- 
sponses in output probably does not reflect 
in a one-to-one fashion the amount of struc- 
ture imposed on the materials. Depending 
on the characteristics of the materials and 
the constraints of the task requirements, the 
groupings that develop may range from fixed 
pairwise linkages to a complex network, or 
an aggregate of networks such as is found 
in categorized lists. The mechanisms re- 
sponsible for the establishment of interitem 
dependencies—the development of what 
Underwood (1969) recently described as 
the associative attributes of memory essen- 
tial for retrieval —remain to be fully under- 
stood. 


SUBJECTIVE ORGANIZATION AND 
ASSOCIATIVE INTERFERENCE 


The argument for continuity of inferred 
associative and organizational processes can- 
not rest on a priori considerations alone but 
must be documented experimentally. A 
basic empirical question that grows out of 
the present analysis is whether subjective 
organization in free recall entails the devel- 
opment of linkages between discrete units 
that share the functional properties of the 
sequences established in controlled associa- 
tive learning. In support of an affirmative 
answer, it is not sufficient to point to the 
regularities in the order of output that are 
conventionally used as an index of organiza- 
tion. There remains some uncertainty as to 
whether the recurrent response sequences 
reflect the acquisition of higher order units 
or merely a systematic ordering of available 
items on successive recall trials. It has, in 
fact, been argued that there is independence 
of traces in storage and that organization is 
first imposed on recall at the time of test by 
the adoption of a retrieval plan (Slamecka, 
1968, 1969). The argument will be difficult 
to resolve as long as the evidence for or- 
ganization is sought in internal analyses of 
free recall performance. Here, as in other 
cases, a more promising approach to the 
assessment of what has been learned is pro- 
vided by tests of transfer. If subjective 
groupings are a product of learning (or a 
property of storage), it follows that after 
free recall practice there should be interfer- 
ence in the acquisition and retention of 
transfer tasks that require the disruption of 
such groupings. The logic of this approach 
starts with the strong assumption that pro- 
gress in free recall learning implies ipso facto 
the achievement of organization. Once such 
organization has occurred, it will become a 
source of interference when the transfer task 
requires the establishment of new relations 
among the component items. 

The basic operations required for a test 
of this prediction are those of a two-stage 
transfer experiment. In the first stage, a 
list of words is learned by the method of 
free recall and an opportunity is provided 
for the development of higher order group- 
ings. In the second stage, a list of paired 
associates is learned in which both the stimu- 
lus and the response terms are drawn from 
the prior free recall task. The arbitrary 
pairings of the items in the transfer stage 
should be in conflict with the groupings de- 
veloped in the course of free recall learning. 
Consequently, there should be negative trans- 
fer in the acquisition of the paired-associate 
list relative to a control condition in which 
the materials learned in the first and in the 
second stage are unrelated. This is a strong 
prediction because the associative interfer- 
ence is expected to outweigh the positive 
effects of item familiarization attendant on 
the prior free recall learning of the stimuli 
and the responses. The original organiza[tional] 
linkages will eventually be supplanted 
[text illegible in source] of the transfer task. To 
[text illegible in source] sources of [text illegible in source] 
in the context of a classical paradigm of in- 
terference at recall. 

The consideration of long-term proactive 
losses calls attention to potential implica- 
tions of our prototype experiment for the 
analysis of preexperimental sources of inter- 
ference in forgetting. The subjective group- 
ings, which are developed during the free 
recall phase of the experiment and which 
may recover during the retention interval, 
will be based on the subject’s preexperimen- 
tal verbal habits. The sequence of tasks 
may be viewed, therefore, as tapping, at 
least indirectly, preexperimental sources of 
interference in the recall of new associations 
acquired in the laboratory. It is, of course, 
a deplorable but indisputable fact that evi- 
dence for the influence of prior language 
habits on retention has been notoriously dif- 
ficult to obtain (cf. Ekstrand & Underwood, 
1965; Postman, 1963). One possible reason 
for the failure of the interference hypothesis 
in conventional studies of single-list reten- 
tion may be the discontinuity between the 
laboratory task and the subject’s normal 
verbal activities. Consequently, the learner's 
preexperimental habits are sharply differen- 
tiated from the associations prescribed in the 
laboratory. If free recall learning does, in- 
deed, entail the development of higher order 
units based on preexperimental verbal dis- 
positions, then the present design should 
favor the arousal and subsequent recovery 
of the critical linguistic habits in the context 
of the experimental situation. 


EXPERIMENT I 


The first study tested the hypothesis that 
the organization developed during the free 
recall learning of a list is a source of asso- 
ciative interference when items from that list 
are paired arbitrarily in the transfer stage. 


Method 


Design. This study comprised two experiments 
which were run concurrently. In each experiment 
free recall learning of a 20-word list was fol- 
lowed by the acquisition of a list of 10 paired 
associates. Under the experimental conditions 
the same words were used in free recall and in 
paired-associate learning; under the control condi- 
tions, the words in the two lists were unrelated. 
The two experiments differed only with respect to 
the method of practice in the paired-associate 
phase. In one case, the transfer list was learned 
by the recall method, and in the other case, by the 
multiple-choice method. Performance in the paired- 
associate phase was likely to be influenced not only 
by associative transfer but also by prior familiar- 
ization with the response terms. Hence the mul- 
tiple-choice procedure, which eliminates the re- 
quirement of response recall, was used to obtain 
as pure an estimate as possible of the associative 
component of transfer. 

Under both methods of practice, additional ex- 
perimental and control groups were included for 
which the order of the tasks was reversed, that 
is, the acquisition of the paired-associate list 
preceded free recall learning. The purpose of 
this expansion of the design was to demonstrate 
that pairwise association and item organization in 
free recall learning are not necessarily incom- 
patible.³ On the contrary, preexisting linkages 
should facilitate the development of higher order 
groupings in free recall; if so, there would be posi- 
tive transfer from paired-associate to free recall 
learning. In summary, then, the experiments in- 
vestigated transfer between free recall and paired- 
associate learning for both the possible task se- 
quences, with the pool of items remaining the same 
under the experimental treatments and changing 
under the control treatments. 

Lists. The learning materials were four lists of 
20 two-syllable nouns of high to medium fre- 
quency of usage. These lists were used equally 
often in the free recall phase. Four different lists 
of 10 paired associates, with the pairings selected 
at random, were derived from each basic 20-word 
list. Thus, there were 16 paired-associate lists 
which were used under all conditions. The four
basic lists were divided into two sets for purposes 
of assignment to the control treatments. Norma- 
tive associative relations were minimized within 
lists and between the successive lists learned by 
control Ss.

Procedure. In the free-recall phase, the words 
were presented at a 1.5-second rate in the window 
of a memory drum. The items appeared in dif- 
ferent random orders on successive study trials. 
Each presentation of the list was followed by a 
1.5-minute written test of free recall. When free 
recall learning was the first task, acquisition was 
carried to a criterion of 16/20 plus three additional 
study-test cycles. 
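
The criterion rule described above can be sketched in a few lines. This is only an illustration: the recall scores are invented, and the function names `trials_to_criterion` and `total_cycles` are hypothetical, not taken from the study.

```python
from typing import Optional, Sequence


def trials_to_criterion(scores: Sequence[int], criterion: int = 16) -> Optional[int]:
    """Return the 1-based trial on which recall first reaches the criterion.

    Returns None if the criterion is never reached.
    """
    for trial, score in enumerate(scores, start=1):
        if score >= criterion:
            return trial
    return None


def total_cycles(scores: Sequence[int], criterion: int = 16,
                 extra_cycles: int = 3) -> Optional[int]:
    """Trials to criterion plus the additional study-test cycles."""
    reached = trials_to_criterion(scores, criterion)
    return None if reached is None else reached + extra_cycles


# Invented recall scores (words correct out of 20) for one subject.
example_scores = [7, 11, 14, 16, 17, 18, 19]
```

With these invented scores the criterion of 16/20 is first met on Trial 4, so practice would end after seven study-test cycles.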

² For other relevant studies of transfer from
paired-associate learning to free recall, see Segal
and Mandler (1967) and Wood (1969, 1970).

Fig. 1. Acquisition curves for paired-associate
transfer lists in Experiment I.

In both the recall and the multiple-choice methods
of paired-associate learning, study and test
trials were alternated. The pairs were presented
in four different orders on the study trials, and
four other orders were used on the test trials.
The rate of presentation was 1.5 seconds on study
trials, and 3 seconds on test trials. The interval
between a study and a test trial was 3 seconds, and
that between a test trial and the next study trial
was 1.5 seconds. The two methods were distinguished
by the procedure on the test trials.
Under the recall method, each stimulus term was 
exposed for 3 seconds and the subject was re- 
quired to supply the correct response. On mul- 
tiple-choice test trials, four alternative responses 
were presented with each stimulus, and the sub- 
ject had to call out the correct one during the 
3-second exposure interval. All alternatives in the 
test items were responses from within the list. The 
set of incorrect alternatives accompanying a given 
correct response changed from one test order to the 
next. The subjects were instructed to respond to 
each test item, guessing if necessary. When the 
paired-associate list was learned first, acquisition 
under both methods was carried to the criterion 
of 8/10 plus three additional trials. 
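
The construction of the multiple-choice test trials can be sketched as follows. This is a hedged illustration, not the procedure actually used to prepare the materials: the placeholder items `S1`..`S10` and `R1`..`R10`, the seed, and the function name `build_test_order` are all assumptions. The sketch keeps the stated constraints: four alternatives per stimulus, all drawn from the responses in the list, with the set of incorrect alternatives changing from one test order to the next.

```python
import random


def build_test_order(pairs, rng, previous_distractors=None, n_alternatives=4):
    """Build one multiple-choice test order.

    pairs: list of (stimulus, correct_response) tuples.
    previous_distractors: mapping stimulus -> frozenset of distractors used on
        the preceding test order, so that the set can be forced to change.
    Returns (items, distractor_sets), where items is a list of
    (stimulus, alternatives, correct_response).
    """
    previous_distractors = previous_distractors or {}
    responses = [response for _, response in pairs]
    order = list(pairs)
    rng.shuffle(order)  # a new random order of the test items

    items = []
    distractor_sets = {}
    for stimulus, correct in order:
        others = [r for r in responses if r != correct]
        # Resample until the distractor set differs from the preceding order.
        while True:
            distractors = frozenset(rng.sample(others, n_alternatives - 1))
            if previous_distractors.get(stimulus) != distractors:
                break
        alternatives = [correct] + sorted(distractors)
        rng.shuffle(alternatives)  # position of the correct answer varies
        items.append((stimulus, alternatives, correct))
        distractor_sets[stimulus] = distractors
    return items, distractor_sets


# Placeholder 10-pair list; the actual words are not reproduced here.
example_pairs = [(f"S{i}", f"R{i}") for i in range(1, 11)]
rng = random.Random(1971)  # arbitrary seed, for reproducibility only
first_items, first_sets = build_test_order(example_pairs, rng)
second_items, second_sets = build_test_order(example_pairs, rng, first_sets)
```

With four alternatives, a subject guessing on every item would be expected to get one item in four correct, or 2.5 of the 10 pairs per test trial.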

The transfer task, regardless of whether it was 
a free recall or a paired-associate list, was always 
practiced for 10 trials. The interval between the 
first and the second list was 3 minutes.

Subjects. There were 16 subjects in each of the 
eight groups. The subjects were undergraduate 
students at the University of California who were 
not necessarily naive to verbal learning experi- 
ments. The assignment to conditions was in 
blocks of eight, with one subject from each condi- 
tion per block. The running order within blocks 
was determined by means of a table of random 
numbers.
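
The blocked assignment just described can be illustrated with a short sketch. It assumes that the eight groups are the crossing of method of practice, task order, and transfer condition, and it substitutes a seeded pseudorandom generator for the table of random numbers. The names `CONDITIONS` and `blocked_schedule` are hypothetical.

```python
import random
from collections import Counter
from itertools import product

METHODS = ("recall", "multiple-choice")
TASK_ORDERS = ("free recall first", "paired associates first")
TREATMENTS = ("experimental", "control")

# Eight groups: every combination of method, task order, and treatment.
CONDITIONS = list(product(METHODS, TASK_ORDERS, TREATMENTS))


def blocked_schedule(n_blocks=16, seed=0):
    """Return n_blocks blocks, each a random running order of all conditions."""
    rng = random.Random(seed)  # stand-in for a table of random numbers
    schedule = []
    for _ in range(n_blocks):
        block = list(CONDITIONS)
        rng.shuffle(block)  # running order within the block
        schedule.append(block)
    return schedule


schedule = blocked_schedule()
group_sizes = Counter(condition for block in schedule for condition in block)
```

Sixteen blocks of eight give 16 subjects per group, with every group represented once in each block.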


Results 


First-list learning. When the first task 
was free recall learning, the mean number 
of trials to the criterion of 16/20 was 4.55. 
For the initial paired-associate task, the 
mean number of trials to the criterion of 
8/10 was 5.38 under the recall method and 
2.78 under the multiple-choice method.*
This difference was significant (t = 3.31,
df = 62, p < .01). There were no reliable
differences between the groups treated alike
in the first phase.

* In a previous study (Postman & Stark, 1969),
in which the criterion was one errorless trial, no
significant difference in first-list learning was found
between the recall and the multiple-choice method.

Transfer. We turn now to the measures 
of performance in the test stage. Transfer 
from free recall to paired-associate learning 
will be considered first. In Figure 1, the 
acquisition curves for the recall method are 
shown on the left, and those for the mul- 
tiple-choice method on the right. Under 
both procedures, the control group surpassed 
the experimental group. The differences 
were substantial and persistent. In an analysis
of the total numbers of correct responses
during second-list learning, the main effect
of the condition of transfer was significant
(F = 11.81, df = 1/60, p < .01). Acquisition
was faster under the multiple-choice
than under the recall method (F = 10.32,
df = 1/60, p < .01). However, the negative
transfer effects were clearly in evidence in 
each case. There was no reliable interaction 
between the method of testing and the condition
of transfer. The contribution of response
familiarization to transfer performance
can be gauged from a comparison of
the two sets of curves. For the recall
method, the amount of negative transfer in- 
creased over several trials of test-list learn- 
ing; the positive component of response 
learning initially masked some of the associa- 
tive interference that was present. For the 
multiple-choice method, on the other hand, 
the separation between the experimental 
and the control group was maximal on the 
very first trial. The consistency of the results
obtained by the two methods strongly
supports the inference that prior list organization
was a source of associative interference.

TABLE 1

MEAN PERCENTAGES OF INTRALIST ERRORS DURING
LEARNING BY THE RECALL
METHOD IN EXPERIMENT I

Condition       Misplaced responses   Stimulus intrusions   Total
Experimental           10.1                   5.3            15.6
Control                 1.4                   1.6             3.3

It appears that ... ited to the early stage of a ...

An analysis of overt errors under the recall
procedure yielded results entirely consistent
with the findings on acquisition. As
Table 1 shows, the mean percentages of
intralist errors were higher under the experimental
than under the control treatment
(t = 2.30, df = 30, p < .05) (after arcsine
transformation). The separate rates of misplaced
responses and intrusions of stimulus
terms are also shown. The values were
higher for the experimental than for the control
group in both cases. The differences in
the percentages of stimulus intrusions are 
particularly diagnostic. The prior occur- 
rence of all items in a single free recall list 
impeded the differentiation between the stim- 
ulus and the response terms in the paired- 
associate task. Such failures of differentia- 
tion probably served to depress the speed of 
learning. They cannot, however, be re- 
garded as an essential condition of negative 
transfer since they could not occur under the 
multiple-choice procedure. In the latter case 
the deficit in performance reflected directly 
the component of associative interference.

Transfer from paired-associate to free recall
learning will be considered next. The
acquisition curves for the test list are shown 
in Figure 2. Under both methods of prac- 
tice, the experimental group was clearly su- 
perior to the control group. The separation 
was large on the initial trials but diminished 
as test-list learning continued and the ceiling 
of performance was approached. There was 
a slight reversal in the terminal stage of 
acquisition, particularly for the multiple- 
choice groups. The overall difference between
the two treatments in the total number
of correct responses was significant (F
= 12.79, df = 1/60, p < .01). There was
again no reliable interaction of the method
of practice with the condition of transfer.
The prior establishment of pairwise associations
facilitated free recall learning but probably
did not make for optimal list organization
in the long run.

Fig. 2. Acquisition curves for free recall
transfer lists in Experiment I.

There is no question but that the prior
associations were utilized by the experimental
subjects in free recall. The pairs
from the previously learned list appeared as 
clusters in the subjects’ output throughout 
free recall learning. The number of clusters 
in excess of chance (cf. Bousfield & Bous- 
field, 1966) increased as a function of trials, 
tracking the overall level of performance 
(Figure 3). For the control subjects, the 
frequency of occurrence of the same pairs 
remained, of course, close to zero through- 
out. It is interesting to note that the level 
of clustering was higher after recall than 
after multiple-choice training. The com- 
mon mode of responding—active reproduc- 
tion—encouraged the continued use of the 
same retrieval cues.
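
A cluster score "in excess of chance" can be illustrated as observed minus expected adjacencies. The sketch below assumes that the expected value is the one that follows from a random ordering of the recalled items: with n items recalled, of which m_i come from unit i, the expected number of adjacent same-unit repetitions is the sum of m_i(m_i - 1) divided by n. Whether this matches every detail of the scoring used in the experiment is an assumption. The function name `cluster_excess` and the letters standing in for words are hypothetical.

```python
from collections import Counter


def cluster_excess(output, unit_of):
    """Observed minus chance-expected adjacent repetitions of same-unit items.

    output: recall protocol, a list of distinct items in order of recall.
    unit_of: mapping item -> unit label (here, the previously learned pair).
    """
    n = len(output)
    if n < 2:
        return 0.0
    # Observed: adjacent items in the output that belong to the same unit.
    observed = sum(unit_of[a] == unit_of[b] for a, b in zip(output, output[1:]))
    # Expected under a random ordering of the n recalled items.
    counts = Counter(unit_of[item] for item in output)
    expected = sum(m * (m - 1) for m in counts.values()) / n
    return observed - expected


# Three hypothetical pairs; letters stand in for the words.
pair_of = {"A": 0, "B": 0, "C": 1, "D": 1, "E": 2, "F": 2}
example_output = ["A", "B", "E", "C", "D"]
example_score = cluster_excess(example_output, pair_of)
```

In the example two intact pairs appear as clusters, while 0.8 adjacencies would be expected by chance, so the score is 1.2.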

Fig. 3. Mean cluster scores for previously learned
pairs during acquisition of free recall transfer lists
in Experiment I.

The critical finding is the asymmetry of
the transfer effects between free recall and
paired-associate learning: there was interference
when free recall occurred first, and
facilitation when it occurred second. While
the products of practice were carried over 
from one task to the other in each case, the 
sign of the transfer effects depended on 
whether or not the interitem relations estab- 
lished in the first phase remained appropri- 
ate in the second phase. 


Experiment II 


The next study continued the investiga- 
tion of prior list organization as a source of 
interference. Attention was focused on the 
development of proactive inhibition as well 
as on negative transfer in acquisition. 


Method 


Design. The prototype design—free recall learn- 
ing followed by the acquisition of a paired-asso- 
ciate list—was extended in two ways. First, in 
order to vary the potential degree of organization 
in free recall, both categorized and noncategorized 
lists were used as learning materials. Second, 
in order to assess the proactive influence of prior 
free recall learning on the retention of the transfer 
task, tests of retention for the paired-associate list 
were added. These tests were given either 1.5 
minutes or 48 hours after the end of learning. 
Thus, the total design was a 2 × 2 × 2 factorial,
with 2 types of materials (categorized vs. non- 
categorized), 2 conditions of transfer (experi- 
mental vs. control), and 2 retention intervals (1.5 
minutes vs. 48 hours). The conditions of transfer
were defined as before: The items used in free 
recall and in paired-associate learning were the 
same under the experimental treatment and were 
unrelated under the control treatment. 
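
The factorial layout can be enumerated directly, which also gives a check on the degrees of freedom reported in the Results. The sketch takes the group size of 16 from the Subjects paragraph of this experiment; the variable names are illustrative only.

```python
from itertools import product

MATERIALS = ("categorized", "noncategorized")
TRANSFER = ("experimental", "control")
RETENTION_INTERVAL = ("1.5 minutes", "48 hours")
SUBJECTS_PER_GROUP = 16  # as stated in the Subjects paragraph

# Every combination of the three two-level factors: 2 x 2 x 2 = 8 groups.
cells = list(product(MATERIALS, TRANSFER, RETENTION_INTERVAL))
n_subjects = len(cells) * SUBJECTS_PER_GROUP

# Within-cell degrees of freedom for a between-subjects analysis of variance.
error_df = n_subjects - len(cells)
# Degrees of freedom for a t test comparing two halves of the full sample.
two_group_t_df = n_subjects - 2
```

These values, 120 and 126, agree with the df = 1/120 reported for the F ratios and the df = 126 reported for the comparison of the two types of material in first-list learning.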

Lists. The learning materials were six lists of 
20 two-syllable nouns, four noncategorized and 
two categorized. The noncategorized lists were 
the same as those used in Experiment I. The 
categorized lists comprised four instances each of 
five taxonomic categories. Three of the categories
were exhaustive (e.g., the four seasons of the year) 
and two were nonexhaustive (e.g., names of countries).
In the construction of the paired-associate
lists, noncategorized items were paired randomly, 
as was done in the previous experiment. In the 
case of the categorized materials, half the items 
from a given category appeared as stimuli and 
half as responses in the paired-associate list. Members
of the same category were never paired.
Four different sets of paired associates were de- 
rived from each 20-word list. Thus, there were 
16 lists composed of noncategorized items and 8 
composed of categorized ones. Lists were fully 
balanced over the appropriate portions of the 
design.

Procedure. The procedure in the free recall
phase was exactly the same as in Experiment
I; acquisition was again carried to the criterion
of 16/20 plus three additional study-test cycles.
The learning instructions made no reference to 
interitem relations and were exactly the same for 
the noncategorized and the categorized lists. In 
the latter case, the items from the different cate- 
gories were presented in random order. 

The paired-associate list was learned to the 
criterion of 8/10 by the recall method; the procedure
was identical with that under the comparable
conditions in Experiment I. The test of
retention consisted of one recall trial without
feedback, followed by five relearning trials.

Subjects. There were 16 subjects in each of the
eight groups. The subjects were drawn from the
same population as those in Experiment I and
assigned to conditions in the same manner as
before.


Results 


First-list learning. The mean number of 
trials to the criterion of 16/20 was 4.22 for 
the noncategorized lists and 1.48 for the 
categorized lists. The difference between the 
two types of materials was significant (t =
8.28, df = 126, p < .01). There were no
reliable differences between groups learning
the same type of list.

The consistency of the order of output 
was, of course, much greater for the categorized
than for the noncategorized material.
The numbers of intertrial repetitions (Bousfield
& Bousfield, 1966), in both the forward
and the reverse order, were determined
for the last three trials of acquisition. The 
mean score (obtained minus expected value) 
per trial was 6.13 for the categorized lists 
and 2.36 for the noncategorized lists (t =
9.3…, df = 125, p < .01). In the former
case, there was a high degree of clustering
by categories; the number of categorical
repetitions in excess of chance was
9.68 per trial. The high level of clustering
indicates that the subjects readily discovered 
the categorized nature of the list. It can be 
assumed that the taxonomic groupings were 
well established when the subjects were
shifted to the paired-associate phase. 
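
An intertrial repetition score of the kind described above (obtained minus expected value, counting both forward and reverse order) can be sketched as follows. The expected value used here is the one that follows from random ordering: with h and k items recalled on two successive trials and c items common to both, c(c - 1)/(hk) same-order repetitions are expected, and twice that number when either order counts. That this corresponds in every detail to the scoring applied in the experiment is an assumption. The function name `itr_excess` and the letters standing in for words are hypothetical.

```python
def itr_excess(previous, current, bidirectional=True):
    """Observed minus expected intertrial repetitions for two recall orders.

    previous, current: lists of distinct items in order of recall on two
        successive trials.
    bidirectional: count a pair as repeated in either forward or reverse order.
    """
    h, k = len(previous), len(current)
    if h == 0 or k == 0:
        return 0.0
    c = len(set(previous) & set(current))  # items recalled on both trials

    previous_pairs = set(zip(previous, previous[1:]))
    observed = 0
    for a, b in zip(current, current[1:]):
        if (a, b) in previous_pairs:
            observed += 1
        elif bidirectional and (b, a) in previous_pairs:
            observed += 1

    expected = c * (c - 1) / (h * k)
    if bidirectional:
        expected *= 2
    return observed - expected


trial_n = ["a", "b", "c", "d", "e"]
trial_n_plus_1 = ["b", "a", "c", "d", "f"]
both_orders = itr_excess(trial_n, trial_n_plus_1)
forward_only = itr_excess(trial_n, trial_n_plus_1, bidirectional=False)
```

In the example the pair a-b recurs in reverse order and c-d in forward order, so two repetitions are observed against 0.96 expected when both orders are counted.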

Transfer. Figure 4 shows the acquisition
curves for the transfer task. Successive criteria
on the paired-associate list are plotted
as a function of trials. ... the customary locations
... the figure ... (Hovland & Kurtz, 1952).
This method of displaying
the course of acquisition was chosen
in order to permit a ready comparison with 
the transfer measures obtained in Experi- 
ment I (see Figure 1). 

There are two main findings. First, the 
pairs composed of mismatched members of 
taxonomic categories were more difficult to 
learn than the pairs derived from a pool of 
unrelated words. The difference in trials to 
criterion was significant (F = 32.30, df =
1/120, p < .01). Second, for both types of 
list the test task was learned more slowly 
under the experimental than under the control
condition (F = 17.81, df = 1/120, p <
.01). The amount of negative transfer was
somewhat greater for the categorized than 
for the noncategorized lists, but not reliably 
so. The results obtained with lists of un- 
related words replicate the findings of the 
previous experiment. The presence of paral- 
lel effects for the categorized lists adds an 
important new dimension to our analysis of 
the conditions of associative interference— 
the degree of arousal of prior language habits 
in the experimental context. 

Fig. 4. Acquisition curves for transfer lists in
Experiment II. (Successive criteria are plotted
as a function of trials.)

TABLE 2

MEAN PERCENTAGES OF INTRALIST ERRORS DURING
ACQUISITION OF CATEGORIZED AND NONCATEGORIZED
TRANSFER LISTS: EXPERIMENT II

Condition              Misplaced responses   Stimulus intrusions   Total
Categorized list
  Experimental                12.7                  12.9            25.6
  Control                     10.3                   7.4            17.7
Noncategorized list
  Experimental                10.8                   7.6            18.4
  Control                      4.6                   2.3             6.9

On the face of it, no differences between
the experimental and the control subjects
learning the categorized test list might have
been expected. The basic source of associative
interference in the acquisition of that
list was the mismatching of closely related
items. The categorically appropriate response
to each stimulus was present in the
list and remained in sustained competition
with the inappropriate prescribed one. The
resulting interference stems from highly 
overlearned linguistic habits which both ex- 
perimental and control subjects presumably 
brought to the laboratory. Nevertheless, 
there was a substantial amount of negative 
transfer. The interpretation of this fact is 
that the prior free recall practice served to 
activate the relevant linguistic habits in the 
experimental context and thus to increase 
their persistence and interfering potential 
in the transfer stage. 

As Table 2 shows, the percentages of 
intralist errors were higher for the experimental
than for the control groups (F =
20.63, df = 1/120, p < .01); they were also
greater for the categorized than for the noncategorized
lists (F = 17.93, df = 1/120, p
< .01). These differences obtained for both
misplaced responses and stimulus intrusions. 
In the case of the categorized lists, “appro- 
priate” intralist intrusions are of special 
interest, that is, the occurrence of er- 
roneous responses which belonged to the 
same taxonomic category as the stimulus 
terms. The percentage of such intrusions, 
to the base of all overt responses, was reli- 
ably higher for the experimental than for 
the control group (3.1 vs. 1.8). However, 
relative to the total number of errors of com- 
mission, the frequencies of appropriate in- 
trusions were comparable for the two groups 
(12 vs. 13%). Thus, the interference was 
more persistent under the experimental than 
under the control treatment, but the pattern 
of errors was similar under both conditions.

TABLE 3

MEAN NUMBERS CORRECT ON TESTS OF RETENTION
FOR PAIRED-ASSOCIATE LISTS IN
EXPERIMENT II

                    Categorized list           Noncategorized list
Condition        1.5 minutes   48 hours      1.5 minutes   48 hours
Experimental         7.8          4.3            7.1          4.5
Control              7.6          6.1            8.6          4.7

Recall. The mean numbers of items recalled
on the immediate and the delayed
tests of retention are shown in Table 3. For
the categorized material, the loss over the 
48-hour period was greater under the ex- 
perimental than under the control treat- 
ment. For the noncategorized material, 
there was a slight trend in the opposite di- 
rection. The higher order interaction, which 
reflects the difference in the amount of long- 
term proactive interference for the two types 
of lists, was significant (F = 6.14, df = 1/120,
p < .02). Further analysis showed
that the interaction of the condition of 
transfer with the length of the retention in- 
terval was significant for the categorized 
lists (p < .02) but not for the noncategorized
lists.
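
The pattern behind the higher order interaction can be made concrete by computing the 48-hour losses from the means in Table 3. The sketch below is descriptive arithmetic only, not a reanalysis. The cell printed as "45" in the scanned table is read as 4.5, which is an assumption, and the names `loss` and `proactive_effect` are hypothetical.

```python
# Mean numbers correct from Table 3: (1.5 minutes, 48 hours).
TABLE_3 = {
    ("categorized", "experimental"): (7.8, 4.3),
    ("categorized", "control"): (7.6, 6.1),
    ("noncategorized", "experimental"): (7.1, 4.5),  # "45" read as 4.5 (assumed)
    ("noncategorized", "control"): (8.6, 4.7),
}


def loss(material, condition):
    """Drop in mean recall from the immediate to the 48-hour test."""
    immediate, delayed = TABLE_3[(material, condition)]
    return round(immediate - delayed, 1)


def proactive_effect(material):
    """Extra 48-hour loss under the experimental treatment relative to control."""
    return round(loss(material, "experimental") - loss(material, "control"), 1)


categorized_effect = proactive_effect("categorized")
noncategorized_effect = proactive_effect("noncategorized")
# Difference between the two proactive effects: the three-way pattern.
interaction_contrast = round(categorized_effect - noncategorized_effect, 1)
```

For the categorized lists the experimental group lost 2.0 more items than the control group, while for the noncategorized lists it lost 1.3 fewer, a difference of 3.3 items between the two types of material.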

It may be useful to restate the nature of 
the higher order interaction in a different 
way. Under the control condition, the cate- 
gorized list was retained considerably better 
than the noncategorized list. This finding 
is in agreement with the results of earlier 
experiments which showed better retention 
for lists composed of mismatched normative 
associates rather than unrelated pairs (Postman,
Fraser, & Burns, 1968; Underwood &
Ekstrand, 1968). The presence of closely 
related items in the list appears to have a 
positive influence on response recall which 
outweighs the associative interferences at the 
time of the retention test. Under the ex- 
perimental condition, this difference between 
the categorized and noncategorized mate- 
rials was eliminated. According to the 
present interpretation, prior free recall learn- 
ing increased the proactive effects of the pre- 
existing linguistic associations sufficiently to 
outweigh the advantage conferred by a high 
level of response recall. This interpretation 
must be viewed with caution: the actually 
observed difference in long-term retention 
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was between the control groups, and the 
virtual absence of a difference between the 
experimental groups is attributed to a bal- 
ance of positive and negative factors. How- 
ever, earlier analyses of extraexperimental 
forgetting have likewise pointed to dual and 
opposed effects of language habits on re- 
tention (cf. Postman, 1967). 

In the recall of the categorized lists, ap- 
propriate intrusions were not numerous, but 
the difference that was found—12 occur- 
rences for the experimental group and 5 for 
the control group—was in the expected di- 
rection. However, as in acquisition, the 
frequencies of such intrusions relative to 
all errors of commission were comparable 
for the two groups, namely, 17% and 14%. 
The relearning scores will not be reported 
in detail. The groups that differed on the 
delayed test of recall converged as relearning
continued; but the initial pattern of variation
remained in evidence on all but the last
trial.

At this point an alternative explanation
should be considered briefly. Can the higher
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ment concerns the difference between the 
two classes of materials. There was nega- 
tive transfer in the acquisition of both types 
of list, but proactive interference only in the 
recall of categorized items. In the case of 
the noncategorized lists, interitem depend- 
encies newly developed in the course of free 
recall learning were stable enough to impede 
the performance of the transfer task that 
followed immediately; but they apparently 
failed to recover after an interval of time 
and hence did not produce proactive losses 
in long-term recall. There was an alterna- 
tive possibility, however, that warranted 
consideration: The measures of retention,
which were heavily dependent on response 
recall, may not have been sufficiently sensi- 
tive to the proactive interferences falling on 
specific associations. The next study bears 
directly on this alternative interpretation. 


EXPERIMENT III 


This study again investigated the influence 
of prior free recall learning on paired-asso- 
ciate transfer and retention. As compared 
with the last experiment, there were two 
major changes. First, in order to focus as 
directly as possible on the associative com- 
ponents of transfer and interference, the test 
list was learned and retention was measured 
by the multiple-choice method. Second, only 
noncategorized materials were used, and a 
comparison was made between lists of high 
and of low meaningfulness. The stability of 
the higher order groupings established dur- 
ing free recall should increase as a function 
of the degree of meaningfulness of the words 
in the list. The amounts of interference in 
the paired-associate phase should, in turn, 
directly reflect the stability of the prior or- 
ganization of the materials. The question 
of central interest was whether a test maxi- 
mally sensitive to the associative component 
of interference would bring to light proac- 
tive effects on long-term retention even in 
the absence of normative relations among 
the items.


Method 


Design. Except for the two changes noted above, 
the design was the same as in Experiment II. The 
tests of retention occurred either 3 minutes or 48
hours after the end of paired-associate learning.
Thus, the design was again a 2 × 2 × 2 factorial,
with 2 levels of meaningfulness, 2 conditions of 
transfer (experimental vs. control), and 2 reten- 
tion intervals. 

Lists. The learning materials were four lists of 
20 nouns. The words were selected from the
norms of Paivio, Yuille, and Madigan (1968).
The scaled meaningfulness of the words was high
in two of the lists (average m values of 7.25 and
7.33) and low in the other two lists (average m
values of 4.41 and 4.17). The degrees of concreteness
of the words in the two types of lists
were equated, but frequency of usage was correlated
with m value. Four different lists of 10
paired associates were derived from each basic 20-word
list, with the pairings again selected at random.
Within each level of meaningfulness, the
lists were fully balanced.

Procedure. Free recall learning in the first
phase of the experiment was again to a criterion
of 16/20 plus three additional study-test cycles.
The words were presented on study trials at a 
2.5-second rate, and the written tests of recall 
were 2 minutes in duration. 

The paired-associate lists were learned by the 
multiple-choice method to a criterion of 8/10. A 
2.5-second rate of presentation was used on study 
trials, and a 5-second rate on test trials. The in- 
terval between a study and a test trial was 5 
seconds, and that between a test trial and the next 
study trial was 2.5 seconds. In all other respects, 
the multiple-choice procedure was the same as in 
Experiment I. The interval between the first and 
the second list was 5 minutes. 

Retention was measured on five successive test 
trials. The items were again presented at a 5-sec- 
ond rate. The subjects received no feedback dur- 
ing the test of retention. 

Subjects. There were 16 subjects in each of the
eight groups. The source of the subjects and the 
manner of assignment to conditions were the same 
as in the previous experiments. 


Results


First-list learning. In the free recall phase,
the high-m lists were learned faster than the
low-m lists, but the difference was not large.
The mean numbers of trials to the criterion
of 16/20 were 3.08 and 4.03 (t = 4.64, df =
126, p < .01). There were no reliable differences
between groups learning the same
type of list. The levels of intertrial repeti- 
tion were comparable for the two types of 
material. For the last three tests, the mean 
intertrial repetition scores per trial, deter- 
mined as before, were 2.93 and 2.62 for the 
high-m and the low-m lists, respectively.

TABLE 4

NUMBERS OF TRIALS TO CRITERION ON
TRANSFER LISTS OF HIGH AND OF LOW
MEANINGFULNESS IN EXPERIMENT III

Condition        High m    Low m    Mean
Experimental      1.94      2.41    2.18
Control           1.62      2.16    1.89
Mean              1.78      2.28

Transfer. Acquisition of the paired-associate
list by the multiple-choice method was
quite rapid under all conditions (Table 4).
The high-m lists were again learned reliably
faster than the low-m lists (F = 7.02, df =
1/120, p < .01). In agreement with the results
of the previous studies, there was also
some evidence of negative transfer, but the
difference between the experimental and the
control conditions did not reach significance
nor did it interact with meaningfulness. The
generally rapid mastery of the task clearly
limited the sensitivity of the criterion measures
to differential transfer effects.⁶

⁶ The fact that acquisition was considerably
faster than under the multiple-choice condition in
the previous experiment may be attributed to the
slower rate used on both presentation and test
trials. A slower rate was chosen in the present
experiment in order to minimize the effects of
possible differences in search time for items of
high and of low meaningfulness (cf. Schulz &
Lovelace, 1964).

TABLE 5

MEAN NUMBERS CORRECT ON TESTS OF RETENTION
FOR PAIRED-ASSOCIATE LISTS IN
EXPERIMENT III

                       High m                  Low m
Condition        3 minutes   48 hours    3 minutes   48 hours
Experimental        8.4         5.9         8.2         5.1
Control             8.9         7.…         8.9         4.2

Recall. The mean numbers of correct responses
given on the retention test are shown
in Table 5. For the high-m lists the long-term
losses were, as expected, greater under
the experimental than under the control
treatment. The difference in the rate of forgetting
was in the opposite direction for the
low-m lists, indicating that the benefits of
prior item familiarization outweighed whatever
interference effects might have been
present. Thus, the pattern of retention
scores parallels that obtained in the last experiment.
The three-way interaction was
again significant (F = 4.50, df = 1/120, p
< .05). When the two types of lists were
considered separately, the interaction of the
condition of transfer with the length of the
retention interval was significant in neither
case. However, in the recall of the high-m
lists, with the control group having an advantage
on both the immediate and the delayed
test, the main effect of the condition
of transfer was reliable (p < .02). Prior
free-recall learning was, therefore, a significant
source of interference, but there was no
statistical support for a build-up of proactive 
effects over time. In view of the significance 
of the higher order interaction and the con- 
sistency with the results of the last experi- 
ment, the temporal trend should probably 
not be discounted. On the other hand, it is 
now clear that under the conditions of the 
present experiments, dependable proactive 
effects are obtained only when the organiza- 
tion of the items in the free recall phase is 
based on well-established linguistic dispositions.

As in the preceding experiment, there 
were substantial differences in long-term retention
between the two control conditions.
The higher the meaningfulness, the more 
stable were the associations formed during 
paired-associate learning. As a result of the 
proactive influence of free recall learning, 
this advantage was lost under the experi- 
mental conditions. This interpretation is, 
of course, subject to the same reservations 
as before. It should also be noted that in 
the present case, differentiation cannot be 
invoked as an explanatory factor, since the 
multiple-choice procedure entirely eliminates 
the requirement of list identification. On 
balance, the nature of the higher order interaction
points once more to dual and opposed
effects of prior language habits.


Gr 


The central question in this series of ex- 
periments was whether subjective groupings 
developed in the course of free recall learning
are functionally continuous with the
products of controlled associative learning.
The demonstration that such groupings can
become an effective source of interference
in the acquisition and retention of subsequent
associative tasks strongly supports an
affirmative answer. 

The present results do not permit us to 
specify in detail the mechanisms responsible 
for the observed interference effects. One 
possibility is, of course, that the subjective 
groupings consist of ordered sequences that 
come into conflict with the prescribed pairings
in the transfer phase. The interference
would thus reflect the successive acquisition 
of incompatible sequential arrangements of 
the items. However, there may be negative 
transfer from free recall to paired-associate 
learning not only when the test task calls
for the disruption of internally ordered units. 
As a result of prior experience, each item in 
the test list may come to arouse numerous 
and changing associates which enter into 
competition with the prescribed one. With 
respect to the pattern of interitem depend- 
encies, the change would then be from a net- 
work with multiple alternative pathways to 
a strictly segmented one. The extent to 
which each of these mechanisms will come 
into play will, of course, depend on the na- 
ture of the organization developed in the 
course of free recall learning. 

The finding that there was positive trans- 
fer from paired-associate to free recall learn- 
ing complements the results obtained with 
the reverse order of tasks. The existence 
of stable interitem dependencies can be a 
source of facilitation not only when the 
groupings are generated by the subject but 
also when they are based on arbitrary pair- 
ings. Thus, even a segmented initial organ- 
ization is better than none, at least in the 
relatively short term. It is recognized that 
the use of less than optimal units may under 
certain circumstances retard the progress of 
free recall learning (cf. Tulving, 1968). For 
purposes of the present discussion, the criti- 
cal fact is that the same dispositions can 
function effectively in the performance of a 
paired-associate and a free recall task.

At this point, one may be tempted to argue 
that the phenomena of subjective organiza- 
tion can be subsumed under the principles 
of associative learning; or conversely, that 
the establishment of prescribed associations 
represents a special case of organization.
Neither of these arguments will be enter-
tained here. The thrust of the present com- 
ments is to call for a rapprochement between 
the exponents of organization and of asso- 
ciation, with the analysis of comparable and 
commensurable phenomena as the meeting 
ground. The conceptual framework that 
will eventually prove most useful in the pur- 
suit of this objective may well represent a 
confluence of both orientations. 

Recognition of the continuity between as- 
sociation and organization may help us to 
gain a new perspective on the mechanisms of 
extraexperimental interference in forgetting. 
The basic assumption of a general theory of 
interference is that the principles of retroac- 
tive and proactive inhibition, as established 
in controlled experiments, are responsible 
for forgetting outside the laboratory. In the 
retention of verbal materials, the major 
sources of extraexperimental interference 
have been sought in the subject’s language 
habits (Underwood & Postman, 1960). The 
repeated failures of predictions based on this 
conception have given rise to a search for 
the shortcomings of the theory (e.g., Post- 
man, 1963; Underwood & Ekstrand, 1966). 
The possible deficiencies in the existing for- 
mulation deserve careful exploration before 
the hypothesis is abandoned altogether, espe- 
cially in the absence of better supported al- 
ternatives. Two further questions should be 
added to some of the earlier speculations. 

The first of these concerns the units of 
analysis with respect to which the subject’s 
preexperimental dispositions are considered. 
We have been inclined to specify these dis- 
positions in terms of fixed pairwise associa- 
tions. Thus, the response car to TABLE 
should be difficult to learn and easy to forget 
because chair is a powerful competitor. 
However, pairwise linkages do not neces- 
sarily reflect verbal habits in actual use; they 
are usually drawn from norms gathered by 
the special procedures of tests of free asso- 
ciation. Surely our associative dispositions 
represent highly complex networks of inter- 
word dependencies. The total pattern of 
interitem relations in a list is likely, there- 
fore, to be a critical factor in acquisition and 
retention. Consequently, it becomes impor-
tant to inquire into the conflicts that may
develop between the list organization im-
posed in the experiment and that entailed by 
the subject’s linguistic dispositions. Does 
the experimental task require the segmenta- 
tion into separate and independent units of 
higher order groupings based on linguistic 
convention? The results of the experiment 
with categorized materials indicate that un- 
der such circumstances an appreciable
amount of both associative and proactive 
interference may come into play. 

The second question focuses on the condi- 
tions under which preexperimental disposi- 
tions may be expected to become effective in 
the acquisition and retention of laboratory 
tasks. We referred earlier to the importance 
of ensuring the arousal, and thus preparing 
the way for the subsequent recovery, of the 
critical linguistic dispositions in the context 
of the experimental situation. The results 
of the present studies indicate that the verbal 
dispositions activated in free recall learning 
do, in some measure, become tied to the
experimental context. The requirements of 
organization entailed by a free recall task 
serve to reduce the isolation of experimental 
from preexperimental habits.


FUNCTION OF DISTINCT ASSOCIATIONS FOR
PAIRED-ASSOCIATE PERFORMANCE


GEORGE WOLFORD


Dartmouth College


A model for predicting performance on a variety of paired-associate test
types is developed and empirically tested. The primary feature of the model
is the assumption that subjects form distinct forward and backward associa-
tions within a single paired-associate item. Different test types have different
numbers of associations available for mediating performance. Two experi-
ments were run using recall, recognition, and a variety of forced-choice test
types. Both experiments offer support for the model.


The one-element model was proposed by
Bower (1961) and Estes (1961) to account
for the data from a series of paired-associate
experiments. The model provides a good
quantitative account of the data when there
are only a few well-learned responses and
when learning is assessed using forward
recall tests (Bower, 1961).

A number of studies which used recogni-
tion tests in addition to forward recall tests
to measure learning have produced results
which are not fit well by the one-element
model (Estes & Da Polito, 1967; Kintsch
& Morris, 1965). The two main difficulties
for the one-element model are that recogni-
tion is often more superior to forward recall
than can be accounted for by different
probabilities of guessing correctly, and
there are variables which affect recall and
recognition differentially.

One possible extension of the one-element
model is to assume that more than one as-
sociation can be learned within a single
paired-associate item. Consider the pair
BOY-7. It might be possible for the subject
to learn one directional association running
from BOY to 7 and another directional as-
sociation running from 7 to BOY. The
implications of this extension of the one-
element model for recall and recognition
performance are developed in this paper.


DIRECTIONALITY OF ASSOCIATIONS 


The learning of a single item from a 
paired-associate list has often been con- 
sidered analogous to the learning of a clas- 
sically conditioned association (Gibson, 
1940; Goss, Morgan, & Golin, 1959). Such 
a formulation strongly implies that associ- 
ations are directional when formed. In 
particular, an association is directed from 
the stimulus to the response. One major 
problem with the S-R formulations has 
been the persistent presence of backward 
associations. In some paired-associate 
experiments, the backward associations 
appear to equal or even exceed the forward 
associations in strength (Asch & Lindner, 
1963; Lockhart, 1969). These strong back- 
ward associations are quite at odds with 
the typical difficulty experimenters in clas- 
sical conditioning have had in establishing 
backward associations (Cason, 1935; Por- 
ter, 1938). 

An interesting solution to the problem 
of backward associations, proposed by Asch 
and Ebenholtz (1962), is called the prin- 
ciple of associative symmetry. This prin- 
ciple states that upon the formation of 
a forward association, a backward associa- 
tion of equal strength is necessarily formed.
Associative symmetry carries the implicit
assumption that any given paired associate
consists of two stimuli joined by a single
bidirectional association.

One problem with the theory of associa-
tive symmetry arises from the fact that in 
practice backward and forward associations 
are rarely equal, the forward associations 
usually being superior, as revealed by higher 
recall scores (Ekstrand, 1966). To handle 
this difficulty, Asch and Ebenholtz (1962) 
further assume that both members of the 
pair must be of equal availability for the 
equality of associations to manifest itself. 
Availability is the ability of an item to 
maintain itself in short-term memory in 
the midst of interfering activity (Horowitz, 
Norman, & Day, 1966). Some experi- 
menters have manipulated availability 
and obtained results in keeping with the 
theory of associative symmetry (Asch & 
Ebenholtz, 1962; Horowitz et al., 1966). 
Other experimenters, however, have sup- 
posedly equalized availability and the non- 
equality of forward and backward associa- 
tions continued to appear (Coutu, 1966; 
Schild & Battig, 1966). 

An alternative to the assumption of as- 
sociative symmetry is the assumption that 
the subject forms distinct directional as- 
sociations on any given trial. If we con- 
ceptualize a paired-associate item as con- 
sisting of two stimuli, $S_1$ and $S_2$, then two
directional associations can be formed: one
association running from $S_1$ to $S_2$, and
another directional association running
from $S_2$ to $S_1$. The function of an associa-
tion is to call forth a representation of the 
appropriate pair member in memory. The 
response which is then made is dictated by 
the experimental situation. Therefore, 
instead of one bidirectional association, we 
would have two unidirectional associations. 

A number of studies have produced evi- 
dence in favor of distinct directional as- 
sociations. An experiment by Keppel and 
Underwood (1962) provides support for the 
existence of distinct forward and backward 
associations. They used an A-B, A-C 
paradigm and found that it is possible to 
greatly reduce the strength of forward as- 
sociations without reducing the strength of 
backward associations. Johnston (1967) 
has replicated the finding with additional 
controls and an expanded design. If it is 
possible to destroy the association in one
direction while leaving the association in
the other direction intact, then there must 
have been more than one association to 
begin with. 

Lowry and Wollen (1969) conducted an 
experiment in which the forward and back- 
ward associations had different retention 
properties. This finding provides addi- 
tional support for the assumption of distinct 
forward and backward associations. 

Experiments which find forward and 
backward associations to be of equal 
strength do not necessarily offer support 
for associative symmetry. There is no 
reason why two associations could not be 
distinct and still be equal in a given 
experiment. 


IMPLICATIONS FOR PERFORMANCE 


The primary reason for concern over the 
distinction between a single bidirectional 
association and two directional associations 
is the belief that the assumption of distinct 
directional associations (if valid) can in-
crease our understanding of the interrela-
tionships between various test types. Other
experimenters have explored the implica-
tions of distinct associations in a variety of
interference paradigms (Keppel, 1968;
McGovern, 1964). Little attention, how-
ever, has been given to formalizing the
notions or to varying test types over a wide
range. First presented is an intuitive over- 
view of the implications of the concept of 
distinct directional associations for a vari- 
ety of paired-associate test types. Then, 
the formal assumptions required to make 
quantitative predictions about the various 
test types are listed. 


Various types of paired-associate tests
have different numbers of forward and
backward associations available for mediat-
ing performance. On a forward recall test
the subject is presented with the left-hand
(or temporally first) member of a pair and
asked to fill in the missing right-hand mem- 
ber of that pair; only a learned forward 
association will insure correct performance. 
The probability of guessing correctly on a 
forward recall test is low or negligible in 
most experiments. 


On an old-recognition test, the subject is
presented with a complete pair and can
correctly respond “old” if either the for- 
ward or backward association for that pair 
is learned. On a new-recognition test, the 
subject is presented with the left-hand pair 
member of one pair and the right-hand pair 
member of a second pair and can correctly 
respond “new” if the forward association of 
the first pair or the backward association of 
the second pair is learned. On an old- or 
new-recognition test, the subject can also 
guess correctly if neither association is 
learned. The probability of a correct guess 
depends on the subject’s bias but is near .5 
in most experiments. Recognition, then, 
differs from recall in potential guessing 
accuracy and in the potential availability of 
an additional association. Recognition will 
be most superior to forward recall when 
backward associations are prominent. 

On one type of forced-choice test, the 
subject is presented with one left-hand pair 
member and two right-hand pair members. 
His task is to choose the right-hand pair 
member which was paired with the left- 
hand pair member during training. The 
subject has one forward and two backward 
associations available for mediating perfor- 
mance. The learned association from either 
member of the correct pair will insure a cor- 
rect response as on an old-recognition test. 
In addition, if the subject has learned the 
backward association from the incorrect 
alternative, he can eliminate that alterna- 
tive and respond correctly (providing all 
pairs within the presented list are unique). 
Performance on a forced-choice test with 
two alternatives will be superior to the 
average performance on old and new recog- 
nition owing to the presence of an additional 
association. 

Two predictions have emerged thus far 
in the development of a model which as- 
sumes distinct directional associations. 
Performance on old-new recognition will be 
more superior to performance on forward 
recall tests when backward associations are 
strong relative to forward associations, and 
performance on forced-choice tests with 
two alternatives will be superior to the 
averaged performance on old-new recogni- 
tion tests.

A number of previous studies have com-
pared recognition and recall, with some
studies finding recognition superior to re-
call (Bruce & Cofer, 1965; Postman & Rau,
1957) and some studies finding recognition
equal in performance to recall, after ap-
propriate guessing corrections (Davis, Su-
therland, & Judd, 1961; Estes & Da Polito,
1967). It would be nice to show that the 
latter studies had less prominent backward 
associations, but none of the studies re- 
ported backward recall scores. Also, there 
are variables other than backward associa- 
tions which influence comparisons between 
recognition and recall, such as partial 
associations (McNulty, 1965) and the selec- 
tion of the incorrect alternatives (Bahrick 
& Bahrick, 1964). 

The second prediction from above, 
namely that performance on forced choice 
with two alternatives will be superior to 
combined performance on old-new recog- 
nition, has received support in the literature 
(Freund, Loftus, & Atkinson, 1968; 
Kintsch, 1968). 


OTHER THEORIES OF PAIRED-ASSOCIATE 
PERFORMANCE 


One of the earliest theories of recognition 
performance held that both recall and 
recognition are measures of the same under- 
lying process, but that recognition is the 
more sensitive of the two (Postman, 
Jenkins, & Postman, 1948). This view is in 
need of modification to account for the 
aforementioned cases where recall is equal 
to recognition. This view would be com- 
patible with the multiple association model 
if “recognition is more sensitive” were
translated into “recognition tests usually
have more associations available for
performance.”

Estes and Da Polito (1967) and Kintsch 
and Morris (1965) have suggested that 
learning involves two processes: storage 
and retrieval. Both processes are necessary 
for correct performance on recall, but only 
storage is necessary for correct performance 
on recognition. It seems reasonable, how- 
ever, that recognition also involves some 
form of retrieval. The subject must still 
find the association in memory in order to
verify that a given pair presented for
recognition is a correct one. It does not 
seem that the mere presence of both pair 
members would insure the subject's finding 
the association given that it is stored. It 
is probable, however, that the presence of 
both pair members will add additional cues 
for the finding of the association. There is 
also evidence in the partial information 
literature (McNulty, 1965) that what is 
stored is important for recognition-recall 
differences. If what is stored is only a 
partial association, then recognition re- 
ceives more benefit than recall. In terms 
of the distinct association model, any pair 
which has been learned in the backward 
direction, but not in the forward direction, 
can be considered as a pair which is stored 
but is irretrievable in the forward direction. 

Another relative newcomer to the area 
of recognition performance is signal detec- 
tion theory, which was originally developed 
to handle the data of perceptual detection 
experiments and was extended to recogni-
tion memory by Egan (1958). A number
of experimenters have applied the signal 
detection theory model to paired-associate 
experiments (Freund et al., 1968; Hickson, 
1968; Suboski, 1967). The main problem 
with this theory for the present research 
arises in making comparisons between 
recognition and recall performance. For 
the most part, signal detection theory 
makes no statement about recall perfor- 
mance. At best it treats recall as forced 
choice from many alternatives. For the 
recall of words, two-digit numbers, or other 
items with large class membership, signal 
detection theory will generally underpredict 
recall when treated as forced choice. Signal 
detection theory and the distinct associa- 
tion model agree about the superiority of 
forced choice from two alternatives over 
old-new recognition. 


THE MODEL


Four formal assumptions are required to 
incorporate the concept of distinct direc- 
tional associations into a mathematical 
model capable of generating quantitative 
predictions for various paired-associate test 
types.

Assumption 1: The learning process for
any item can be characterized as a four- 
state Markov process in which the four 
states are as follows: State U— neither as- 
sociation is learned; State B—only the 
backward association is learned; State 
F—only the forward association is learned ; 
and State L—both the forward and back- 
ward associations are learned. On a single 
training trial, a given pair will transit from 
State U to State B with probability z; State 
U to State F with probability y; State U 
to State L with probability x; and the pair 
will remain in State U with probability 
1 - x - y - z. The transition probabili-
ties from States B, F, and L will not enter 
into any of the predictions in the present 
paper; these transitions need to be taken 
into account only when there are multiple 
test trials with intervening training trials. 

Assumption 2: Prior to the first reinforce- 
ment or training trial, all pairs are in State 
U. 

Assumption 3: Associations within differ- 
ent pairs are learned independently of one 
another. 

Assumption 4: The subject uses all avail- 
able associations in making any response. 

The learning process of the distinct as- 
sociation model (referred to as M1) de- 
scribed in the above assumptions can be 
represented by the matrix: 


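$$
\begin{array}{c|cccc}
 & L & F & B & U \\
\hline
L & 1 & 0 & 0 & 0 \\
F & a & 1-a & 0 & 0 \\
B & b & 0 & 1-b & 0 \\
U & x & y & z & 1-x-y-z
\end{array}
$$

where the rows give the state on Trial n and the columns give the
state on Trial n + 1.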
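The single-trial transitions out of State U described in Assumptions 1 and 2 can be sketched in Python. This is only an illustration under stated assumptions: the function names and the example parameter values are hypothetical and are not taken from the paper; only the symbols x, y, and z follow the text.

```python
import random


def state_distribution(x, y, z):
    """Return the state probabilities of a pair after one training trial.

    Every pair starts in State U (Assumption 2). The names x, y, z follow
    the text; this helper itself is a hypothetical illustration.
    """
    if min(x, y, z) < 0 or x + y + z > 1 + 1e-12:
        raise ValueError("x, y, z must be nonnegative and sum to at most 1")
    return {"L": x, "F": y, "B": z, "U": 1 - x - y - z}


def sample_state_after_one_trial(x, y, z, rng=random):
    """Draw the state of one pair after a single training trial."""
    dist = state_distribution(x, y, z)
    u = rng.random()  # uniform draw in [0, 1)
    cumulative = 0.0
    for state in ("L", "F", "B"):
        cumulative += dist[state]
        if u < cumulative:
            return state
    return "U"  # the pair stays unlearned


if __name__ == "__main__":
    # Arbitrary example values, chosen only for illustration.
    print(state_distribution(0.2, 0.3, 0.1))
    print(sample_state_after_one_trial(0.2, 0.3, 0.1))
```

With the arbitrary values x = .2, y = .3, and z = .1, a pair would remain in State U after the first trial with probability of about .4.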


The distinct association model is pre-
sented above in its most general form. Pa-
rameters a and b are somewhat superfluous
since they do not enter into any of the pre-
dictions yet considered. They should be
involved only when there are multiple
reinforcement trials with intervening test
trials. The correlation between the forma-
tion of the forward and backward associa-
tions is allowed to assume any value
between -1 and +1. A correlation of
+1 would indicate associative symmetry
and would be obtained when y and z are
zero. This would mean that either both 
associations are learned or the item remains 
in the unlearned state. A correlation of 0 
would indicate complete independence of 
forward and backward associations and 
would be obtained when x = (x + y)(x + z).
This equality arises from the
definition of independence, namely, two
associations are independent if the probabil-
ity of learning both is equal to the product
of the probabilities of learning each separately. The actual
nature of the correlation is determined by 
estimating the parameters from empirical 
data. When independence holds and the 
correlation between forward and backward 
associations is zero, the distinct association 
model can be rewritten as a two-parameter 
model. 
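The relation between the parameters and the correlation can be made concrete with a small Python sketch. It treats "forward association learned" and "backward association learned" as two binary events after one trial and computes their product-moment (phi) correlation. The choice of the phi coefficient, the function names, and the marginal probabilities pf and pb are assumptions introduced here for illustration; they are not part of the original presentation.

```python
from math import sqrt


def forward_backward_correlation(x, y, z):
    """Phi correlation between learning the forward and backward association.

    x: probability that both associations are learned (State L)
    y: probability that only the forward association is learned (State F)
    z: probability that only the backward association is learned (State B)
    """
    pf = x + y  # marginal probability that the forward association is learned
    pb = x + z  # marginal probability that the backward association is learned
    denom = sqrt(pf * (1 - pf) * pb * (1 - pb))
    if denom == 0:
        raise ValueError("correlation is undefined when a marginal is 0 or 1")
    return (x - pf * pb) / denom


def independent_parameters(pf, pb):
    """Two-parameter version: derive (x, y, z) assuming independence."""
    return pf * pb, pf * (1 - pb), (1 - pf) * pb


if __name__ == "__main__":
    print(forward_backward_correlation(0.4, 0.0, 0.0))   # y = z = 0
    print(forward_backward_correlation(*independent_parameters(0.5, 0.4)))
```

Under these assumptions, setting y and z to zero gives a correlation of +1, and parameters generated from two independent marginals give a correlation of zero, which matches the two special cases described in the text.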


Theoretical Predictions 


Predictions for the probability correct 
after one reinforcement will be presented 
for a variety of test types. 

Forward recall. The subject will be 
correct on a forward recall test of a given
pair if that pair is in State L or F (i.e., if 
the forward association is learned). He 
can also guess correctly from State U with 
probability 1/r. Therefore, 


$$P(C) = x + y + (1 - x - y)\frac{1}{r}. \tag{1}$$


The parameter r refers to the number of 
members in the set from which the right- 
hand pair members of the list were chosen. 
It will be assumed in the present research 
that items were chosen from sufficiently 
large sets so that 1/r is approximately zero.
Equation 1 can then be rewritten as:

$$P(C) = x + y. \tag{2}$$

Backward recall. It is also assumed that 
on backward recall tests the probability of 
guessing correctly from the unlearned state 
is approximately zero. The subject will, 
therefore, be correct on the backward recall 
of a given pair if that pair is in State L or 
State B. 

$$P(C) = x + z. \tag{3}$$

Old recognition. On an old-recognition
test, the subject is presented with both
members of a pair. The subject will cor-
rectly respond “old” if the pair is in State
L, State F, or State B; the subject will guess
correctly from State U with probability g.
Therefore,

$$P(C) = 1 - (1 - x - y - z)(1 - g). \tag{4}$$

New recognition. On a new-recognition
test, the subject is presented with the left-
hand member of some pair i and the right-
hand member of a different pair j. He will
correctly respond “new” if pair i is in State
L or State F, or if pair j is in State L or
State B. In other words, the subject will
be correct if he has learned the forward
association of pair i or the backward as-
sociation of pair j. In the absence of either
association, the subject will guess correctly
with probability (1 - g). Therefore,

$$P(C) = 1 - (1 - x - y)(1 - x - z)g. \tag{5}$$

Model M1 predicts different values for 
the probability correct on old- and new- 
recognition tests. Associations within dif- 
ferent pairs (such as those available on new 
tests) are assumed to be learned independ- 
ently of one another, but associations within 
the same pair (such as those available on 
old tests) are not necessarily learned inde- 
pendently of one another in Model M1. 
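The recall and recognition predictions can be collected in a short Python sketch. It follows the verbal derivations above (Equations 2 through 5), with g read as the probability of guessing “old” from the unlearned state. The function names and the example parameter values are hypothetical and serve only as an illustration.

```python
def forward_recall(x, y):
    """Correct if the forward association is learned (State L or F)."""
    return x + y


def backward_recall(x, z):
    """Correct if the backward association is learned (State L or B)."""
    return x + z


def old_recognition(x, y, z, g):
    """Correct unless the pair is in State U and the guess is 'new'."""
    return 1 - (1 - x - y - z) * (1 - g)


def new_recognition(x, y, z, g):
    """Correct unless the forward association of pair i and the backward
    association of pair j are both unlearned and the guess is 'old'.
    The two pairs are treated as independent (Assumption 3)."""
    return 1 - (1 - x - y) * (1 - x - z) * g


if __name__ == "__main__":
    # Arbitrary example values, not estimates from the experiments.
    x, y, z, g = 0.2, 0.3, 0.1, 0.5
    print(forward_recall(x, y), backward_recall(x, z))
    print(old_recognition(x, y, z, g), new_recognition(x, y, z, g))
```

Because the new-recognition prediction multiplies the two marginal failure probabilities while the old-recognition prediction uses the joint probability of State U, the two values generally differ, as the text notes.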
Forward forced choice. On a forced-choice 
test, the subject has all of the associations 
which would have been available for 
mediating performance on an old-recogni- 
tion test of the same pair plus m — 1 
incorrect alternatives which can be elimi- 
nated on the basis of learned associations. 
On a forward forced-choice test (one left- 
hand pair member and two or more right- 
hand pair members), the subject eliminates 
the incorrect alternatives on the basis of 
learned backward associations. In terms of 
the Markov process, the subject will be 
correct on a forced-choice test of a pair if
the pair is in State L, F, or B. If the pair 
is in State U, the subject will eliminate all 
incorrect alternatives from pairs which are 
in State L or B before guessing. Therefore, 


$$P(C) = 1 - (1 - x - y - z) \sum_{i=0}^{m-1} \binom{m-1}{i} (x + z)^{i} (1 - x - z)^{m-1-i} \left(1 - \frac{1}{m - i}\right). \tag{6}$$
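The forced-choice prediction can be sketched in Python directly from the verbal description: a pair in State U leads to a guess among the alternatives that remain after each incorrect alternative has been eliminated with probability x + z. The function names and the example values are hypothetical, and the recognition helpers are repeated here only so that the block is self-contained.

```python
from math import comb


def forced_choice(x, y, z, m):
    """Probability correct on a forward forced-choice test with m alternatives."""
    p_unlearned = 1 - x - y - z   # tested pair is in State U
    p_eliminate = x + z           # an incorrect alternative can be ruled out
    p_error_given_u = 0.0
    for i in range(m):            # i of the m - 1 incorrect alternatives ruled out
        p_i = comb(m - 1, i) * p_eliminate**i * (1 - p_eliminate)**(m - 1 - i)
        p_error_given_u += p_i * (1 - 1 / (m - i))  # wrong guess among m - i
    return 1 - p_unlearned * p_error_given_u


def old_recognition(x, y, z, g):
    return 1 - (1 - x - y - z) * (1 - g)


def new_recognition(x, y, z, g):
    return 1 - (1 - x - y) * (1 - x - z) * g


if __name__ == "__main__":
    # Arbitrary example values, not estimates from the experiments.
    x, y, z, g = 0.2, 0.3, 0.1, 0.5
    averaged = (old_recognition(x, y, z, g) + new_recognition(x, y, z, g)) / 2
    print(forced_choice(x, y, z, 2), averaged)
```

For these illustrative values the two-alternative forced-choice prediction (about .86) exceeds the averaged old-new recognition prediction (about .81), in line with the second prediction stated earlier.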


and II. The fit for the conditional proba- 
bilities of correct responding of Experiment 
I is less than adequate, but there are
variables outside the realm of the model 
which might be contributing to the dis- 
crepancies between the predicted and ob- 
served results. These variables were men- 
tioned above. The one-element model will 
not yield similarly adequate predictions of 
the results in Table 3. The one-element 
model or any similar single-process model 
would have to predict that if the proba- 
bility of correct response on forward recall 
differs from one item type to another, then 
the probability of a correct response on
recognition would also differ. 

There was also some indication in the
spontaneous comments and behavior of
the subjects that the distinct association
model may be an accurate reflection of the
decision processes in these two experiments.
Two subjects in Experiment II wrote in the
correct associates of the incorrect alterna-
tives on the forced-choice tests, and three
subjects in Experiment I wrote in the
correct associates on new-recognition tests.
The writing in of incorrect alternative
associations is additional evidence that
subjects are using these associations as a
source of information for recognition per-
formance. Other subjects mentioned at
the conclusion of the experiment that they
had implicitly eliminated incorrect alterna-
tives on new and forced-choice tests on the
basis of learned associations. Mandler,
Pearlstone, and Koopmans (1969) have
similarly suggested that subjects may use
a “retrieval check” during an old-new
recognition task.

Concerning the nature of the correlation
between forward and backward associa-
tions, independence seems to have the
upper hand. The parameter estimates from
both experiments indicate a correlation of
zero between forward and backward associ-
ations. The distinct association model can
be reduced to a two-parameter model in
this case. Wollen, Allison, and Lowry
(1969), using a different type of analysis
and experiment, found a correlation some-
where between one and zero. More direct
research is needed to determine the exact
value of the correlation.

The distinct association model allows 
one to predict and vary the relationship 
between forward recall and recognition. 
Variables which strengthen backward as-
sociations relative to forward associations
will result in heightened differences be-
tween forward recall and recognition. Such
variables include item type (##-WORD
pairs), rehearsal in the backward direction,
and instructions (stressing backward re-
call). Converse values of the above
variables will tend to lessen differences
between forward recall and recognition.

The distinct association model can be
extended to make predictions in any
situation where multiple distinct associa-
tions are available for mediating perform-
ance. Presumably a number of interitem
(and possibly intraitem) associations might
be formed in other verbal learning para-
digms. Intraitem associations could pro-
vide a source of information on single-item
recognition tasks. Items consisting of
many unique components between which
such associations could be formed (such as
pictures of faces) would have an advantage
over sparser items. Interitem associations
might be of importance in free recall.
Items which have already been recalled
provide cues for the recall of additional
items. In order to apply the distinct
association model to these situations, we
need to measure the strength and determine
the number of associations which are
available for mediating performance in
each case.


One further possibility is that changes in
postcriterion latencies can be explained
by the distinct association model. [...]
learned associati[...] an old-recognition test, but
[...] have two learned associations
might have shorter latencies. Further
[...] to test the [...] association [...]


How people learn a new behavior (C) in a situation (A) for which there
already exists a learned behavior (B) and how these competitive behaviors are
remembered are the chief concerns of verbal learning theory. These theories
entail postulated dependencies among A, B, and C. Associative interference
theory has focused on A-B, A-C associative interdependence; list differentia-
tion theory, on B, C availability interdependence; stimulus encoding theory,
on stimulus identification interdependencies. The purpose of this paper is to
isolate these postulated dependencies and to argue for the critical importance
of finding or failing to find the requisite dependencies in memory tasks where
A, B, and C are to be retrieved in some form.

The theoretic problems the verbal 
learners face are, in their general form, 
rather universal: A human learns Behavior 
B in Situation A. How does he do it? 
Later he must learn another behavior, C, 
which now is decreed to be appropriate to 
Situation A. How does he do it? At some 
time in the future he is confronted with 
Situation A. Is his behavior to be of the 
B or the C variety, and how so? 

Answering questions like these is a 
theoretic enterprise: Assumptions about 
how people learn are arranged into argu- 
ments that might explain or predict 
performance in learning tasks. Critical to 
advancement in this enterprise is empirical 
evaluation of the theories proposed. Empir- 
ical evaluation in an area of science where 
theories are sometimes almost conversa- 
tional entails two particularly important 
operations. One of these is clarification 
and definition of the collection of state- 
ments that together constitute a theory.
The other is discovery of further implica- 
tions of a given theory so that its accept- 
ability can be more reliably judged.

The generality or universality of the
theoretic problems addressed by verbal 
learners is not subverted by the scientific 
dictum that theories must be stated and 
evaluated in as cold precision as possible. 
Situation A and Behaviors B and C become 
controllably represented in the A-B, A-C 
transfer paradigm. The theories we seek 
to clarify and evaluate are stated in terms 
of such paradigms. The linking evaluative
presumption is that any theory that is
inadequate for our research situations is
inadequate [...] general learni[...]
distinction: [...] only be inadequate
for general phenomena, but irrelevant as
[...] This observation emphasizes the
[...] rejection and suggests [...]
modification and refinement [...] best
proceed in a denial instead of a
confirmatory mode.

In one way or another, any theory of
[...] poses dependency relations of
some sort. In the transfer situation,
learning to emit Response C to Stimulus A
entails, by definition, learning not [...]
Response B to Stimulus A. [...]
purports to explain how [...]
manages such a problem must therefore
[...] processes or mechanisms that
interrelate B-emitting and C-emitting cap-
abilities or probabilities. What is interest-
ing is that verbal learning theories differ
in how this interdependence shall be 
represented. One of the purposes of this 
paper is to explicate as carefully as possible 
the locus of assumed dependency relations 
in two standard theories, associative inter- 
ference theory and list differentiation 
theory. Another purpose is to draw atten- 
tion to certain relevant independent re- 
trieval phenomena to the end that these 
two theories might be refined or sharpened 
or simply discarded, either by a presently 
decidable verdict or through subsequent 
research that is sensitized to dependency 
assumptions (whether explicit or implicit). 
The final purpose of this paper is to apply
this dependency approach to the rather 
hectically emerging stimulus encoding anal- 
yses of learning. 


ASSOCIATIVE INTERFERENCE THEORY 


The theoretic position of longest standing 
in verbal learning is associative interfer- 
ence theory. There are, however, several 
separable components of this theory, so it 
will be necessary to tread carefully through 
the various and intertwining assumptions 
that have come to characterize the general 
position. The easiest way to begin seems 
to be to outline the theory as it was 
developed and modified. 

The origin of associative interference
theory is traceable to G. E. Müller, who 
introduced the ideas of associative inhibi- 
tion (Müller & Schumann, 1894) and 
retroactive inhibition (Müller & Pilzecker, 
1900). The former idea is that learning 
Behavior C in Situation A (A-C learning) 
is more difficult after having already learned 
Behavior B in Situation A (A-B learning), 
the idea of negative transfer; the latter, 
that subsequent performance of Behavior 
B is poorer after interpolated A-C learning, 
the idea that A-C learning retroactively 
interferes with what was learned originally. 
Both ideas center on interference between 
associations that compete to produce the 
behavior in question, the Responses B 
and C. 

Webb (1917) combined these two ideas
(somewhat modifying the first so that it
was reciprocal between A-B and A-C)
into a two-factor theory of retroaction:
(a) In transferring from original to inter- 
polated learning in the A-B, A-C paradigm, 
interpolated A-C learning causes partial 
disorganization of what was learned in 
original A-B learning. (b) At the time of 
relearning A-B, the presence of Stimulus A, 
which was common to original and inter- 
polated learning, serves to reinstate the 
interpolated as against the original associa- 
tion. Thus interpolated A-C learning was 
thought to weaken the original association 
between A and B and to provide the 
Response C as competition against what- 
ever likelihood remained that Response B 
might be emitted. 

McGeoch (1933a, 1933b, 1936), however,
saw Webb's two factors as alternative,
competing versions of a single-factor theory. 
He opted for the latter factor as sufficient. 
McGeoch's thesis was that after A-B, 
A-C learning, the Stimulus A can evoke 
both B and C but that only either B or C 
will be emitted according to the dominance 
relation between them; A-C learning does 
not weaken the A-B association. In his 
text, McGeoch (1942) summarized his
position as follows:

Inhibition is not thought of as a separate process,
but as a function of competition among responses,
with a resultant momentary dominance . . . of one
response over another. Responses thus inhibited are
not necessarily lost from the subject's repertoire, but
are kept by other responses from appearing [p. 495].

In keeping with Postman's (1961, pp.
152-153) admonition, we shall label this 
view the independence-dominance hypothesis 
— independence of the A-B and A-C 
associations, with momentary dominance 
at the time of testing as the only cause of 
retroaction. 

An implication of McGeoch’s position 
is that the frequency of intrusion of the 
interpolated response (the C) during a 
subsequent test for what is remembered 
of original learning should correspond to 
the amount of observed retroaction. Melton 
and Irwin (1940), however, in an experi- 
ment wherein different amounts of retroac- 
tion were generated by varying the degree 
of interpolated learning, discovered that 
intrusion-error frequency and amount of 
retroaction did not covary as seemingly
required by McGeoch. The form of
discrepancy suggested unlearning of the
original A-B association during interpo- 
lated A-C learning. The impact of this 
result was immediate, and consequently 
Webb's first factor was reinstated as 
necessary. 

Underwood (1948a) observed that after 
having learned two associations, or rather 
lists of associations, in the A-B, A-C 
paradigm, the number of correct B
responses on the first trial of A-B relearning
was greater when 48 hours had elapsed
between interpolated learning and
relearning than when only 5 hours had elapsed.
This effect he likened to spontaneous
recovery as observed in conditioning
experiments when a rest is introduced after
extinction. Underwood (1948b) proposed 
that 


unlearning is to be conceived of as analogous to the 
extinction of a conditioned response. Like extin- 
guished responses unlearned verbal associations 
recover some of their response-strength with the 
passage of time [p. 420].

Thus in contradistinction to McGeoch's
independence-dominance hypothesis, we
have the Melton-Underwood
unlearning-recovery hypothesis.

The principal source of contention
between these two hypotheses can be seen
by asking the following question (which will
take on more importance later): Can a
given stimulus (A) have associated to it
two different responses (B and C)? The
independence-dominance hypothesis
answers with an unqualified "yes." The
pure unlearning hypothesis answers with
an unqualified "no." The combined
unlearning-recovery hypothesis is equivocal
in that the initial answer is "no," but if
the A-B association is not completely
obliterated in A-C learning, it will recover
over time, thereby producing a
competition-dominance situation that McGeoch
said holds throughout.

The unlearning-recovery hypothesis for
a while superseded the
independence-dominance hypothesis in general acceptability
because the latter was difficult to relate to
the sometimes-observed phenomenon of
spontaneous recovery. This is because
the temporal course of the dominance
relation was not specified. The
unlearning-recovery hypothesis also was seen as
relevant to the proaction phenomenon, the
fact that prior A-B learning can interfere
with emission of Response C after A-C
learning. Since at one time it seemed
that proactive effects were minimal in an
immediate test after A-B, A-C learning,
but increased with test delay, the recovery
of unlearned A-B associations would do
nicely as a workable explanation.

The modern conglomerate known as
associative interference theory may be
summarized as follows: A-B learning produces
an A-B association; A-C learning produces,
concomitantly, an A-C association and a
weakening of the original A-B association;
as time elapses, the A-B association
recovers some of its strength; upon
presentation of Stimulus A, the Responses B and
C compete for emission in accordance with
their relative associative strengths. This
general view, then, is a mixture of the
views of McGeoch, of Melton, and of
Underwood.

The critical dependency relation in this 
theory is the assumption that in forming a 
specific A-C association, there is a con- 
comitant weakening of the corresponding 
A-B association. The problem is to test 
this critical assumption as directly as 
possible.

Suppose A-B and A-C learning are
paired-associate tasks where each task
consists of associating pairs of verbal
units. At some point in time after
completion of interpolated A-C learning, the
learner is presented with each A stimulus
and asked to respond with both of the two
units previously associated with each
stimulus. To some of the stimuli he may
recall both responses $(B, C)$; to others,
only one, $(B, \bar{C})$ or $(\bar{B}, C)$; and to some,
neither $(\bar{B}, \bar{C})$. If learning an A-C
association entails unlearning the
corresponding A-B association, and if the
likelihood of recalling C indexes the A-C
association strength, then we must expect
the following inequality:
$P(B \mid C) < P(B \mid \bar{C})$; that is,
it must be that recall of B is less likely
when C is recalled than when C is not
recalled. All of the following
inequalities express this same dependency
relation:

P(C|B) < P(C)
P(B|C) < P(B)
P(B&C) < P(B)P(C).


The last, for example, says that the likeli- 
hood of the learner recalling both B and C 
to Stimulus A is less than what would be 
expected were the A-B and A-C associa- 
tions independent of or noninteractive with 
each other. In the event that correspond- 
ing A-B and A-C associations are in fact 
independent and noninteractive, all of the 
above inequalities would have to be rewrit- 
ten as equalities to accurately describe the 
data. 
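
The equivalence of these inequalities can be made explicit with the law of total probability. The derivation below is a short sketch, assuming only that $0 < P(C) < 1$ and $P(B) > 0$ so that every conditional probability is defined.

```latex
\begin{align*}
P(B) &= P(B \mid C)\,P(C) + P(B \mid \bar{C})\,\bigl(1 - P(C)\bigr), \\
P(B) - P(B \mid C) &= \bigl(1 - P(C)\bigr)\,
      \bigl[\,P(B \mid \bar{C}) - P(B \mid C)\,\bigr].
\end{align*}
% Because 1 - P(C) > 0, both sides have the same sign, so
%   P(B | C) < P(B | \bar{C})   if and only if   P(B | C) < P(B).
% Multiplying P(B | C) < P(B) by P(C) gives P(B & C) < P(B) P(C),
% and dividing that by P(B) gives P(C | B) < P(C).
```

Replacing each "<" by "=" in the same steps gives the independence case, which is why a single comparison of P(B&C) with P(B)P(C) suffices as a test.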

An experiment by Abra (1969) provides 
a very nice set of data for testing the 
dependency relation that characterizes 
associative interference theory. Abra's
subjects learned 10 pairs of verbal units in 
the A-B, A-C paradigm. The stimuli 
were consonant-vowel-consonant nonsense 
syllables and the responses were common 
two-syllable adjectives. These two lists, 
A-B and A-C, were each learned to a 
criterion of one perfect recitation. Then, 
either immediately or 24 hours later, the 
subjects were given the modified-modified 
free recall (MMFR) test (Barnes & Under- 
wood, 1959). This test consists of present- 
ing to the subject each stimulus (A) and 
requiring the subject to recall as best he 
can the two responses (B and C) previously 
associated with the presented stimulus. In 
Table 1, the mean frequencies of the four 
types of recall are shown (both responses; 
only one or the other; neither) when 
MMFR was immediate and when it was
delayed. Thus, out of 10 stimuli for the 
immediate MMFR, 4.3 of them resulted 
in recall of both responses; 1.1, in recall 
of only the B response; and so on. 

Over the 24-hour delay, the total number
of Bs recalled increased from 5.4 to 7.7,
indicating a substantial amount of
spontaneous recovery of the first-list responses.
At the same time, the total number of Cs
recalled decreased from 8.1 to 5.6. Thus
the Bs and Cs appear to interact in a
more-or-less pure competition-dominance
manner over the MMFR delay interval.

TABLE 1
MEAN RESPONSE FREQUENCIES

           |       Immediate MMFR        |        24-hour MMFR
Response   |  C   | $\bar{C}$ | Total    |  C   | $\bar{C}$ | Total
$B$        | 4.3  | 1.1       | 5.4      | 4.2  | 3.5       | 7.7
$\bar{B}$  | 3.8  | .8        | 4.6      | 1.4  | .9        | 2.3
Total      | 8.1  | 1.9       | 10.0     | 5.6  | 4.4       | 10.0

Note. Adapted from Abra, 1969.


At either of the two delays, however, there 
is complete independence of retrieval of 
the Bs and Cs. For immediate MMFR, 


P(B&C) = .43 
and 


P(B)P(C) = (.54)(.81) = .44; 
for 24-hour MMFR, 


P(B&C) = .42 
and 
P(B)P(C) = (.77)(.56) = .43. 


Thus instead of an inequality, we must
write, for both test delays, P(B & C)
= P(B)P(C).
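
The arithmetic behind this comparison can be sketched in a few lines of Python. This is only an illustration: the cell values are the mean frequencies out of 10 stimuli as read from Table 1 (the 24-hour "B only" cell follows from the marginal totals), and the names `TABLE_1`, `independence_check`, and `summarize` are made up for this example.

```python
# Mean recall frequencies out of 10 stimuli, as read from Table 1.
# Cell order: (both B and C, B only, C only, neither).
TABLE_1 = {
    "immediate": (4.3, 1.1, 3.8, 0.8),
    "24-hour": (4.2, 3.5, 1.4, 0.9),
}


def independence_check(both, b_only, c_only, neither):
    """Return (P(B&C), P(B)P(C)) for one 2 x 2 recall table."""
    total = both + b_only + c_only + neither
    p_b = (both + b_only) / total   # marginal probability of recalling B
    p_c = (both + c_only) / total   # marginal probability of recalling C
    return both / total, p_b * p_c


def summarize(table=TABLE_1):
    """Apply the check to every test delay in the table."""
    return {delay: independence_check(*cells) for delay, cells in table.items()}


if __name__ == "__main__":
    for delay, (joint, product) in summarize().items():
        print(f"{delay}: P(B&C) = {joint:.2f}, P(B)P(C) = {product:.2f}")
```

Under associative unlearning the first number in each pair should fall clearly below the second; here the two agree to within .01 at both delays.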

Clearly, in Abra’s experiment, delay of 
MMFR does not introduce dependence 
between B and C: B and C retrieval are 
no more interdependent after a 24-hour 
delay than in immediate MMFR. The 
trade off over 24 hours between Bs and Cs 
as revealed by the marginal totals in 
Table 1 (5.4 to 7.7 for the Bs, 8.1 to 5.6 
for the Cs) must be attributed, therefore, 
to distinguishable processes that increase B 
availability and decrease C availability 
quite separately—separately, that is, in 
the sense that B and C do not pairwise 
interfere with the retrievability of the 
other. 

The chi-squares for the two 2 × 2
subtables in Table 1 are .062 and .840,
respectively.³ With one degree of freedom,
they yield p > .80 and p > .30. Thus the
cell entries are completely predictable from
the marginal totals. (Strictly speaking,
such chi-squares may be illegitimate
inasmuch as for a given 2 × 2 table each
subject contributed to more than one cell.
Thus the independent-observations
assumption that underlies the chi-square test
was violated to the extent that there was
an interaction between subjects and items
in the experiment. But such a violation
can only inflate chi-square.)

³ These are likelihood ratio chi-squares, that is,
$\chi^2(1) = 2 \sum_{i=1}^{4} O_i \ln(O_i / E_i)$,
where $O_i$ is the observed cell frequency and $E_i$ is
the cell frequency expected under the independence
hypothesis. The entries in Table 1 are mean
frequencies.

FIG. 1. Theoretical (dashed) and obtained (solid)
cumulative distributions of chi-square statistics
from 25 tests of response independence in MMFR
tests (adapted from Greeno, 1969). [Axes: value of
chi square; cumulative proportion.]
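
For readers who want to run this kind of test, the following is a minimal Python sketch of a likelihood ratio chi-square for a 2 × 2 recall table, together with the one-degree-of-freedom tail probability. The function names are illustrative, not taken from any source. Note that the reported values of .062 and .840 were presumably computed on raw frequencies, so applying the function to the mean frequencies of Table 1 should not be expected to reproduce them.

```python
import math


def likelihood_ratio_chi_square(both, b_only, c_only, neither):
    """Likelihood ratio chi-square, 2 * sum(O * ln(O / E)), for a 2 x 2 table.

    Rows are B recalled / not recalled; columns are C recalled / not recalled.
    Expected frequencies E are built from the marginal totals (independence).
    """
    observed = [[both, b_only], [c_only, neither]]
    row_totals = [sum(row) for row in observed]
    col_totals = [sum(col) for col in zip(*observed)]
    n = sum(row_totals)
    statistic = 0.0
    for i, row in enumerate(observed):
        for j, o in enumerate(row):
            if o > 0:  # a zero cell contributes nothing to the sum
                e = row_totals[i] * col_totals[j] / n
                statistic += o * math.log(o / e)
    return 2.0 * statistic


def p_value_one_df(chi_square):
    """Upper-tail probability for chi-square with one degree of freedom."""
    # With one df, chi-square is the square of a standard normal deviate,
    # so P(X > x) = P(|Z| > sqrt(x)) = erfc(sqrt(x / 2)).
    return math.erfc(math.sqrt(chi_square / 2.0))


if __name__ == "__main__":
    for value in (0.062, 0.840):
        print(f"chi-square = {value}: p = {p_value_one_df(value):.3f}")
```

The tail probabilities for .062 and .840 come out above .80 and .30, respectively, in line with the values quoted in the text.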

This type of independent retrieval phe- 
nomenon was apparently discovered by 
DaPolito (1966). In three experiments, 
he demonstrated that competing responses 
were emitted independently of each other 
in the A-B, A-C learning paradigm. 
DaPolito also demonstrated that this 
independence of retrieval obtains whether 
the test is response recall (MMFR) or 
forced-choice recognition. Moreover, he 
reanalyzed the MMFR results of three 
standard retroaction experiments already 
in the literature, those of Koppenaal (1963), 
Postman (1964), and Postman and Stark 
(1965). In all of these experiments, retro- 
action of A-C learning against prior A-B 
learning was observed in subsequent 
MMFR tests. In the Koppenaal
experiment, the MMFR test was variously
delayed from the end of A-C learning for 
up to one week. In all instances, the 
independent retrieval phenomenon was 
observed. Thanks to the willing
cooperation of other researchers, the same has been
found true in the retroaction and
spontaneous-recovery experiments of Birnbaum
(1965), Ceraso and Henderson (1965), and
Silverstein (1967) (see Martin, 1969).⁴

A summary of 25 independent tests of 
retrieval independence is presented in 
Figure 1. The dashed line is the theoretic 
cumulative distribution of chi-square with 
one degree of freedom. The solid line is the 
cumulative distribution of 25 observed
chi-squares. This figure is from Greeno
(1969), who writes: 


In all, the 25 sets of data represent quite a variety 
of interference experiments, with a few cases of 
A-Br as well as A-C paradigms, verbal stimuli as 
well as pictures, and words and numbers as re- 
sponses. The various results also involve a wide 
range in the amounts of retroactive and proactive 
interference [pp. 17-18]. 


To my knowledge, there has been only
one experiment designed explicitly to
discover a boundary condition for this
independent retrieval phenomenon.
Wichawut and Martin (in press) varied the
amount of A-C learning in the A-B, A-C
paradigm. After learning A-B to criterion,
the subjects learned A-C pairs for either 4,
8, or 12 anticipation trials; then a standard
MMFR test was administered. It was
discovered that the only circumstance for
which P(C|B) < P(C) and P(B|C) <
P(B) obtained was where there were only
four A-C trials, and then only for those
pairs where there were fewest correct
anticipations in first-list learning. Otherwise
complete independence was again observed.
Thus it would seem that one limitation to
the independent retrieval phenomenon is
that dependence appears when both B and
C are barely retrievable. One might
speculate that independent retrieval
requires adequate representation in memory
of either the A-B or the A-C pair, but not
necessarily both.

⁴ Note that when recall of either the Bs or the Cs
is [...] perfect, the a priori likelihood of a
significant chi-square is reduced. The possibility of
this problem, however, does not arise in most of the
tests made; certainly not in those cases where
MMFR was delayed. Moreover, were it a problem
of any magnitude, the cumulative distribution of
observed chi-squares shown in Figure 1 would depart
from the theoretic cumulative distribution.

Thus the dependency relation that
characterizes associative interference theory
is patently denied by what we might call 
the DaPolito independent retrieval phe- 
nomenon: The assumption that in forming 
a specific A-C association there is a 
concomitant weakening of the correspond- 
ing A-B association is false. And given 
that this assumption is false, the further 
assumption that weakened associations 
recover over time does not make sense 
(unless it is postulated that A-B associa- 
tions are weakened on some basis other than 
associative interference from corresponding 
A-C associations). In short, the independ- 
ent retrieval phenomenon denies outright 
the unlearning-recovery hypothesis. But 
since there are several conversational ways 
to dilute this denial, some elaborative 
comments are in order. These comments 
will serve also to sharpen the claim of 
denial just made. 

What is meant by "unlearning" in the 
unlearning-recovery hypothesis is a weak- 
ening of an association. If unlearning is 
read simply as response unavailability, 
then what we have is only a statement of 
observable fact, the fact of retroaction, not 
an assumption that postulates an explana- 
tive process. Demonstrations of independ- 
ent retrieval have all involved relative 
unavailability of one or the other of the 
two sets of responses (Bs for retroaction, 
Cs for proaction). The independent re- 
trieval phenomenon denies associative un- 
learning because the idea of associative 
unlearning implies a conditional pairwise 
relation between B and C availability, but 
the phenomenon itself is that there is no 
such relation. Moreover, as if to finalize 
this conclusion, clear changes in the relative 
availabilities of the two sets of responses 
(as in spontaneous recovery) do not 
introduce pairwise dependencies between 
responses with a common stimulus.⁵

⁵ Melton (1961, pp. 183-184) expressed
reservations about the interpretative status of tests of the
unlearning hypothesis and called for pairwise
contingency analyses of the general type later carried
out by DaPolito. Earlier still, Runquist (1957)
demonstrated that there was no rank-order correlation
between A-B pair strength and A-C pair recall, even
though there was ample evidence of proaction.

The MMFR test was introduced by
Barnes and Underwood (1959) for the
express purpose of eliminating response
competition (reproductive inhibition) at 
the time of recall: Subjects are requested 
to give both responses (B and C), and 
because presentation of the A stimuli is 
self-paced by the subject, there is ample 
time to output the nondominant response 
after the dominant response has been 
emitted. Probably the best evidence that 
this test is indeed free of response competi- 
tion is the independent retrieval phenom- 
enon. Pairwise associative unlearning and 
response competition work in the same 
direction, that is, both work toward the 
inequality P(B & C) < P(B)P(C). Since 
this inequality does not hold, both sources 
of its expectation can be dismissed together. 
Thus there can be no question of the 
MMFR test as possibly masking
observation of the dependency relation assumed
by associative interference theory. 

On the basis of the preceding comments, 
one might overlook the relevance of 
McGeoch's independence-dominance hy- 
pothesis. McGeoch's hypothesis seemingly 
revolves around response competition and 
the MMFR test does not involve response 
competition. The hypothesis was generally 
applied to test situations (e.g., relearning) 
where only one of the two responses could 
be given, which means that in such situa- 
tions the dominant response would be the 
only response emitted. In the MMFR 
test, both responses can be given. But 
even though response competition is 
eliminated in the MMFR, dominance 
remains amply evident. When MMFR is 
immediate, recall of the interpolated res- 
ponses (the Cs) precedes, in order of output, 
recall of the original responses (the Bs), 
and as MMFR is delayed, the precedence 
of interpolated responses diminishes and 
even reverses (e.g., Koppenaal, 1963;
Martin & Mackay, 1970). Thus McGeoch’s 
independence-dominance hypothesis is cer- 
tainly viable on the matter of dominance, 
even though McGeoch did not stipulate 
any general rules to govern dominance. 
Moreover, McGeoch called for independ- 
ence between the two associational systems 
(original, interpolated) and has been
overwhelmingly affirmed.

In summary, the independent retrieval
phenomenon is rejective of those aspects of 
interference theory based on common- 
stimulus-determined associative unlearning, 
but highly supportive of McGeoch's idea 
of associative independence. It looks as 
though learners do not disassociate Behav- 
ior B from Situation A when required 
subsequently to learn Behavior C in 
Situation A. We are, then, ostensibly left 
with the idea that indeed two different 
responses can be associated with a single 
stimulus. (The apparent counterevidence 
of Melton and Irwin, 1940, will be dealt 
with later when we consider stimulus 
encoding theories.) But if so, how does
the learner decide when to emit B and
when to emit C when Stimulus A is
presented? Does the notion of response
dominance help? Unfortunately, it was
little more than a notion in McGeoch's
hands; it does not stipulate how or when.
Indeed, if dominance were not reducible
to something less mindless, a subject
could never learn to emit C once B were
learned; and once C were learned, he would
be hard pressed to emit B with increasing
likelihood as time elapsed. Some
mechanism is required to explain observed
dominance relations.


LIST DIFFERENTIATION THEORY


That there is in fact no pairwise
contingency between B and C recall in the
A-B, A-C paradigm, but that there still is 
clear relative unavailability of Bs (retroac- 
tion) or Cs (proaction) as sets indicates 
that we must seek elsewhere besides specific 
stimulus-response associative interaction 
for an explanation of retroaction, proaction, 
and spontaneous recovery. A hypothesis 
of some standing that avoids the depend- 
ency defect of the unlearning-recovery 
hypothesis discussed above is that of list 
differentiation. This hypothesis also is 
intuitively attractive, has enjoyed a fair 
amount of success, and has a venerable 
history. In essence, it is a hypothesis that 
attempts to give theoretic substance to 
McGeoch's idea of response dominance.
Since it carries a retrieval dependency
implication, it falls into the purview of this
paper. As before, the easiest way to
explicate this hypothesis is to consider its
development. 

Thune and Underwood (1943) reported 
that after A-B, A-C learning, when the 
subject is presented with Stimulus A, he 
can often retrieve the B response but seems 
consciously to reject it for output purposes 
because he thinks it inappropriate. Under- 
wood (1945) speculated that retroaction 
might therefore be at least partly a matter 
of response suppression on the basis of 
some sort of appropriateness notion on the
part of the subject. 

Underwood and Schulz (1960) postulated
a response selector mechanism:

The selector mechanism, itself unexplained, is
assumed here as a fundamental behavioral law. . . .
In the verbal-learning situation, the selector
mechanism restricts the spew [output] to those response
units within the list [p. 144].

In other words, A-B learning involves
determination of a permissible response set,
{B}. Similarly, A-C learning involves
forming {C}. Negative transfer, then, is
attributable to the work of differentiating
Sets {B} and {C}. (Differentiation of
{C} against all possible responses in general
and forming A-C associations should be
no more difficult than the corresponding
processes in A-B learning under a pure
(no associative interference) list
differentiation theory.)

At the time of a test for retroaction, {B}
is seen as suppressed, hence the relative
unavailability of the Bs. The amount of
relative unavailability of the Bs is
postulated to depend on the degree of suppression
of {B}. As the test for retroaction is
delayed, this suppression dissipates, the
availability of the Bs returns, and
spontaneous recovery is observed.

Postman, Stark, [...] summarized this view:

Unlearning [...] of response [...] transfer and
[...] than [...] 689 [...] This response-set
interference reflects the dominance at the time of
recall of the second-list criteria of
selection [p. 691].⁶

⁶ One might note that Postman refers to
selectional criteria, an expression of the original
Underwood notion of appropriateness. To my knowledge,
no theorist has postulated that subjects actually
forget to which list a given response belongs;
indeed, Koppenaal (1963) and others have shown
that list identification accuracy falls only to 88%
with as long as a one-week delay of MMFR.

The idea of list suppression, or how
response dominance comes about, entails
response-set organization in memory: the
Sets {B} and {C} are memorial categories.
A great deal is known about the effects on
retrieval of such categorization (e.g., Cohen,
1963; Mandler, 1967). The essence of
these effects is that there are definite
retrieval dependencies. In a recall test,
let $B_n$ stand for recall of a B response as
the nth output item. Then from a
response-set-organization position, we must
predict $P(B_n \mid B_{n-1}) > P(B_n)$. Where $C_n$
is similarly defined, we must predict
$P(B_n \mid C_{n-1}) < P(B_n)$. Yet another way
to say the same thing is to write
$P(B_n \mid C_{n-1}) < P(B_n \mid B_{n-1})$;
that is, Bs should cluster
together and Cs should cluster together.

Such inequalities hold over a wide
variety of situations where experimental
and normative organizations are imposed
on recall tasks (ordinarily represented in
terms of clustering scores of some sort).
Accordingly, we have every reason to
expect that after A-B, A-C learning we
should observe them again were subjects
asked to free recall (stimuli not present)
all the Bs and Cs they can remember.

One might note that this theory has
developed within the limiting confines of
list-learning paradigms. Whereas
associative interference theory presumed relevance
for any learning situation, list
differentiation theory seems fairly bound to situations
wherein the to-be-learned behavior starts
out as an unintegrated set of behaviors that
become integrated only in the sense that
nonmembers of the set are more surely
excluded. The paired-associate-learning
paradigm dictates that the members of
this set can never become ordered, this
because the cuing stimuli for the individual
members appear always in random order.
If one were to apply this theory to a single
paired associate, he would be returned to
McGeoch's position exactly, except that
the word "dominant" would now read
"not suppressed." When stated in this
way, it is easier to see that the dependency
relation that characterizes list
differentiation theory is traceable to the intent of the
theory to apply to list-learning paradigms.

If list organization is one way to interpret 
McGeoch's notion of response dominance, 
then so is its logical alternative. We may 
again consider response sets, but this time 
as being stimulus determined instead of 
list determined. Thus instead of the Sets
{B} and {C}, we have the Sets $\{B_1, C_1\}$,
$\{B_2, C_2\}$, $\{B_3, C_3\}$, and so on, where the
subscript denotes a particular common 
stimulus. Under this organizational hy- 
pothesis, we must predict that in free recall 
of the responses, the Bs and Cs will cluster 
according to common stimuli, not according 
to list membership; that is, we must again 
predict retrieval dependencies, but this 
time on the basis of stimulus-defined 
response sets. 

Unfortunately, very little research has 
been done to test these views. What little 
exists, however, indicates that the retrieval 
dependencies predicted by the list organiza- 
tion hypothesis do not exist. Martin and 
Mackay (1970) brought two lists of nine 
letter-word pairs to criterion in the A-B, 
A-C paradigm and then gave the subjects 
two successive free recall tests for the 
responses (without the stimuli present) 
either immediately or after a 30-minute 
delay. There was substantial retroaction 
and spontaneous recovery in the ordinary 
sense of unconditional availability of the 
B responses, but cluster analyses on the 
basis of list membership revealed no list 
organization of the responses in either the 
first or the second of the successive free 
recalls at either recall delay. Thus the 
retrieval dependencies predicted by the 
list differentiation hypothesis do not ob- 
tain; rather, we see another instance of 
independent retrieval, this time of the form
$P(B_n \mid B_{n-1}) = P(B_n)$ and
$P(B_n \mid C_{n-1}) = P(B_n)$.
Had the hypothesis been true,
the subject should have been more likely 
to remain within lists (original, interpo- 
lated) than to switch lists at any given 
point during the course of his recall. In
fact, however, he remained within and
switched between lists in accordance with
a statistical independence hypothesis. 
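
The list-organization prediction can be checked on a single free-recall protocol with a short Python sketch. This is an illustration only, not the cluster analysis Martin and Mackay (1970) actually ran: the function name and the "B"/"C" coding of the output order are assumptions made for the example, and P(B_n) is taken over output positions 2 onward so that it is comparable with the conditional probabilities.

```python
from collections import Counter


def list_transition_probabilities(recall_order):
    """Estimate P(Bn|Bn-1), P(Bn|Cn-1), and P(Bn) from one recall protocol.

    recall_order is a sequence of "B" / "C" labels giving the list membership
    of each recalled response in output order (at least two items).
    """
    if len(recall_order) < 2:
        raise ValueError("need at least two recalled items")
    transitions = Counter(zip(recall_order, recall_order[1:]))
    after_b = transitions[("B", "B")] + transitions[("B", "C")]
    after_c = transitions[("C", "B")] + transitions[("C", "C")]
    later_items = recall_order[1:]
    return {
        "P(Bn|Bn-1)": transitions[("B", "B")] / after_b if after_b else None,
        "P(Bn|Cn-1)": transitions[("C", "B")] / after_c if after_c else None,
        "P(Bn)": sum(1 for item in later_items if item == "B") / len(later_items),
    }


if __name__ == "__main__":
    # A hypothetical, perfectly list-clustered protocol.
    print(list_transition_probabilities(list("BBBBCCCC")))
```

List differentiation predicts P(Bn|Bn-1) above P(Bn) and P(Bn|Cn-1) below it, as in the clustered example; the independence result corresponds to all three estimates being approximately equal.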

On the other hand, the retrieval depend- 
encies predicted by the stimulus-determined 
version of the response-organization idea 
were variously apparent. There was signif- 
icant stimulus-determined clustering in 
both 30-minute recalls and in the second 
immediate recall. Thus if $B_{i,n}$ is one of
the learned responses to Stimulus $A_i$ and
is the nth output item in recall, then what
was observed were the dependencies

$P(B_{i,n} \mid C_{i,n-1}) > P(B_{i,n})$
and
$P(B_{i,n} \mid C_{i,n-1}) > P(B_{i,n} \mid C_{j,n-1})$,

where $j \neq i$.
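
One simple way to look for stimulus-determined clustering in a recall protocol is to count adjacent outputs that share a stimulus and compare that count with its chance expectation. The Python sketch below is a hypothetical illustration, not the measure used in the study; it assumes each response is recalled at most once and uses the fact that two particular items are adjacent in a random ordering of N items with probability 2/N.

```python
from collections import Counter


def stimulus_clustering(recall):
    """Return (observed, expected) counts of adjacent same-stimulus outputs.

    recall is a list of (stimulus_id, list_label) tuples in output order,
    for example (3, "B") for the first-list response to stimulus 3.
    """
    if not recall:
        return 0, 0.0
    observed = sum(
        1 for (s1, _), (s2, _) in zip(recall, recall[1:]) if s1 == s2
    )
    per_stimulus = Counter(stimulus for stimulus, _ in recall)
    # Stimuli for which both the B and the C response were recalled.
    complete_pairs = sum(1 for count in per_stimulus.values() if count == 2)
    # Chance expectation under a random output order.
    expected = 2 * complete_pairs / len(recall)
    return observed, expected


if __name__ == "__main__":
    protocol = [(1, "B"), (1, "C"), (2, "C"), (2, "B"), (3, "B"), (4, "C")]
    print(stimulus_clustering(protocol))
```

An observed count well above the expected count, aggregated over subjects, would correspond to the dependency stated above.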

While it is certainly true that the results 
just described are not on as extensive a 
footing as the independent retrieval phe- 
nomenon of DaPolito, they nevertheless 
illustrate a definitive route toward theory 
refinement. If these results are reliable, 
then in an interference paradigm like A-B, 
A-C, the dominance relation between B 
and C cannot be a matter of list-defined 
interaction between sets of responses. It 
looks as though subjects form Sets $\{B_1, C_1\}$,
$\{B_2, C_2\}$, $\{B_3, C_3\}$, and so on, instead of
Sets {B} and {C}. The implication is
that learned behaviors tend to organize 
themselves more in terms of the stimulus 
features common to their past elicitation 
than in terms of when (e.g., in what list) 
they were elicited. 

At first light it may seem that we have 
approached a dilemma: In the first section 
we concluded that retroaction (and proac- 
tion and spontaneous recovery) cannot be 
attributed to pairwise associative interfer- 
ence. In this section we come upon the 
real possibility that the Bs and Cs are 
organized in memory according to common 
stimuli, as opposed to list membership. If 
list membership is not to be the answer to 
the problems of associative interference 
theory, that is, if after all we find that 
response pairs cluster by stimuli yet do 
not gain their relative dominance through 
the associative interference that once was 
thought a property of the triad of two
responses and one stimulus, what then?
For a given stimulus A in the A-B, A-C
paradigm, our apparent status is that we
have B and C clustering together instead
of the B with other Bs and the C with 
other Cs, we have a clear dominance 
relation between B and C (the phenomena 
of retroaction and proaction), and we have 
an absence of interference between the 
A-B and A-C associations. 


AN IMPLICATIVE DIGRESSION 


Theory in the verbal learning area has 
commonly dealt with three basic concepts— 
the stimulus, the response, and a condi- 
tional relation between them known as an 
association. The association has, since 
almost forever, been a central principle of 
mental organization, and associative inter- 
ference theory may be considered the high- 
water mark of theorizing on this principle. 
The list differentiation theory was thus a 
shift in emphasis to another of the three 
basic concepts; it was an attempt to 
answer the growing difficulties associative 
interference theory was having with (for it) 
anomalous experimental results. But it
too appears inadequate when one searches
in data for certain predicted retrieval 
dependencies and cannot find them. 

It is interesting, and implicative for 
considerations we shall take up shortly, 
that these two theories belong to a certain 
class of theories and that this class of
theories is denied as a class from another
quarter. They are both two-stage theories
where the first stage pertains to response
learning and the second, to association
formation. This idea of two stages prob-
ably began with Thorndike (1932): "Other
things being equal, connections are easy
to form in proportion as the response is
available [p. 342]" and "much of learning
consists in making certain responses more
available [p. 347]." Mandler (1954)
carefully distinguished these two stages in
an analytic fashion; and McGuire
demonstrated experimentally the explana-
tive power of Mandler's analysis. The
idea that the response learning and asso-
ciation formation stages are temporally
ordered was clearly stated by Underwood,
Runquist, and Schulz (1959):


Paired associate learning of verbal lists can be
divided log The response- 
learning phase is conceived of as the learning re- 
quired to make the responses readily recallable. 
This first phase must necessarily precede the second 
phase, the associative phase, since the response must 
be available before it can be associatively connected 
(second phase) to a specific stimulus . . . [p. 70].

Another standard source for this hypothesis
is Underwood and Schulz (1960, p. 92). 
From this point of view, the list differen- 
tiation hypothesis is a hypothesis that 
attempts to explain retroaction and sponta- 
neous recovery in terms of the response 
learning (availability) stage, for it postu-
lates the formation of Response Sets $\{B\}$
and $\{C\}$, the suppression of $\{B\}$ as part
of the job of forming $\{C\}$, and the sub-
sequent recovery of $\{B\}$.

An immediate implication of this view 
is that if somehow or other the two stages 
can be reliably identified, it must be that 
experimental variables that pertain to 
responses will affect the first stage, that 
experimental variables that pertain to 
stimuli will not affect the first stage, and
that both kinds of variables will affect 
the second stage. Just such an argument 
has been made and tested by Greeno (1970). 
His tests were of two kinds. 

The weaker test rested on the idea that 
the number of errors (in paired-associate 
acquisition) before the first correct response 
should index the duration of the first
stage and that the number of errors after
the first correct response should index the
duration of the second stage. An experi-
ment by Humphreys and Greeno (1970)
provided the relevant data. They varied
stimulus and response difficulty orthogon-
ally in a between-subjects factorial design.
The result was that the duration of the 
first stage was affected jointly (and about 
equally) by stimulus difficulty and response 
difficulty, and that the duration of the 
second stage was affected primarily by 
stimulus difficulty. 
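
The two error counts used in this weaker test can be sketched as follows. This is only an illustration of the indexing idea: the function name and the representation of an item's trial history as a list of booleans are assumptions made here, not the scoring procedure of Humphreys and Greeno (1970).

```python
def stage_error_counts(trials):
    """Split one item's errors at its first correct response.

    `trials` is a sequence of booleans for a single paired-associate
    item, True meaning a correct anticipation on that trial (an assumed
    data layout). Returns a pair:
    (errors before first correct, errors after first correct).
    """
    trials = list(trials)
    if True not in trials:
        # The item was never correct: all errors fall before a first
        # correct response that has not yet occurred.
        return len(trials), 0
    first = trials.index(True)
    errors_after = sum(1 for correct in trials[first + 1:] if not correct)
    return first, errors_after


# Hypothetical item: two errors, a correct response, one later error.
before, after = stage_error_counts([False, False, True, False, True, True])
```

Averaging the first count over items within each difficulty condition would give the first-stage index, and averaging the second count would give the second-stage index.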

In the stronger test, Greeno fitted to the 
Humphreys-Greeno data a Markov model 
(Greeno, 1968; Humphreys & Greeno, 
1970) that accurately describes a large 
number of detailed conditional and un- 
conditional features of the data. One of 
the parameters of this model is definitely 
identifiable with the first stage of learning,
and the effects of stimulus and response 
variables in the second stage can be clearly 
established through parameter invariance 
tests. The results are impressive: The 
first stage is jointly affected by stimulus 
difficulty and response difficulty, and the 
second stage is affected only by stimulus 
difficulty. 

These same results were also observed 
in another experiment wherein different 
materials were used to manipulate stimulus 
and response difficulty (Greeno, 1970, 
pp. 273-275). 

The preceding two sections of this paper 
and the results of Greeno's analysis may 
be summarized as follows: Not only do the 
associative interference and list differentia- 
tion theories predict, in their own domains, 
retrieval dependencies that do not exist, 
they are members of a particular class of 
two-stage theories that predict an ordered 
pattern of stage responsiveness to experi- 
mental variables that does not obtain in 
reality. Given a general commitment to 
the stimulus-response-association form of 
analysis of verbal learning, it is interest- 
ing that neither of these two theories 
includes a stimulus analysis and that 
Greeno's work indicates that one of the 
stages is primarily, if not totally, sensitive 
to stimulus variables.


STIMULUS ENCODING THEORY 


Theoretic attention to the problem of 
how to represent the stimulus in verbal 
learning situations has, until recently, been 
sketchy at best. The general attitude has 
been something of the sort "a stimulus is 
a stimulus." Current sensitization to 
stimulus representation problems has prob- 
ably arisen from so many different sources 
that a reasonable history is out of the 
question at this early stage. Nevertheless, 
some background comments will be useful 
in establishing the importance of this 
aspect of verbal learning theory, after 
which we can more sensibly discuss some
relevant independent retrieval phenomena.

An undeniable feature of performance in 
a learning situation is its apparent variabil- 
ity. Indeed, for some theorists (e.g., 
Dashiell, 1937; Hull, 1930) this variability
has been a critical sine qua non for learning.
In general, however, the problem has been
how to give this variability a workable
theoretic representation. Hull's (1943,
Ch. XVII) solution was to postulate a
gratuitous randomly oscillating inhibitory
process (${}_{S}O_{R}$) that interposed between
what was stored in memory (${}_{S}H_{R}$) and the
response execution apparatus. As against 
this rather arbitrary solution, Estes (1955a, 
1955b) proposed that performance variabil- 
ity arises from a probabilistic relation 
between the learner and the stimulus 
situation. The idea is that over nominally 
identical presentations of the same stimulus 
event, the learner sometimes perceives one 
subset of component features and at other 
times, other subsets. Estes' original notion
was that subsets were all equally likely to 
be sampled (or rather, all subsets of any 
given size were equally likely). Over the 
years, this sampling view was gradually 
modified so that by 1968 we find Trabasso 
and Bower arguing for selective sampling 
that involves progressive focusing with
practice. 

From another point of view, Rock (1962)
has recently reminded us of, or introduced
us to (as the case may be), the analyses of
Höffding (1891) and Köhler (1940). Höff-
ding conceptualized the associative relation
between two external events, like S and R,
as involving the associated memory traces
s and r: S → s → r → R. Höffding's mes-
sage might currently be read as follows:
Our theorizing has traditionally been a
game played with the conditional prob-
ability P(R|S). But for the subject to
emit R, the trace r must be activated; for
r to be activated, s must be activated; and
for s to be activated, the nominal stimulus
S must somehow make contact with its
particular prior functional representation
s. To the extent that there is any un-
certainty about how nominal S will be
encoded, the conditional probability P(s|S)
is critical; for unless the specific s to which
r is associated is activated, r, and hence R,
will not be forthcoming. Thus Höffding's
concern, like Estes’, was with variability 
in the perceptual relation between the 
learner and his learning situation. 
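
The dependence of P(R|S) on the encoding step can be made concrete with a small sketch. The function name, the dictionary layout, and every probability value below are hypothetical, chosen only to show how the chain S → s → r → R multiplies out; nothing here is an estimate from data.

```python
def response_probability(p_encoding, p_trace_given_encoding,
                         p_overt_given_trace=1.0):
    """P(R|S) when R can occur only through the chain S -> s -> r -> R.

    p_encoding: maps each functional encoding s to P(s|S).
    p_trace_given_encoding: maps an encoding s to P(r|s); encodings
        that are not associated with r are simply absent (taken as 0).
    p_overt_given_trace: P(R|r), taken as 1.0 unless stated otherwise.
    """
    p_trace = sum(
        p_s * p_trace_given_encoding.get(s, 0.0)
        for s, p_s in p_encoding.items()
    )
    return p_trace * p_overt_given_trace


# Hypothetical case: nominal S is encoded as s1 on 60% of presentations,
# and only s1 is associated with the trace r.
p_encoding = {"s1": 0.6, "s2": 0.4}
p_trace_given_encoding = {"s1": 0.9}
p_r_given_s = response_probability(p_encoding, p_trace_given_encoding)
```

However strong the association between s1 and r, the overall response probability in this sketch cannot exceed P(s1|S), which is the sense in which the encoding probability is critical.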
The domain of multiple, and hence
variable, encodings of a nominally unique
stimulus is largely unexplored. A proper
exploration, however, might provide an
answer to the seeming paradox of
McGeoch's well-supported contention that 
B and C are independently and without 
mutual interference associable with a com- 
mon nominal stimulus in the A-B, A-C 
paradigm and the seeming contradiction 
of the Melton-Irwin and the stimulus- 
determined clustering results. 

If xyz-2 and wyz-9 are two paired
associates to be learned in the same 
experimental task, it is obvious that a 
rational subject will associate 2 with x and 
9 with w. In an important and germinal 
paper, Underwood (1963) pointed out that 
learners will sensibly disregard those compo- 
nents of composite stimuli that are non- 
essential to the task at hand. Postman 
and Greenbloom (1967) and several others 
have demonstrated that, in essence, sub-
jects will seldom make use of all the
distinguishing features of stimuli if criter-
ional performance does not require it.
These are instances in which subsequent
experimental tests have shown that learners
isolate particular and definable components
of a nominal stimulus for use as cues for
adaptive overt behavior. It is for such
circumstances as these that I shall shortly
cite some implicative retrieval dependency
data.

In distinction to these explicit selectional
results, there is the case where . . . or
cognitive . . . only one of . . . for example, that
the signal /stil/ is presented . . . stimulus S.
Let $s_i$ . . . interpretation by the sub-
ject . . . $s_1$ is steel and $s_2$ is steal. From
the work of Galbraith and Taschman . . . we
know normatively that . . . = .63. A hypothesis
. . . an elicitative, associative . . . predict . . .
subsequent visual recognition . . . will
positively covary with recognition of STEAL.
A hypothesis that $s_1$ and $s_2$ are mutually in-
hibitory must predict that recognition of
STEEL will negatively covary with recogni-
tion of STEAL. An independence hypothesis
predicts that once steel and steal are entries
in immediate memory, their retrieval is 
independent. Such dependencies are testa- 
ble and must certainly be implicative for 
theories about how semantic and associa- 
tive spaces are organized. 
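
How the three hypotheses might be told apart in recognition data can be sketched as follows. The record layout, the function names, and the fixed tolerance are all assumptions made for illustration. A real analysis would replace the tolerance with a proper test of association.

```python
def recognition_covariation(records):
    """Return P(both) - P(STEEL) * P(STEAL) for paired recognition outcomes.

    `records` is a list of (steel_recognized, steal_recognized) boolean
    pairs, one per observation (an assumed layout). The value is zero
    under independence, positive under an elicitative relation, and
    negative under mutual inhibition.
    """
    n = len(records)
    p_steel = sum(1 for steel, _ in records if steel) / n
    p_steal = sum(1 for _, steal in records if steal) / n
    p_both = sum(1 for steel, steal in records if steel and steal) / n
    return p_both - p_steel * p_steal


def classify(delta, tolerance=0.05):
    """Label the direction of dependency; the tolerance is arbitrary."""
    if delta > tolerance:
        return "positive covariation"
    if delta < -tolerance:
        return "negative covariation"
    return "independence"


# Hypothetical data in which the two recognitions are unrelated.
unrelated = ([(True, True)] * 25 + [(True, False)] * 25
             + [(False, True)] * 25 + [(False, False)] * 25)
label = classify(recognition_covariation(unrelated))
```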

Also within the general category of im- 
plicit stimulus encoding is the intuitively 
satisfactory result first reported by Under- 
wood (1965). The nominal stimulus TABLE 
is associatively elaborated to include chair 
so that when later CHAIR is presented in a 
recognition test, it is falsely recognized as 
having been previously presented. But 
the question that has not been addressed is
whether recognition of TABLE and CHAIR
is stochastically dependent. Is chair
memorially substituted for table so that
recognition for TABLE is dependently sup-
pressed? Is recognition for TABLE en-
hanced because chair was associatively
addended to table? Or is it the case that
once chair is evoked as a memorial image,
table and chair are independent entities in
immediate memory? Inasmuch as this
paper is about retrieval dependencies or 
the absence of such dependencies vis-à-vis 
verbal learning theories, this seems the 
place to insist that their delineation is 
critical, not only to a proper evaluation of 
current theory but also to a satisfactory 
approach to new theory. 

Let us tentatively assume that stimulus 
selection phenomena (e.g., Postman & 
Greenbloom, 1967; Underwood, 1963) refer
to more or less external, attentional re- 
sponses to particular components of com- 
posite stimuli in a paired-associate task. 
The effectiveness of each of the components 
as an elicitor of the response has been de- 
termined in two ways, the subsequent 
power of an isolated component to elicit 
the response (e.g., Postman & Greenbloom, 
1967) and the subsequent differential power 
of the response to elicit backwardly some of 
the components (e.g., Hintzman, 1969).
While there is thus a clear case for stimulus 
component selection, no theoretic advance 
has been made if it turns out that the com-
ponents selected by the subject are them- 
selves mutually associated and bound 
together; for if they are, then the only 
result is that the observer (the experi- 
menter, the theorist) was mistaken as to 
what constituted a whole and indivisible 
stimulus. On the other hand, if it turns 
out that the individual components selected 
are not bound together, either associatively
or in some Gestaltish way, then the prob- 
lems raised in this paper become more 
tractable. 

Wichawut and Martin (1970) and
Martin* have demonstrated that stimulus
components do not interassociate with each
other. Wichawut began with two alterna-
tive hypotheses, namely, that if $A_1A_2A_3$ is
the composite nominal stimulus, then either
$A_i$ becomes independently associated with
Response B or the subjectively selected $A_i$s
become associated with each other and the
integrated package becomes associated with
B as a whole. 

In Wichawut's experiment, each $A_i$ and
B was a four-letter noun, and three dif-
ferent groups of 32 subjects each were given
three widely spread degrees of A-B learning
so they could variously do what they might
about stimulus encoding and organization.
(The lowest degree of learning was just
meeting an 8/8 criterion, that is, a single
perfect recitation of the eight-pair list. The
other degrees of learning entailed either 8
or 16 additional learning trials.) After
A-B learning, where A = $A_1A_2A_3$, the
subjects were presented with either $A_1$, $A_2$,
$A_3$, or B from each of the A-B pairs and
asked to provide the missing three
members.

If it is true that stimulus components 
become associated with each other during 
learning, then Component $A_i$ should elicit
Component $A_j$, $j \neq i$, with some proba-
bility. But there are two ways $A_i$ could
elicit $A_j$, either directly via an $A_i$-$A_j$ asso-
ciation, or indirectly via mediation through
the Response B, $A_i$-B-$A_j$. Accordingly,
when $A_i$ is the presented stimulus compo-
nent, there are two cases to consider: when
the subject recalls Response B and when he 


* E. Martin. Stimulus component independence. In preparation.


[Figure 2 plot. Vertical axis: $P(A_j \text{ or } A_k \mid A_i \text{ presented})$. Horizontal axis: degree of learning (8/8, 8/8 + 8, 8/8 + 16).]

FIG. 2. Proportion of times when at least one
stimulus component ($A_j$ or $A_k$) other than the one
presented ($A_i$) was recalled given recall (upper
curves, $B$) and nonrecall (lower curves, $\bar{B}$) of the
response, as a function of degree of learning. (Pa-
rameter is length of anticipation interval in seconds.)


fails to recall Response B. We assume that 
if he cannot recall B, then he cannot recall 
$A_j$ through the mediative chain $A_i$-B-$A_j$.
In Figure 2, the upper curve with filled
circles (the one with the number 2 beside
it) shows the proportion of times when in
addition to B being recalled, at least one
stimulus component ($A_j$ or $A_k$) other than
the one presented ($A_i$) was also recalled.
As is evident, when Response B was re- 
called, recall of other stimulus components 
increased with degree of A-B learning. On 
the other hand, the lower curve with open 
circles shows the proportion of times when 
at least one stimulus component other than 
the presented component was recalled when 
Response B was not recalled. As this curve 
indicates, Stimulus Component $A_i$ in no
way suggested its fellow components, $A_j$
and $A_k$, in the case where Response B could
not be recalled. Stimulus components
apparently do not associate directly with
each other; the retrievability of $A_j$ when
$A_i$ is presented must be mediated through
their common response, B.

Wichawut's experiment involved an ex- 
posure of two seconds of the compound 
stimulus in an anticipation learning pro- 
cedure. Mackay varied this exposure, 
allowing either one second or four seconds 
to study or do with each stimulus whatever 
subjects do with stimuli. He imposed two 
degrees of A-B learning, the 8/8 criterion 
and the 8/8-plus-8 conditions of Wichawut. 
His results are shown in Figure 2 by filled 
triangles for one-second and filled squares 
for four-second stimulus study times for the 
case where the Response B was recalled 
(upper curves), and by open triangles and 
squares, respectively, when the Response B 
was not recalled (lower curves). As can be
seen from the upper curves, the amount of
stimulus study time (1, 2, 4 seconds)
clearly affected the indirect, response-
mediated associations between stimulus
components. Equally clear, however, is
the fact shown by the lower curves: In the
absence of response recall, there is no re- 
trieval of other stimulus components. 

The subjects in these experiments knew 
nothing of the later components tests. And 
in perfect keeping with standard results, 
they differentially utilized the three com-
ponents of the stimuli during acquisition
(see the articles). Yet when their behavior
was analyzed for stimulus intercomponent
dependencies, none were to be found. Ir-
respective of degree of learning, these
subjects had no way of getting from one
stimulus component to another except via
the assigned response that was common to
all the components. The open-figure data
in Figure 2 are so consistently close to zero
that the idea of a mistake is probably un-
reasonable. If anything, independence
among stimulus components increased with 
degree of learning, an observation that is 
reminiscent of the boundary condition for 
the DaPolito independent retrieval phe- 
nomenon noted in the first section of this 
paper. 

These results require generalization; but
their initial implication is clear: Stimulus
components that are a priori independent
do not become interdependent or associated
through learning, at least not in a rote ex-
perimental setting. Mediation through a
common response can give the appearance 
of interdependence, but a proper analysis 
shows that after all it is not so. 
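
The "proper analysis" amounts to conditioning component recall on whether the common response was recalled. A minimal sketch follows, with an assumed record layout and invented counts that merely mimic the qualitative pattern of Figure 2; they are not the published data.

```python
def component_recall_by_response(trials):
    """Proportion of trials with another component recalled, split by B.

    `trials` is a list of (response_recalled, other_component_recalled)
    boolean pairs, one per test trial (an assumed layout).
    Returns (P(other | B recalled), P(other | B not recalled)); an entry
    is None when there are no trials of that kind.
    """
    def proportion(rows):
        if not rows:
            return None
        return sum(1 for _, other in rows if other) / len(rows)

    with_b = [t for t in trials if t[0]]
    without_b = [t for t in trials if not t[0]]
    return proportion(with_b), proportion(without_b)


# Hypothetical trials: components come back only when B does.
trials = ([(True, True)] * 6 + [(True, False)] * 4
          + [(False, False)] * 5)
given_b, given_not_b = component_recall_by_response(trials)
```

Pooling over the two cases would yield a nonzero overall rate of component recall, which is the appearance of interdependence; the split shows that all of it rides on recall of B.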

In the spirit of the inchoate nature of 
stimulus encoding theory, only a few 
evaluative comments are in order. For 
one, Estes' (1955a, 1955b) general idea that 
stimulus elements are, at some level of
analysis, independently processed into
memory seems correct. For another,
Underwood (1963) and Trabasso and
Bower (1968) are clearly correct that
selective (focus) sampling is what happens. 
(Estes' component-independence idea and 
the facts of selective sampling are not 
mutually contradictory as long as a given 
stimulus component is selected on some 
basis other than the subject having already 
selected or rejected some other component.) 
And certainly Höffding (1891) was justified
in his fears that stimulus identification is 
a risky affair (see Martin, 1968, for an 
elaboration of this point). While these 
conclusions hardly constitute a definitive 
picture of how stimuli get represented in 
memory, they are suggestive of a way out 
of some of the difficulties in verbal learning 
theory.


INTEGRATION


We noted at the outset three general 
questions: What is learned when a learner 
learns Behavior B in Situation A? How 
does some new, different Behavior C dis- 
place Behavior B in Situation A? And 
given Situation A at some later point in 
time, what will be remembered of Behaviors 
B and C? Probably one of these questions
is more important than the other two, and 
it would appear that it is the second ques- 
tion, the question of transfer, that is criti- 
cal. In fact, it is with respect to the junc- 
ture of A-B and A-C learning that the two 
theories discussed earlier generate their re- 
trieval dependency statements for what 
should happen in a later test for what is
remembered of A-B and A-C learning.
Moreover, an answer to this question must
take cognizance of some learning process 
that is central to both A-B and A-C 
learning. 
An opening answer to this question is
that the A in A-C learning is not the same
A as in A-B learning. For the purpose of
discussion, let $A_B$ and $A_C$ be the subjective,
functional versions of nominal A in A-B
and A-C learning, respectively. The criti-
cal problem is to specify how $A_C$ differs
from $A_B$ and how the difference comes about
in learning. 

Begin with the eminently sensible notion 
that Nominal Stimulus A is multifeatured 
(e.g., Anisfeld & Knapp, 1968; Bower, 
1967; Underwood, 1969; Wickens, 1970). 
Although stimulus selection research indi- 
cates that obvious, externally definable 
components comprise a share of the features 
in some experimental situations, they are 
by no means exhaustive of the list. Many 
other properties of a stimulus are also im-
portant, such as frequency and temporal point
of occurrence (Underwood, 1969) as well
as their modality of occurrence, class
membership, and so on (Wickens, 1970).
For a given Nominal Stimulus A, let A be
described by the features $a_1, a_2, \ldots, a_n$.
Moreover, assume that these features are
differentially salient, that is, not equally 
likely to be sampled (noticed, registered) 
by the learner. 

The features $a_i$ of nominal A may be
viewed as coordinates of a vector. Over
repetitions within the same task, as in A-B
learning, some of these coordinates will
change in value, others will remain con-
stant. For example, one of the $a_i$ may be
a frequency counter and hence change with
trials, another $a_i$ may be an index of recency
(Underwood, 1969). The $a_i$ that describe
the phonetic and graphemic natures of A 
probably will not change over trials. Given 
both that there can be variation in which 
features are sampled and that some features 
(coordinates) change value over trials, it 
follows that the functional representation 
of nominal A is different even between the 
beginning and end of A-B learning. 

But in the A-B, A-C paradigm, nominal 
A carries on throughout, and hence there 
may seem no way to make $A_C$ different
from $A_B$ in any way sufficiently distinctive
to account for transfer, retroaction, pro-
action, and spontaneous recovery. Two
interrelated proposals have recently been
made. The author (Martin, 1968) has
suggested that the sampling probability
distribution over the feature set $\{a_i\}$
changes when A-C learning is encountered
after A-B learning; that if $a_i$ is the most
salient feature in A-B learning, then some-
how $a_j$ becomes the most salient feature in
A-C learning. Learning was seen as asso-
ciating the currently prescribed response
to the most salient feature. Transfer was 
explicated in terms of the numerosity of 
alternative features: The more features de- 
scriptive of the stimulus, the more different 
(in terms of degrees of freedom) can the 
A-C distribution be from the A-B distri- 
bution. On this view, retroaction is a sub-
jective persistence of the feature sampling
bias of A-C learning.* Proaction is ac-
countable to there remaining a residual
likelihood of sampling the feature that was 
most dominant in original A-B learning. 
As the test (e.g., MMFR) is delayed from
the end of A-C learning, the feature- 
sampling distribution as biased by A-C 
learning was presumed to revert to its 
original (preexperimental?) shape, thus 
reducing the dominating likelihood of 
Response C over Response B (spontaneous 
recovery). 
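
A toy version of this account may help fix ideas. Everything in the sketch is an assumption introduced for illustration: the four-feature set, the probability values, and especially the linear mixing rule used to stand in for reversion of the distribution with delay. The 1968 proposal does not specify a functional form.

```python
def revert(biased, original, weight):
    """Mix the A-C-biased sampling distribution back toward the original.

    weight = 0 leaves the A-C bias intact; weight = 1 restores the
    original distribution. Linear mixing is an illustrative assumption.
    """
    return {f: (1 - weight) * biased[f] + weight * original[f]
            for f in biased}


def c_dominance(distribution, cue_b="a1", cue_c="a2"):
    """Sampling advantage of the C-linked feature over the B-linked one."""
    return distribution[cue_c] - distribution[cue_b]


# Hypothetical distributions: a1 was most salient in A-B learning,
# a2 is most salient at the end of A-C learning.
original = {"a1": 0.5, "a2": 0.2, "a3": 0.2, "a4": 0.1}
biased = {"a1": 0.1, "a2": 0.6, "a3": 0.2, "a4": 0.1}

# Dominance of C over B at increasing degrees of reversion.
dominance_by_delay = [
    c_dominance(revert(biased, original, w)) for w in (0.0, 0.5, 1.0)
]
```

In this sketch the advantage of C shrinks as the distribution reverts, which corresponds to spontaneous recovery of B. The residual probability of sampling a1 immediately after A-C learning corresponds to proaction.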

The soft spot in this theory is flagged by 
the word "somehow" in the phrase "some-
how $a_j$ becomes the most salient feature in
A-C learning." The second proposal
alluded to above suggests a way to differ-
entiate $A_C$ from $A_B$. Greeno (1970) has
proposed that the first stage of acquisition 
is storing in memory the stimulus-response 
pair itself, as a unit (as opposed to storing 
just the R). This proposal is immediately 
attractive for two reasons: First, it makes 
sense in view of the fact discussed earlier 
that the first stage of learning is affected by 
both stimulus and response variables (the 
source of Greeno's idea to start with). 
Second, by placing the stimulus and the 
response together as adjacent, interacting 

* This constitutes an interpretation of the gen- 
eralized response competition idea of Newton and 
Wickens (1956), which holds that subjects persist 
in their most recent behavior. Inasmuch as stimuli 
are the excuse for behavior, persistence is here seen 
as a matter of how the stimuli are encoded. 
components of a single entry into memory,
it makes the response a highly likely de-
terminer of which stimulus features will be
attended, selected, or emphasized. And
because there is no greater difference be-
tween A-B and A-C learning than the
difference between B and C, B and C them-
selves could be the primary source for the
difference between $A_B$ and $A_C$. If this is
how things work, then $A_B$ and $A_C$ should
be more surely distinguished to the extent 
that B and C belong to distinct classes of 
items, with the result that less retroaction 
will occur. Happily, this is true in fact 
(Birnbaum, 1968; Friedman & Reynolds, 
1967; Postman, Keppel, & Stark, 1965; 
Shulman & Martin, 1970). 

The second stage in a verbal learning 
task is establishing a retrieval route from 
the Nominal Stimulus S to the unitary 
memory entry, the stimulus-response pair 
(Greeno, 1970). In the A-B, A-C inter- 
ference paradigm, the Responses B and C 
contribute to the distinguishability of these 
retrieval routes, this through their in- 
fluence on emphases of the features of 
nominal A. Given that B and C are re- 
sponsible (at least in part) for the difference 
between $A_B$ and $A_C$, how can we explain
the DaPolito independent retrieval phe-
nomenon, the fact that B and C cluster
together instead of with other Bs and Cs, and
Melton's seeming counterevidence against
McGeoch's independence hypothesis?

Begin by transforming the Wichawut
stimulus component independence phe-
nomenon into a premise that applies to
stimulus features in general. To do so is
perfectly reasonable. For example, the
. . . feature G of the stimulus
GEORGE can hardly be associated with the
. . . feature of GEORGE, any
. . . signal /in/ . . . selectively
. . . encoding . . . the semantic
. . . Assume that


* A convincing example is the professor who may
at a given meeting of his Senate act either as a sub-
stantive participant or the parliamentarian.
. . . of him will determine which features of the
proceedings he selectively . . . and keeps track of.

* Beginning with Miller and Nicely (1955) and
Liberman (1957), there has been a theoretic under-

upon receipt of a stimulus, the subject
draws a random sample of the features that
describe that stimulus such that the paired
response is a determiner of the probability
distribution over those features. Then
three points can be argued:

1. If $A_{B_1}$ and $A_{B_2}$ are two stimuli in A-B
learning and $A_{C_1}$ is the A-C learning
version of $A_{B_1}$, then certainly $A_{B_1}$ will be
more similar to $A_{C_1}$ than to $A_{B_2}$ (in standard
paired-associate experiments where intra-
list similarity is minimized). Accordingly,
if in free recall (stimuli absent) of the B
and C responses the Response $B_1$ is re-
called, then via mediation through $A_{B_1}$ the
subject is more likely to follow up with
recall of $C_1$ than with $B_2$. Hence the
stimulus-determined clustering results of
. . . and Mackay (1970). (Of course,
the pictures for when intralist stimulus
similarity is high and when the paradigm
is A-B, C-D must be appropriately
different.)

2. To simplify matters for the sake of
argument, suppose Nominal Stimulus A
has features $a_1$, $a_2$, $a_3$, and $a_4$ only. Let
$A_B = a_1$ and $A_C = a_2$ as a result of selective
stimulus encoding in A-B and A-C learn-
ing, respectively. In the MMFR test,
suppose the subject takes a size-two random
sample of these features. If this sample
includes both $a_1$ and $a_2$, he will recall both
B and C; if $a_1$ and either $a_3$ or $a_4$, he will
recall only B; if $a_2$ and either $a_3$ or $a_4$, then
only C; if $a_3$ and $a_4$, then neither B nor C.
Since this sample is random, there will be
no retrieval dependency between Responses
B and C. Hence the DaPolito independent
retrieval phenomenon. The apparent dom-
inance of C over B in retroaction is dic-
tated solely by the shape of the probability
distribution over $\{a_i\}$. The amount of
proaction follows similarly, from the likeli-
hood that (in this example) $a_1$ but not $a_2$
will be sampled.

3. The generalized Melton-Irwin (1940) 
phenomenon is that overt intrusion of the 
interpolated Cs in a subsequent test for 


standing that speech perception depends on com-
binations of perceptually independent features
(probably articulatory, see Liberman, Cooper,
Shankweiler, & Studdert-Kennedy, 1967).

what is remembered of original A-B
learning first increases and then decreases
with degree of interpolated A-C learning,
while retroaction increases throughout.
In light of the point of view being argued, 
the explanation would be that with limited 
A-C practice, the subject is still, on occa- 
sion, retrieving the original A-B unit from 
memory—this as opposed to unlearning 
the association between A and B. As A-C 
trials accumulate, $A_C$ becomes differen-
tiated from $A_B$ so that the nominal A is
interpreted reliably as $A_C$, to the end that
only the unit A-C is retrieved from
memory. The ever-increasing degree of
retroaction is simply the increasing stimu-
lus encoding bias toward $A_C$ instead of $A_B$.
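
The four-feature example in the second point can be enumerated directly. The sketch below lists every size-two sample and tallies the recall outcome it produces. The feature labels follow the text, but the function names and the default of equally likely samples are assumptions made here.

```python
from fractions import Fraction
from itertools import combinations

FEATURES = ("a1", "a2", "a3", "a4")
CUE_B = "a1"  # the functional stimulus A_B in the text's example
CUE_C = "a2"  # the functional stimulus A_C in the text's example


def outcome(sample):
    """Classify one feature sample by the responses it makes recallable."""
    has_b, has_c = CUE_B in sample, CUE_C in sample
    if has_b and has_c:
        return "both"
    if has_b:
        return "B only"
    if has_c:
        return "C only"
    return "neither"


def outcome_probabilities(sample_weights=None):
    """Probability of each recall outcome over all size-two samples.

    sample_weights maps each two-feature sample to its probability.
    When omitted, the six samples are treated as equally likely, which
    is an assumption of this sketch.
    """
    samples = list(combinations(FEATURES, 2))
    if sample_weights is None:
        sample_weights = {s: Fraction(1, len(samples)) for s in samples}
    totals = {name: Fraction(0)
              for name in ("both", "B only", "C only", "neither")}
    for sample, weight in sample_weights.items():
        totals[outcome(sample)] += weight
    return totals


probs = outcome_probabilities()
p_b = probs["both"] + probs["B only"]
p_c = probs["both"] + probs["C only"]
```

With equally likely samples, P(B) = P(C) = 1/2 and P(B and C) = 1/6. Passing unequal `sample_weights` shows how the shape of the distribution alone shifts the balance between C and B. Whether the joint probability equals the product of the marginals depends on the sampling scheme assumed, with exact equality when features enter the sample independently of one another.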

What the dependency relation that char- 
acterizes this view should be is not clear at 
this stage of development. With some con- 
fidence we can say that it is not associative 
dependency among stimulus components. 
The central postulate, however, is that $A_C$
is different from $A_B$ and that A-C learning
involves differentiation of $A_B$ and $A_C$. The
idea that $A_C$ might differ from $A_B$ has re-
ceived some attention in the literature. 
The results have been mixed, but they are 
worth commenting on. After having 
learned a list of A-B pairs, Houston's (1967, 
Experiment III) subjects were then re-
quired to learn a second list of the form 
either AX-C or AX-C', where X was a 
new stimulus component (added to the 
original A) and C and C' were new re- 
sponses that were, respectively, different or 
meaningfully similar to the original B 
responses. Following this second task, his 
subjects were presented with either A, X, 
or AX and requested to provide the second- 
task response (C or C'). For the AX-C'
condition of second-list learning, the com-
pound AX stimulus elicited 7.00 (out of a
possible 8) C' responses. The X and A
components elicited, respectively, 2.88
(41% of 7.00) and 5.88 (84%) C' responses,
indicating that when the first- and second-
task responses were similar (one possibly


* The Melton-Irwin result was observed not in
paired-associate but in serial learning. The basic
reference for this phenomenon in paired-associate
learning should be Underwood (1945).
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mediating the other), subjects tended to 
stick with the first-task stimulus (the A 
component). But when the first- and 


second-task responses were quite dissimilar 
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this differentiation in the second task, either 

by exercising some variation on the original 
m soe 3 

encoding or combining the original encoding 


with some new feature. If this is the case, 


(the AX-C condition), the respective mean 
number of second-task responses elicited 
were 7.13 for AX, 4.38 (61%) for X, and 
4.88 (68%) for A, indicating that in an 
interference situation there was a tendency 
to switch to the new component (X) or to 
add the new component to the old compo- 
nent (A). The critical comparison is for 
the X component: 41% for the AX-C’ 
condition, 6195 for the AX-C condition. 
An ordinary 2 X 3 analysis of variance of 
these data did not, however, yield a sta- 
tistically significant interaction, the fact 
that led Houston to conclude against re- 
coding of the stimulus in the second task. 

But results from such experiments as 
Houston's, whatever their apparent out- 
come, must be viewed with reservation. 
Implicit in the conclusions drawn from 
them is the idea that the functional en- 
coding of a given stimulus is to be identified 
with, say, a particular letter of that stimu- 
lus. Surely this is an idea we can no longer 
entertain seriously, especially in light of 
the numerous encoding possibilities already 
listed with firm, supporting evidence (e.g., 
Underwood, 1969; Wickens, 1970). As if 
to emphasize the difficulty in trying to 
identify obvious, external, experimenter- 
defined stimulus attributes with the 
learner's subtle, internal, subjectively de- 
fined functional encodings, we may note 
that Schneider and Houston (1968) found 
positive evidence for shifts in stimulus en- 
coding in a negative transfer paradigm 
while Williams and Underwood (1970, 
Experiment I) did not. 

The general idea of stimulus recoding on 
the part of a learner who is faced with 
learning a new behavior in an old situation 
must be distinguished from an accompany- 
ing and complementary notion, namely, 
that one of the products of first-task learn- 
ing in the experimental paired-associate list 
situation is reliable differentiation among 

the several stimuli in the list. Presumably, 
this accomplishment requires effort; and if 
so, learners may well prefer to make use of 


| 


then rough tests of recoding not only can 
fail to prowide evidence that recoding 
occurred but also can provide, spuriously, 
evidence that the original encoding was re- 
tained in its original form in second-task 
learning. In short, it seems to be true that 
we have, on occasion, allowed conveniences 
of method and measurement to play the 
role of silent or hidden assumptions in our 4 
inferences. 
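
The component percentages quoted above for Houston's (1967) experiment express each component's mean as a fraction of the compound AX mean in the same condition. A minimal sketch of that arithmetic follows; the dictionary layout and the function name are illustrative assumptions, not anything taken from the source.

```python
# Mean second-task responses (out of a possible 8) as reported in the text.
# The dictionary layout and all names here are illustrative assumptions.
MEANS = {
    "AX-C'": {"AX": 7.00, "X": 2.88, "A": 5.88},  # similar responses
    "AX-C": {"AX": 7.13, "X": 4.38, "A": 4.88},   # dissimilar responses
}


def percent_of_compound(condition):
    """Express each component mean as a rounded percentage of the AX mean."""
    means = MEANS[condition]
    return {cue: round(100 * means[cue] / means["AX"]) for cue in ("X", "A")}


for condition in MEANS:
    print(condition, percent_of_compound(condition))
```

The X entries give the critical comparison discussed in the text: 41% in the AX-C' condition against 61% in the AX-C condition.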

CONCLUSIONS

The general view outlined in the pre-
ceding section is in agreement with
McGeoch's hypothesis of independence
between A-B and A-C associations. It
proposes that McGeoch's idea of dominance
be interpreted as a stimulus encoding bias 
that is in part determined by the to-be- 
associated response. It says “yes” that 
two distinct behaviors can be associated to 
the same nominal stimulus situation, but 
"no" that they can be associated to the 
same functional encoding of that situation. 

The general view argued in the preceding 
section is that a major portion of learning 
is perceptual learning—learning an effec- 
tive identifying response to the nominal 
stimulus situation. Learning new behavior 
in the same nominal situation (transfer) is, 
accordingly, largely a matter of learning to 
perceive the situation differently from 
before. The phenomena of retroaction, [...] by how the learner
perceives (encodes) the stimulus situation
on a later test occasion.

In general perspective, [...] taken in this paper are in part intended
to shift the focus of theoretic attention to the perceptual end of the
human information-processing system, a seemingly more fertile domain; a
domain, moreover, [...] greater heuristic contact with other areas of
psychology. Of course, it does not follow, pari passu, that old [...]
are thereby solved; only that [...] may be a more likely route of
approach.
THEORETICAL NOTES
SELF-PUNITIVE BEHAVIOR—MASOCHISM OR CONFUSION?! 


L DREYER AND K. EDWARD RENNER 2


University of Illinois 


[...] of self-punitive behavior in rats was considered and experimentally
illustrated with human Ss. It had been suggested in the previous [...]
choosing to run to a painful shock, the rats were showing a preference
[...], thus providing an analogue for masochism. This extension to
masochism was considered inappropriate, however, because of the [...]
concept of choice and the attribution of a state of preference to the
[...] adequate preference methodology. The apparent paradoxical [...]
seen in self-punitive behavior were resolved by making a distinction
between instrumental learning about response-contingencies (expectancy)
and motivational states (preference).


[...] Martin, and [...] Athey, and [...] demonstrated that under [...]
to shock in a [...] will retard [...]. Specifically, rats [...] which is
present [...] will continue to [...] only in the [...] which are given
[...] trials. This [...] labeled self-punitive [...].

EXPLANATIONS OF SELF-PUNITIVE BEHAVIOR

[...] for self-punitive [...]. The first [...] a discrimination [...]
the rate of extinction [...] extent to which the [...] are discriminably
different [...] conditions of acquisition. The second is the vicious
circle hypothesis originally proposed by Mowrer (1947) in which the
punishment [...] the fear which motivates the response which is
punished.

The Discrimination Hypothesis

In the normal punishment-facilitation experiment, the start box is above
the runway. At the start of a trial, a warning signal comes on followed
by the opening of a trap door which drops the rat onto the electrified
grid floor of the runway below. According to the discrimination
hypothesis, rats punished during extinction spend so little time in the
runway area under the start box that there is no opportunity for them to
learn that the first part of the runway no longer delivers shock. In
contrast, subjects in the regular-extinction group are not shocked in
the runway; thus, these subjects run more slowly, allowing
discrimination and the subsequent extinction to occur.

The explanation based on the discrimination hypothesis has been
criticized (Brown, 1969) in that discrimination is defined by whether or
not the rat runs without providing independent evidence as to whether or
not the change in conditions has been discriminated. In order to avoid
circularity, it would be necessary to show that the subjects punished
during extinction had not discriminated the change, but that rapid
extinction occurred when the change was discriminated.

Another criticism of the discrimination hypothesis (Brown, 1969) is that
it fails to account for the results of studies by Melvin (1964) and
Martin and Moon (1967). These studies showed that groups of rats trained
under 33% and 67% shock schedules and then shifted to 100% shock during
extinction were just as resistant to extinction as groups maintained on
their original schedules. Brown interpreted the discrimination
hypothesis to require that a schedule shift would facilitate extinction.
However, the discrimination hypothesis does not require that any change
whatsoever in the situation will facilitate extinction. The rate of
extinction should be influenced only by changes which do in fact affect
a discrimination between shock and nonshock in the startbox.

1 Supported by Grant MH 11633 from the National Institute of Mental
Health, United States Public Health Service.

2 Requests for reprints should be sent to K. Edward Renner, Psychology
Building, University of Illinois, Champaign, Illinois 61820.


The Vicious Circle Hypothesis 


The vicious circle hypothesis, which is 
favored by Brown (1969), has four basic as- 
sumptions: (a) Fear becomes conditioned to 
environmental cues during training because of 
contiguity. (b) Fear of the start box provides 
the motivation for running. (c) Running is 
reinforced by fear reduction in the goal box. 
(d) The acquired fear-arousing properties of
the situation are strengthened on each self-
punitive trial in which shock is encountered
in the runway. Brown (1969) concludes that
"the rats run because they are afraid and
continue to be afraid because they encounter
punishment when they run [p. 503]."

There is no need, however, to assume that
the two hypotheses are actually in conflict. 
The discrimination hypothesis states that to 
the degree to which rats fail to discriminate a 
safe from an unsafe start box they will con-
tinue to run. The vicious circle hypothesis
explains from the point of view of the experi-
menter why this is so. It does not matter for
the vicious circle hypothesis whether or not
shock is present as long as fear is elicited in
the start box. When fear is present, the sub- 
jects punished during extinction will then be 
subjected to further shock and caught in the 
circle, while regular-extinction subjects will 
extinguish to the fear-producing cues in the 
start box. 

The distinctive point about a vicious circle 
is that the behavior should show unusual 
persistence over a long period of time, assuming 
other conditions do not change. Given the 
correctness of the vicious circle hypothesis, 
the eventual extinction which does occur indi- 
cates that subjects punished during extinction 
do learn that there is no shock in the area 
under the start box. But if this is true, it 
implies that when extinction started, the rats 
punished during extinction were not discrimi- 
nating the change in the area under the start 
box, or at least could not inhibit taking the 
first few steps down the runway. The un- 
answered problem is, therefore, why discrimi- 
nation or suppression of running is slower for 
the punished-extinction as opposed to the 
regular-extinction rats. 
EXTENSION OF SELF-PUNITIVE BEHAVIOR
TO THE CONCEPT OF MASOCHISM


Aside from arguing against the discrimination hypothesis and for the
vicious circle formulation, Brown (1969) assumes that the rats which are
punished, in comparison to the rats receiving nonpunished extinction,
are choosing to leave the safe area under the start box to run across a
shock grid to a safe goal box. He writes: "although the start box is
conventionally described as the safe section of the runway, the fact
that the self-punitive rat repeatedly chooses to expose himself to shock
rather than remain in the start box permits one to define the start box
as more aversive than the shock or the shock as more attractive than the
start box [p. 506]." It is this extension of the data, not the data
themselves, which requires Brown to reject the discrimination
hypothesis. The attribution of choice to his animals requires, by
definition, that they have discriminated the alternatives available to
them. Only by excluding the discrimination hypothesis can the extension
of self-punitive behavior to masochism be maintained as an appropriate
[...].

In Brown's (1965) conception of masochism, he argues that the
attractiveness or the aversiveness of physical stimuli can be determined
only by the behavior of the organism. Events which are approached are
called attractive, while those which are avoided are called aversive.
According to this "behavioral criterion," masochistic behavior occurs
when an [...]. Thus rats punished during [...] that they approach
electric shock which other rats (and they themselves later) normally
avoid.

At first appearance, Brown's "behavioral criterion" may seem to be a
useful way of looking at the problem of masochism, but if it is applied,
then all at once many behaviors which in common usage are not considered
masochistic will become so. Is the man who steps off a curb into an
oncoming car (something he usually avoids) being masochistic? Clearly,
observing his behavior is not enough to give this information (unless he
is declared masochistic by fiat). It is necessary to know something
about the state of his expectancies and preferences when he steps off
the curb.

Does the evidence [...] extinction are masochistic, that is, that they
"choose" to run over shock to the goal box rather than
remain in the start box?

In order to answer this question, the concept of choice must be
considered. First it must be established that the start box is perceived
by the rats as not containing shock. As an illustration of this
requirement, consider the following hypothetical illustration. If rats
were trained in a T maze containing 5 grams of food in the left arm and
nothing in the right arm, one would expect consistent selection of the
rewarded arm. If 10 grams of food were then placed in the right arm, we
would not be justified in claiming that 5 grams were preferred over 10
grams because the rat continued to go to the left arm. The statement of
preference could only be made if the animals "knew" that 10 grams were
in the right arm. Not only must at least two discrete outcomes exist,
but they must be discriminated by the organism making the choice. The
preference which choice implies does not exist when one of the behaviors
has a consequence which is unknown to the subject.

In most punishment-facilitation studies, the starting point is one of
the alternatives and there is no way of knowing, a priori, that not
[...] nonshock alternative for [...]. When requirements for a concept of
choice are considered, Brown's extension to masochism is no less
circular than the discrimination hypothesis because there is no
independent evidence that a choice exists.

A nonpunished control group is included in Brown's research in order to
have a standard against which to compare the behavior of the rats
punished during extinction. Brown's procedure, however, establishes only
that two groups discriminate these alternatives [...] different rate,
the understanding [...] an interesting psychological question. The
empirical fact of this differential rate is meaningless, however, [...]
because the methodological procedures of [...] not permit the use of the
concept of choice [...]. The conceptual difficulty can be illustrated by
a parallel experiment conducted with humans.


An Illustrative Human Experiment 


Method. The subjects were seated alone in a [...] room before a
numerical readout screen. At the start of a trial, the experimenter said
"Get ready" [...] one to two seconds later the number 01 appeared on the
screen and [...] sounded for .5 seconds, followed immediately by a
pulsating shock to the index and ring finger of the nondominant hand.
The dominant hand was free to operate a telegraph key. The subjects were
instructed that:


The electric shock may be escaped by pressing 
the telegraph key. Twenty presses on the key 
are required to increase the number on the screen 
by one. Once you get the screen to count up to 
the number ten, the shock will turn off. 


After five acquisition trials, during which the 
shock was successively increased to a maximum of 
2.5 milliamperes, the subjects, all of whom were 
male, were alternately assigned to one of two ex- 
tinction groups. For the regular-extinction group, 
the shock was turned off. For the punished-extinction group, [...]
turned on the shock, which could be escaped only by advancing the screen
[...]. A time limit of 50 seconds, which was set on all trials, also
served as the extinction criterion.

It was expected that subjects would start to make key press responses
during the warning period or at least pounce on the key as soon as the
buzzer stopped. Once the extinction phase of the experiment started, the
regular-extinction group should soon discover that the shock had been
terminated and stop responding or slow down. The second group, which
paralleled Brown's punished-extinction group, should find only that they
continue to be punished and must press the key to escape the punishment.
All subjects punished during extinction who had not discovered at the
end of the five extinction trials that it was their own key pressing
which was causing them to be shocked were instructed as to the actual
contingencies and given an additional trial. All subjects filled out a
standard questionnaire and were interviewed at the end of the
experiment.


Results. The results are shown in Figure 1. Except for the lack of a
learning curve during acquisition, the figure is indistinguishable from
the animal data on self-punitive behavior. The acquisition phase of key
pressing is close to the capacity of the subjects, reflecting that
humans know how to press a key rapidly. The apparent separation of the
two groups during acquisition was not significant (F = 1.86, df = 1/20)
and reflected only that the regular-extinction group contained several
subjects who could not generate fast key pressing; there was no
qualitative difference in their behavior. The extinction data are of
interest on two counts: First, the group punished during extinction
pressed the key at a significantly faster rate than the
regular-extinction group (F = 149, df = 1/20, p < .01) over the five
extinction trials. Second, 10 of the 11 subjects in the group punished
during extinction stopped pressing the key once they were informed of
the contingencies (Trial 11).

FIG. 1. [...] presses were completed on the five acquisition and five
extinction trials for the punished-extinction (PE) and
regular-extinction (RE) subjects. (Subjects punished during extinction
received an eleventh trial after they were informed that it was their
initial key press which turned on the shock. The 50-second limit on each
trial imposed a lower boundary of 2.0 on the speed score. The maximum
speed obtained by any individual subject was 4.0.)


The questionnaire and interview data served 
to confirm what is intuitively obvious. Nine 
of the 11 subjects punished during extinction 
continued to press for all five extinction trials 
and reported that they did not know the shock 
was being turned on by their own key presses. 
Two of the 11 subjects punished during extinc- 
tion always waited for the shock to begin 
during acquisition and immediately detected 
the change at the start of extinction, refraining 
from making any responses during extinction. 
Six regular-extinction subjects pressed the key 
for all the five extinction trials, but all did so 
at a slower rate than during acquisition. Of 
the six, two reported that they thought the 
shock might come on again so they wanted to 
be close to the end, while the other four re- 
ported that they wished to get the experiment 
over with. The five regular-extinction sub- 
jects who stopped pressing altogether stated 
that there was no need to press because there 
was no shock. Nine of the 11 regular-extinc- 
tion subjects made key-press responses at the 
start of the sixth trial and presumably would
not have detected the change had they been 
in the punished-extinction condition. 
The human data are quite similar to the animal data, in which some
subjects punished during extinction detect the change and immediately
stop responding, and some regular-[...].

Discussion. What is obvious in the human situation also applies
conceptually to the animal situation. The experimental procedure simply
does not permit the introduction of the terms "choice," "preference," or
"masochism," [...] receiving punish[...] the no-shock outcome is outside
the [...].

Requirements for an Extension to Masochism 


Our human [...] of the animal [...] into the role of competing
alternative explanations. Clearly, fear motivates responding, and a
change in behavior does [...] discrimination between the alternatives
has taken place. Our evidence with humans suggests that as soon as this
discrimination [...] place, subjects punished during extinction will
stop responding. In experiments on self-punitive behavior, it is
necessary to separate expectancy questions about response-outcome
contingencies from motivational [...] about preferences between those
outcomes.

There is no basis at this time to infer choice [...] over [...]
"behavioral criterion" proposed by Brown (1965, 1969) is inadequate for
it does not use a preference methodology (e.g., Irwin, 1961).

Our critique of the existing research is not
intended to dispute the reliability of the em- 
pirical phenomenon of self-punitive behavior. 
Indeed, it is as reproducible with humans as 
with animals. But, the roles of fear and dis- 
crimination both need to be considered. 
Whether the basic paradigm can be used to 
provide an analogue for masochistic behavior 
yet remains to be demonstrated through the 
use of preference methodology. To meet the 
constraints of a preference methodology, it 
would be necessary to show that running into 
the shock by the punished-extinction subjects 
is preferred over some alternative; but that 
the regular-extinction subjects prefer (or are 
indifferent to) the same alternative versus 
running without shock. Such a demonstration
seems unlikely.
STRENGTH THEORY AND CONFIDENCE
RATINGS IN RECALL


HARLEY A. BERNBACH 1


Cornell University 


Two different types of theories about the
nature of the memory trace have been applied
to confidence ratings in short-term memory.
One, a strength theory (e.g., Wickelgren &
Norman, 1966), effectively postulates a con-
tinuous distribution of memory states. The
other postulates only two states. Both can
handle recognition memory data about equally
well. But Murdock (1966) reported an unex-
pected invariance in one measure of a subject's
performance in recall experiments: the Type 2
receiver operating characteristic (ROC) curve
does not vary with serial position. This is an
important finding, for it may allow us to dis-
tinguish between the two classes of theories.
The two-state theory clearly predicts this re-
sult (Bernbach, 1967). But until now, it has
not been possible to derive the predictions for
the continuous strength theory.

The purpose of this note is to present the 
predictions from continuous Strength theory 
and to show that they are incompatible with the 
data presented by Murdock (and replicated by 
Bernbach, 1967, 1970; Bernbach & Bower, 
1970). This note therefore modifies my 1967 
statement about continuous strength theory. 
The reasoning in that paper was incorrect. 
This note, however, shows that when the pre- 
dictions of continuous strength theory are 
worked out (by means of a Monte Carlo simu- 
lation), the prediction of the earlier paper is 
correct, even though it was made then for im- 


proper reasons. 


The Simulation 

First it was necessary to make some assumptions about the relationship
between confidence judgments and a strength theory for a
multialternative recall task. The strength theory adopted has normally
distributed strength distributions and uses the traditional "habit
family hierarchy" theory, which assumes that the response emitted is the
one with the highest strength at the instant of recall. [...] of
specific [...] response, while the second assumed confidence to be a
function of the difference in strength between the two highest items at
the instant of recall. It was felt [...] models represented end points
on [...] models for the relationship between strength and confidence.

For the Monte Carlo runs, a task with just four response alternatives
was assumed, mainly to keep down the [...]. Empirically, such a task was
discussed by Bernbach (1967, p. 477). At that time, the ROC curves shown
were modified to "correct for guessing," so that the two-state model
would predict no change across trial number. This was necessary because
the two-state model predicts invariance in ROC curves with respect to
[...]. It was these modified ROC curves that were shown to be invariant
with respect to trial number. Thus, the same modification for guessing
was applied to the simulated data [...] curves were plotted to [...]
differences were predicted [...] continuous strength models. It should
be [...] that this modification has the property [...] the curves closer
together and, [...]

1 Requests for reprints should be sent to Harley A. Bernbach, Department
of Psychology, Morrill Hall, Cornell University, Ithaca, New York 14850.
[Figure 1 plot: modified p(y|correct) against p(y|error), with separate
curves for Trial 1 and Trial 2.]

FIG. 1. Simulated and modified-for-guessing ROC curves for the relative
confidence model.

[...] distributions, all with standard deviations equal to one. This is
the simplest case of equivariate strength distributions and is most
closely related to the strength models for recognition memory proposed
by Wickelgren and Norman (1966). The first distribution was designated
"correct," and depending on trial number (designating the two ROC curves
to be plotted), had a mean of 1 or 2. These values were chosen
arbitrarily to give some range to the probability correct (which turned
out to be .55 and .83, respectively). The remaining three distributions
were designated as errors, and their means were zero.

In the computer program, an array with two rows and 15 columns was
established for each model, relative confidence and absolute confidence.
If the highest of the four numbers generated at random from the four
distributions was the first, a number in one of the cells in the first
row of each array was incremented by one; otherwise, a number in a
second row cell was incremented. The columns of the cells incremented
depended on the confidence model involved. An example will show how this
was done.

Suppose the four values generated in a draw were 1.37, .26, -.43, and
.03. Since the first is the highest, a number in the first (or
"correct") [...] would be incremented by one. [...] absolute model, the
number 1.37 would be multiplied by 4 (a multiplier [...] arbitrarily to
provide some range [...]) and rounded off to 5. Thus, the number in cell
1 by 5 would be incremented by one. For the relative model, the
difference between the two highest numbers, 1.11, would be multiplied by
6 (again an arbitrary multiplier) and rounded off to 7. In the array for
the relative model, cell 1 by 7 would be incremented. The ROC curves
were obtained from the filled arrays in the standard manner, and then
were modified for guessing, using a guessing rate of one-fourth.

[Figure 2 plot: modified p(y|correct) against p(y|error), with separate
curves for Trial 1 and Trial 2.]

FIG. 2. Simulated and modified-for-guessing ROC curves for the absolute
confidence model.

The simulated (and modified-for-guessing) ROC curves for the relative
strength model are shown in Figure 1. It is quite clear that the
distance between the modified ROC curves and the major diagonal
increases with the probability of a correct response. This is also true
of the absolute strength model, as shown in Figure 2. As mentioned
earlier, the empirical evidence shows no such change, even with larger
differences in correct response probability than that obtained from this
particular Monte Carlo simulation. (Several alternative simulations have
also been carried out, each with the same result: the continuous
strength theory cannot predict invariant Type 2 ROC curves. Among the
variations tested were different rules for generating confidences and
allowing the means and variances of the strength distributions to
differ.)
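
The tallying procedure described above can be sketched in a few lines. This is a rough illustration and not the original program: the number of draws, the seed, the zero-based row and column indices, and the clamping of out-of-range columns to the 15 available cells are all assumptions, since the text does not say how extreme values were handled. The modification for guessing is left out because its formula is not given here.

```python
import random

N_COLUMNS = 15   # confidence columns per array, as stated in the text
ABS_MULT = 4     # multiplier for the absolute confidence model
REL_MULT = 6     # multiplier for the relative confidence model


def clamp(column):
    """Assumption: out-of-range columns are clamped to the array bounds."""
    return max(0, min(N_COLUMNS - 1, column))


def confidence_columns(strengths):
    """Return (row, absolute column, relative column); item 0 is 'correct'."""
    ranked = sorted(strengths, reverse=True)
    row = 0 if strengths[0] == ranked[0] else 1   # row 0 correct, row 1 error
    abs_col = clamp(round(ABS_MULT * ranked[0]))
    rel_col = clamp(round(REL_MULT * (ranked[0] - ranked[1])))
    return row, abs_col, rel_col


def simulate(correct_mean, n_draws=20000, seed=1):
    """Tally draws from one 'correct' and three 'error' unit-variance normals."""
    rng = random.Random(seed)
    abs_array = [[0] * N_COLUMNS for _ in range(2)]
    rel_array = [[0] * N_COLUMNS for _ in range(2)]
    for _ in range(n_draws):
        strengths = [rng.gauss(correct_mean, 1.0)]
        strengths += [rng.gauss(0.0, 1.0) for _ in range(3)]
        row, abs_col, rel_col = confidence_columns(strengths)
        abs_array[row][abs_col] += 1
        rel_array[row][rel_col] += 1
    p_correct = sum(abs_array[0]) / n_draws
    return p_correct, abs_array, rel_array


def type2_roc(array):
    """Cumulate from the highest column down: (p(y|error), p(y|correct))."""
    n_correct, n_error = sum(array[0]), sum(array[1])
    points, hits, false_alarms = [], 0, 0
    for col in reversed(range(N_COLUMNS)):
        hits += array[0][col]
        false_alarms += array[1][col]
        points.append((false_alarms / n_error, hits / n_correct))
    return points


for trial, mean in ((1, 1.0), (2, 2.0)):
    p_correct, _, _ = simulate(mean)
    print(f"Trial {trial}: probability correct = {p_correct:.2f}")
```

Calling `confidence_columns([1.37, 0.26, -0.43, 0.03])` reproduces the worked example in the text: the correct row, column 5 for the absolute model, and column 7 for the relative model.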


Discussion 

Of course, it is not possible to generalize the above results to every
theory of the memory trace that relates a strength continuum to
confidence. The models considered were reasonable ones, however, and the
results quite clear. Despite the faulty earlier argument
(Bernbach, 1967), it would seem, until a model
to the contrary appears, that the invariance of 
Type 2 ROC curves relative to recall proba- 
bility argues for the two-state model. 

It should be made clear that the preceding 
argument is addressed only to strength theories 
that, like the models of Wickelgren and Norman (1966), postulate a
direct relation between strength and confidence. A strength theory like
that proposed by Shiffrin (1970) is, in fact, a two-state theory for
confidence ratings. In Shiffrin's search theory, an item is in either a
found state or a not found state at the time of test. Though the
strength of an item determines which state it is in, confidence can be
based on distributions for only these two states.

The [...] as far as the subject's awareness and ability to rate
confidence [...] there are two and only two states. As stated in the
earlier paper (Bernbach, 1967), there are a multitude of specific
theories of memory that are compatible with this.
THE BEHAVIORISTIC CONCEPTION OF NEUROSIS:
A REPLY TO TWO CRITICS


JOSEPH WOLPE 1
Temple University Medical School


Criticisms of the behavioristic view of neurosis by Costello and by
Weitzman are answered. It is observed that some of the criticisms arise
from failure to note the centrality of autonomic responses.

The idea that neurotic behavior consists of conditioned responses of
specifiable character has in the past decade been the subject of a
considerable amount of experimental and clinical research that has
largely supported it (see, e.g., Paul 1969). This paper replies to two
recent critiques by Weitzman (1967) and by Costello (1970). Since
Weitzman's criticisms are mainly in the clinical area, while Costello's
deal essentially with learning mechanisms, it will be logical to discuss
the latter first.

1 Requests for reprints should be sent to Joseph Wolpe, Department of
Psychiatry, Temple University Medical School, c/o Eastern Pennsylvania
Psychiatric Institute, Henry Avenue, Philadelphia, Pennsylvania 19129.

Costello

Theory of neurosis. (a) Much of Costello's criticism stems from his
erroneous belief that behaviorists regard neurosis as an instance of
motor avoidance conditioning. In fact, the definition that has been
generally adopted is this: A neurosis is a persistent unadaptive habit
that has been acquired by learning in one or a series of
anxiety-generating situations. Conditioned anxiety is usually the
central component of neurotic habits (Wolpe, 1958). Anxiety is defined
as an individual organism's characteristic constellation of autonomic
responses to noxious stimulation. It is ironic that in referring to
Eysenck's recently expressed adoption of a classical aversive
conditioning theory rather than an avoidance one, Costello states, "it
does not appear to have resulted in any proposed changes in the
understanding and methods of treatment of phobias and need not be
discussed further [p. 251]." The methods of behavior therapy have always
been based on the classical conditioning theory implied in the above
definition of neurosis.

(b) Costello's contention that animals do not usually learn to avoid the
conditioned stimulus to an avoidance reaction is obviously irrelevant to
this definition of neurosis. What is relevant is that that stimulus, in
the course of the same set of conditioning operations that conditioned
it to the avoidance reaction, will also have become conditioned to the
anxiety responses that the unconditioned stimulus evoked. Evocations of
the conditioned anxiety response are an effective source of second-order
conditioning of that response (Wolpe, 1952a; 1958, p. 60), in contrast
to evocations of an avoidance response (McAllister & McAllister, 1964).

(c) Costello declares that "the explanation usually accepted by behavior
therapists" for the resistance to extinction of phobias is Solomon and
Wynne's (1954) proposition that extinction is prevented because the
avoidance response makes "reality testing" (by the unreinforced
presentation of the CS) impossible. This explanation is at odds with the
facts of experimental neurosis (Wolpe, 1952a; 1958, p. 71). Subjecting
experimentally neurotic animals to repeated or prolonged exposure to the
stimuli eliciting the neurotic responses does not lead to extinction. In
other words, when "reality testing" does occur, it is not "curative."
What might account for the resistance to extinction is that removing the
animal from the experimental cage and its anxiety-evoking stimuli
results in anxiety-drive reduction, which reinforces the anxiety
responses to the cage stimuli that were evoking them (Wolpe, 1952a,
1952b).

Causation. (a) Costello states that although Wolpe (1952a) and Smart
(1965) demonstrated that no conflict is necessary to produce
experimental neuroses, "even the most casual observation of clinical
phobias" suggests that conflict is involved in them. He adduces no
evidence to support this statement. The ubiquity of conflict in clinical
neuroses is not borne out by patients' histories. Most phobias, as well
as more complex anxiety habits, have no discernible relation to any
ongoing conflict; nor need conflict have played any part in their
origin. For example, a driving phobia may
date from a panic occasioned by an attack of
hypoglycemia during driving. On the other 
hand, of course, conflict can be the “uncondi- 
tioned” source of the anxiety response to which 
neurotic conditioning occurs (Wolpe, 1958, 
pp. 38 and 79). ; 

(b) The fact that Bregman (1934) failed to 
reproduce the phobia that Watson and Rayner
had induced in Little Albert has little signifi- 
cance. In the first place, people are not all 
equally conditionable. Second, failure was 
probably ensured by the fact that “more neutral 
stimuli” were used as potential conditioned 
stimuli. Seligman (1970) has shown how rele- 
vant the character of a stimulus is to its condi- 
tionability to a particular response. 

(c) Costello argues that the success of sys- 
tematic desensitization does not support the 
learning theory of phobias because “methods 
other than systematic desensitization appear to 
be equally successful.” The equal success of 
other methods has by no means been demonstrated
(see, e.g., Nawas & Mealiea, 1971;
Paul, 1966; Sushinsky & Bootzin, 1970; Willis 
& Edwards, 1969). To the extent that methods 
derived from a causal theory are egregiously 
successful, the theory becomes more plausible; 
but direct testing in the causal context can 
alone establish a causal theory. 

(d) Costello finds the suggestion of Andrews 
(1966) that phobias develop in relation to the 
satisfaction of dependency needs “at least 
equally plausible, despite the lack of an experi- 
mental analogue.” We are not told what con- 
siderations make it plausible. The conditioning 
theory of the development of phobias at present 
looks quite persuasive—it has an experimental 
base, it is borne out by the details patients 
give of the onset of their fears, the clinical 
methods that are its fruit have been attended 
by success, and no factual considerations have 
been brought forward that seriously challenge it.


Weitzman 


Weitzman states that the major substance 
of his essay is “an examination of the grounds 
on which clinicians have based their rejection 
of behavior therapy.” 

The relation of “learning theory” to behavior 
therapy. On the basis of a consensus among 
experimental psychologists (Estes, Koch, Mac- 
Corquodale, Meehl, Mueller, Schoenfeld, & Ver- 
planck, 1954) that “there is no monolithic sys- 
tem properly called learning theory,” Weitz- 
man deduces that there is no learning theory 
competent to handle the data of either behavior
therapy or psychotherapy. Behavior therapists
may give a misleading impression when they 
speak of behavior therapy arising out of “learn- 
ing theory,” but their allusion is not to any 
global theoretical system, but to particular prin- 
ciples and paradigms that experiments in the 
psychological laboratory have yielded (see, e.g., 
Skinner, 1953; Wolpe, 1958). An experimentally
established paradigm is a solid possession
which behavior therapists apply to clinical 
problems. This enterprise has borne results 
that support the hypothesized parallelism between
the laboratory and the clinic.

Weitzman's criticisms of systematic desensitization.
(a) Weitzman states that the derivation
of systematic desensitization from the
treatment of animal neuroses by systematic
feeding in the presence of increasingly strong
anxiety-evoking stimuli hides "a form of anal- 
ogy-making which gives comfort to the be- 
havior therapist but which obscures differences 
of profound significance between the systematic 
desensitization of cats and men [p. 304]." 
Surely, if parallel procedures in cats and men 
cure the neuroses of both, the species differ- 
ences must be judged irrelevant in the particu- 
lar context. 

(b) In the standard desensitization proce- 
dure, the scene the patient is directed to imag- 
ine is regarded as paralleling the exogenous 
stimulus situation in which the experimental
cat is desensitized. Weitzman endorses the
view of Breger and McGaugh that the use of
the term "stimulus" is here only "remotely
allegorical" to the traditional use of the term in
[...]. [...] Lundberg's (1937) definition of
stimulus as "[...]" [...] a succession of words
[...] is just as much a stimulus
to the response that follows it as
[...] stimulus would be. Second, several
[...] have shown that much the
same results are found if extrinsic stimuli
are used in the place of imaginal stimuli
(Goldberg & D'Zurilla, [...]).


(c) Weitzman describes an experiment he
undertook on six patients to investigate their
[...] period when they
[...] to be imagining hierarchical
[...]. He states that they all reported a
[...] flow of visual imagery which he feels
[...] "[...] association in fantasy."
However, he [...] maintain [...]
visualization [...]
up to one minute, far more than the standard
duration of 5-7 seconds for scene presentations
in systematic desensitization (Wolpe, 1969).
Patients seem almost invariably able to maintain
constant visualizations for the standard
short spells.

Weitzman's defense of psychoanalytic psychotherapy.
Weitzman tries to rescue psychoanalytic
theory from the implications of the failure
of symptom substitution to occur after behavior 
therapy by putting forward three separate pro- 
posals. The first of these is that symptoms 
may disappear permanently through means 
other than the breakdown of repression, quot- 
ing Freud's (1936) distinction "between mere 
repression and the true disappearance of an old 
desire or impulse [p. 85]." Weitzman does
not suggest by what mechanism consonant with 
psychoanalytic theory desensitization could 
achieve the lasting disappearance of "old de- 
sires and impulses.” 

The second proposal is that somehow desensitization
may be a repression-lifting mechanism.
There would need to be evidence that presumed
repressed material does somehow emerge in
the course of desensitization, and of this, those
who routinely use the technique have so far
seen no sign.

Weitzman's third proposal is that systematic 
desensitization, by diminishing fear, increases 
ego strength, and may thus signal the possibility
of creating more adequate "binding behaviors."
"Why then," he asks, "predict symptom
substitution?" But can the patient be safe
from it as long as the repressed energies remain
in thrall? And does not this proposal
undercut some other tenets of psychoanalytic
theory outlined by Weitzman? If anxiety
“serves as a signal to the ego that a dangerous 
instinctual demand is growing in strength" 
and if an external phobic stimulus must "symbolically
represent the meaning of the internal
danger," it follows that the removal of anxiety
by desensitization must deprive the patient of
his valuable warning system without removing
the "internal danger."

AVOIDANCE CONDITIONING AND BEHAVIOR THERAPIES:
A REPLY TO COSTELLO


ARNOLD POWELL AND AUGUSTUS R. LUMIA


University of Nebraska 


Costello's conclusion that avoidance responses and phobias are dissimilar
was criticized on several points: (a) Costello had proposed that avoidance
responses are adaptive and phobias unadaptive. However, behaviors cannot
be functionally distinguished on the basis of adaptiveness, since adaptiveness
refers to a relationship between a given response and particular environmental
events. (b) Given Costello's definition of adaptiveness, it can be
shown that avoidance behaviors are unadaptive under a variety of conditions.
(c) Herrnstein's analysis of avoidance conditioning does not eliminate fear
reduction as a sufficient condition for avoidance. Although theoretical
interpretations of avoidance conditioning must take Herrnstein's analysis into
account, this does not necessarily have any implications about the similarity
between avoidance responses and phobias.


The view that two-factor theory of avoidance 
conditioning provides an appropriate model of 
the acquisition and maintenance of phobic re- 
actions was recently criticized by Costello 
(1970). While Eysenck and Rachman (1965) 
were singled out as examples of behavior ther- 
apists who maintain such a view, the criticisms 
were considered appropriate for any position 
that postulates that phobias are acquired ac- 
cording to the principles of Mowrer's (1947) 
theory of avoidance conditioning. Further- 
more, these specific criticisms represented an 
indictment against the behavior therapies in 
general. By providing "more specific detailing
of discrepancies between the data of learning
studies and the practices and formulations of
the behavior therapists [Costello, 1970, p. 250]"
Costello had hoped to establish a more de- 
fensible basis than had previous writers (e.g., 
Breger & McGaugh, 1965; Weitzman, 1967) 
for criticizing the increasingly popular behavior 
therapies. 

Given the present state of knowledge in our science as well as in our
technology, it is necessary to carry on a constant evaluation of the
rationale and practices of psychotherapy irrespective of whether the
model for the therapy process stems from the learning laboratory or
from any other perspective in the field of psychology. Our laboratory
models as well as the practice of psychotherapy benefit from such
evaluations and comparisons. However, it is also important to recognize
that discrepancies between the behavior of laboratory subjects and
persons observed in the clinic can arise from a variety of sources and
the occurrence of such discrepancies does not necessarily demand that
we judge them to be governed by different underlying principles.
Behaviors which are topologically different, for example, can be
functionally similar. The present writers feel that the conclusion that
conditioned avoidance reactions and phobias are "dissimilar" is somewhat
premature and that many subtleties involved in arriving at such a
conclusion need to be further explored.

1 During the preparation of this paper the first [...]

Costello's (1970) principal thesis was that conditioned avoidance
responses do not provide an appropriate model of phobic reactions
because [...] responses are adequate (coping) [...].

There are a number of difficulties associated with any attempts to
distinguish between these behaviors on the basis of their "adequateness"
or "adaptiveness" (Costello, 1970, pp. 251 ff.). First, according to the
definition provided for "adaptiveness,"

[...] the animals sim[...] . . . But phobias [...] the occurrence of
behaviors desired by the individual [...] and/or desired by society
. . . [p. 251],

there is much evidence in the literature to support
the conclusion that under many circumstances,
avoidance responses could be considered
"unadaptive." What is perhaps more
important is the fact that regardless of the
specific connotations or denotations given the 
term adaptiveness, it is not typically used to 
describe some inherent characteristic of be- 
havior. In general, adaptiveness refers to some 
relationship between a particular behavior, or
class of behaviors, and selected environmental
events. The judgment that some behavior is
“adaptive” or "unadaptive" must be predicated 
on the demonstration that this relationship 
does, or does not, hold. However, since adap- 
tive pressures and environmental contingencies 
can change rapidly over time and can vary for 
organisms at any particular point in time, this 
judgment can only be made with regard to 
some specified behavior of a given organism at 
a specified point in time. 

In order to comment further on the proposi- 
tion that avoidance behaviors are "adaptive" 
and phobias are "unadaptive," we shall first 
examine a couple of alternative implications of 
the definition that Costello provides for these 
terms and then proceed to examine some rele- 
vant experimental findings. Because of the 
factors discussed in the preceding paragraph, 
any conclusions reached about the adaptiveness 
of avoidance behaviors must be qualified by
"given a particular set of conditions and organisms."

One implication of the above definition would 
be to call a behavior unadaptive if once the re- 
sponse has been acquired, the organism will 
persist in making it even after the original 
environmental contingencies have been removed 
or even reversed. It should be possible to find 
many examples of phobias that could be de- 
scribed as unadaptive in this sense. In fact, 
this is often characteristic of a phobia; that is,
the organism encounters some situation that is 
highly aversive but later finds that situation 
to provide a different set of contingencies, and 
the earlier acquired tendencies prevent him 
from responding appropriately to the new set 
of contingencies. On the other hand, it is possible
to find examples of the same phenomenon
in laboratory investigations of avoidance conditioning.
For example, in several of the classic
studies by Solomon and Wynne (1954) on 
traumatic avoidance conditioning, dogs given 
only a very few exposures to strong electric
shock continued to make avoidance responses
throughout hundreds of extinction trials. When 
Solomon and Wynne (1953; 1954, pp. 364 ff.) 
provided some "therapy" by reversing the con- 
tingencies and making the presentation of shock 
now contingent on making the response that 
previously avoided shock, the speed of this 
instrumental reaction was actually found to in- 
crease. And in another context, Sidman, 
Herrnstein, and Conrad (1957) have shown 
that free, or noncontingent, shocks administered 
in extinction to an animal that has already 
acquired an avoidance response will lead to 
an increase in the rate of that avoidance re- 
sponse. Finally, in a series of recent investi- 
gations, Brown and his associates (Brown, 
1969) have presented considerable evidence for 
self-punitive behaviors in particular escape- 
avoidance situations. 

There are still other implications of the way 
in which Costello employed the term "adaptive."
To consider briefly another aspect, a
behavior should be considered unadaptive if it 
disrupts the organism's ongoing behavior in 
some other situation, particularly if the ob- 
jective environmental contingencies do not re- 
quire that the disruptive behavior even occur in 
other situations. No doubt there are many 
examples of instances in which phobic reactions 
have this kind of disruptive effect on other 
behaviors. However, what is more important 
in terms of Costello's conclusions is the fact 
that certain elements in avoidance conditioning 
can also have this disruptive effect—as evi- 
denced by the voluminous literature on condi- 
tioned emotional responses (cf. Campbell & 
Church, 1969). 

It should be clear from the above considerations
that according to Costello's use of the
terms "adaptive" and “unadaptive,” there are 
a variety of conditions under which avoidance 
behaviors may be described as unadaptive. 

Recent studies on the phenomenon of “learned 
helplessness” (see Maier, Seligman, & Solomon, 
1969) can be used to further illustrate the 
problems in describing a behavior as adaptive 
or unadaptive. It has been found that the 
presentation of noncontingent shocks prior to 
exposure to an avoidance contingency will have 
a very disruptive effect on avoidance acquisition.
Should the behavior that is being acquired
during the noncontingent stage be described
as "unadaptive"? An unqualified "yes"
or "no" cannot be given to this question, for if 
the subject should never be exposed to the 
avoidance contingency, then this behavior can 
be viewed as adaptive. Since the shocks are
presented noncontingently, any effort expended
in trying to alter or prevent their occurrence
is wasted effort. However, if the avoidance 
conditioning is forthcoming, then we might 
describe learned helplessness as being unadap- 
tive, because under these conditions the animal 
is responding to a set of contingencies that are 
no longer present. The general point is that
the "adaptiveness" of avoidance behavior as
well as any other behavior including phobias 
can only be defined in terms of the particular 
response under a particular set of conditions. 

It is interesting to note that the situation 
with Solomon and Wynne's (1953) subjects 
contains the element of conflict that Costello 
indicates is so characteristic of phobias ob- 
served in the clinic. That is, the dogs given 
traumatic avoidance conditioning were later 
presented with a conflict between "approach- 
ing" in order to avoid shock and actually jump- 
ing into shock. When presented with such a 
conflict, the subjects jumped even faster than 
before the element of conflict was introduced. 
This suggests the possibility that it is the 
presence of conflict that makes the behavior 
of an organism that has acquired a phobia seem 
so unadaptive. That is, the acquisition of 
strong fear or avoidance tendencies is not, in 
itself, unadaptive, but with reversal of contin- 
gencies that introduces conflict, even greater 
fear and tendencies to avoid are evoked. In 
spite of the fact that many organisms with 
phobic reactions also happen to have conflicts 
present, it is possible that avoidance behaviors 
and phobias are acquired according to the same 
principles and that Mowrer's two-factor theory 
specifies those principles. 

Costello relied heavily on Herrnstein’s (1969) 
recent interpretation of avoidance conditioning, 
according to which the CS in avoidance condi- 
tioning serves a discriminative function—its 
most salient characteristic being that it sets the 
occasion for the avoidance response. This is
in contrast to Mowrer's two-factor theory in
which CS onset arouses fear or anxiety that
motivates the avoidance response, and CS offset
provides the reinforcement (via fear reduction)
for avoiding. Herrnstein was extremely
ingenious in finding experimental avoidance
tasks in which CS offset was not a necessary
condition for avoidance to occur. For example,
avoidance responses were acquired in situations 
where the only effect of that response was to 
reduce (for a given time interval) the future 
probability (frequency) of shock presentation 
(Herrnstein & Hineline, 1966). Here, then, 
CS termination was not a necessary condition
for avoidance conditioning. On the other
hand, there are still situations in which it
should be recognized that CS termination does
provide a sufficient condition for avoidance to
occur. In one of the earliest studies on two-factor
theory (Brown & Jacobs, 1949), it was
shown that CS termination reinforced an instrumental
response even though the CS had
never before set the occasion for a response
that reduced the probability of shock.

Although Herrnstein's analysis forces a reevaluation
of two-factor theory, it does not demand
that we totally reject the postulated involvement
of conditioned fear reactions in
avoidance conditioning, for there is too much 
evidence to the contrary. As Herrnstein 
pointed out, and as Costello noted, the typical 
avoidance procedure does contain a Pavlovian 
conditioning component. When this compo- 
nent is removed, as in the Herrnstein and Hine- 
line (1966) study, we find that it is not a 
necessary condition for avoidance learning. 
However, other studies, such as that by Brown 
and Jacobs (1949), do seem to require that CS 
termination be classified as at least a sufficient 
condition for avoidance.

Even a very substantial revision in our con- 
ceptions of avoidance conditioning (which the 
evidence reviewed by Herrnstein seems to de- 
mand) does not require that we reject the 
proposition that avoidance behaviors and phobias
are "similar." It would suggest, however,
that the conceptions of the behavior therapists
be reevaluated and revised to be more
congruent with current laboratory knowledge.
The flexibility of the behavior therapies and their
openness to changes suggested by experimental
[...] most desirable characteristic
[...] comparison with more analytically
based therapies. Analytic conceptions of psychotherapy
tend to be more dogmatic and less
responsive to laboratory tests. Even though
two-factor theory had been shown to be inadequate
in some respects, it still may be a more
[...] and useful explanation of phobias
than a model that has little or no relationship
to laboratory findings. For example, [...]
as a "loss [...]ping reactions," but this alternative
[...] of almost all clinically [...]
model does not [...] explanation of phobias,
per se, nor does it seem open to empirical test.

In conclusion, some few words should be
[...] the possibility that behaviors
can be acquired and maintained under different
conditions. A behavior acquired under positive
reinforcement could, for example, be maintained
by negative reinforcement, and vice
versa. This same principle could apply in the 
case of avoidance conditioning where reactions 
could be acquired according to the principles 
of two-factor theory but be maintained at a 
continuing high rate by the principles of 
Herrnstein's "discrimination hypothesis." Such 
a proposal would explain why animals in avoid- 
ance conditioning seem to become less anxious 
as learning progresses. In any case, the possi- 
bility that phobias are acquired under one set 
of conditions but maintained under others 
should be considered further.

CONDITIONED AVOIDANCE RESPONSES AND PHOBIAS:
A REPLY TO WOLPE AND TO POWELL AND LUMIA 1


C. G. COSTELLO 2


University of Calgary


In the paper criticized by Wolpe and by Powell and Lumia, it was argued
that conditioned avoidance responses did not provide a plausible experimental
paradigm of phobias. These critics [...] with this argument. Instead, they
argue in favor of classically conditioned emotional responses, self-punitive
behavior, and learned [...] paradigms of phobia acquisition and maintenance.
[...] any further evaluation of the original proposal.



This paper concerns itself first with Wolpe, 
then with Powell and Lumia. 
Wolpe 

In reply to Wolpe (1971), it may be noted 
that (a) I do not entertain the “erroneous 
belief that behaviorists regard neurosis as an
instance of motor avoidance conditioning.” 
My paper was concerned with the behavior 
therapists’ theory of phobia development. All 
neurotic conditions are not phobic conditions. 
It must be admitted that Wolpe in his writings 
appears to place particular emphasis on the 
classical conditioning of anxiety responses. On 
the other hand, as he himself notes, conditioned 
anxiety is considered to be but one component, 
albeit the central one, of neurotic habits. In 
his first book, he lists “physical avoidance of 
stimuli conditioned to anxiety” as one of “the 
causal relations of some of the commonest 
phenomena of human neurotic behavior . . .
[Wolpe, 1958, Figure 6, p. 77]." In his latest
book, he lists the "avoidance of anxiety evoking 
stimuli" as one of the "consequences of neu- 
rotic anxiety." As the "mechanism" of this 
consequence, he lists "motor avoidance conditioning
[Wolpe, 1969, Table 1, pp. 24-26]."
It seems fairly clear that Wolpe accepts the 
conditioning of avoidance responses as a likely 
experimental analogue of phobia development.
Certainly the weight of the evidence is against 
any definition of anxiety solely in terms of auto- 
nomic responses (for a recent review of some 
of the relevant literature, see Lang, 1969).

There is no evidence in Eysenck (1968) that
he has, as Wolpe claims, adopted classical aversive
conditioning theory rather than an avoidance
theory for all neurotic conditions. He
states clearly that the account of neurotic disorders
given in Eysenck and Rachman (1965),
and to which I referred in my original paper,
"may serve to describe certain types of neurotic
disorders . . . [Eysenck, 1968, p. 316]."
He does not specify whether these neurotic
disorders include phobias. On the other hand,
he does not suggest that his theory of incubation
should now be adopted to account for
phobias. It might be worth noting also that
Eysenck chooses a "novel nomenclature" in
"putting over" his theory because "the classical
account, which is implicitly accepted when we
use the classical terms, is somewhat deficient
. . . [Eysenck, 1968, pp. 312-313]."

1 The preparation of this paper was supported by
Grant 606-7-136 from the Canadian Department of
Public Health.

The book by Eysenck and Rachman (1965)
was chosen as an example of behaviorists' accounts
of phobia development in terms of conditioned
avoidance responses. Similar accounts
can be found in Ullmann and Krasner (1969,
[...]), despite their Skinnerian rather
than [...] emphasis, Meyer and Crisp (1970,
pp. 265-271), and Yates (1970, [...]).
Bandura (1969) [...] Phillips (1970) [...]
assume, as I do, that therapists using
systematic desensitization in the treatment of
phobic disorders accept the conditioned avoidance
response analogue, though these authors
[...] critical of such acceptance.

[...] in terms of con[...] appear to be sufficiently
prevalent to justify the critique presented in
[...]. [...] wishes to pro[...]
[...]rely classical conditioning
[...]ty of such an account could
[...] examined in a [...] my original one.
It cannot be done here. The
kinds of data with which such an account will
have to contend ar[...]
found that a random association between CS
and UCS produced greater emotion defined in
terms of galvanic skin responses than either
forward or backward classical conditioning.

(c) Wolpe agrees with me that Solomon and 
Wynne's (1954) account of the resistance to 
extinction of avoidance responses is of doubt- 
ful validity. But whereas I suggest this to be 
so because the two-factor theory on which it 
is based is of doubtful validity, Wolpe rejects
the Solomon and Wynne account because “when 
‘reality testing’ does occur, it is not ‘curative.’ ” 
But it seems that, on the contrary, there is now 
considerable evidence that “the avoidance re- 
sponse extinguishes very rapidly following a 
period of response prevention . . . [Baum, 
1970, p. 277]." Furthermore, Wilson and 
Davison's (1971) review suggests that the 
provision of counterconditioning agents such 
as relaxation, drugs, and food probably expe- 
dites extinction of avoidance responses in ani- 
mals by facilitative exposure to the CS. 
Mathews (1971) makes a similar proposal to 
account for the success of systematic desensitization
in humans.

(d) I suggested that conflicts between appeti- 
tive and avoidance behaviors were always in- 
volved in clinical phobic disorders. Wolpe 
states that they are not. Unfortunately, neither 
of us has any experimental evidence to support 
our views; rather, we rely on our clinical impressions.

(e) Seligman's (1970) review of data indi- 
cating that not all events are equally associable 
certainly seems to be relevant to the under- 
standing of Bregman’s (1934) failure to pro- 
duce a conditioned fear response to a neutral 
CS. Seligman himself points this out. Wolpe 
may wish to consider some modification of his 
theory of phobias in the light of these data.

(f) Though it may be admitted that conflicting
data have appeared on the value of cognitive
desensitization and implosive therapy,
evidence that therapeutic techniques other than
systematic desensitization are effective for the
treatment of phobias continues to appear. To
those listed in the original paper, we may add
(i) relaxation training without specific desensitization
trials (Kondas, 1967; Paul, 1969;
Zeisset, 1968), (ii) programmed fantasy
(Crowder & Thornton, 1970), (iii) leucotomy
(Marks, Birley, & Gelder, 1966), (iv) antidepressant
drugs (Kelly, Guirguis, Frommer,
Mitchell-Heggs, & Sargant, 1970), (v) expectancy
manipulation ([...], 1969), and
(vi) intravenous acetylcholine injections
(Sim & Houghton, 1966).
Even if one wishes to agree with Wolpe that
“to the extent that methods derived from a 
causal theory are egregiously successful the 
theory becomes more plausible . . .," one 
should note that not all of the above methods 
or those listed in the original paper were de- 
rived from any one theory of phobia develop- 
ment. 

(g) Marks (1969, p. 100) reviews the clinical
evidence indicating that "patients with phobias
often have a shy, dependent, premorbid
personality." On the other hand, there is no
experimental evidence on the relationship of 
dependency needs and phobias. This is one 
of the reasons why I stated in my original 
paper that theories alternative to the condi- 
tioning theories should be given equal time in 
the laboratory.


Powell and Lumia 


In reply to Powell and Lumia (1971), it may 
be noted that (a) I cannot understand why 
Powell and Lumia thought I might consider
my specific criticisms relating to phobias and 
conditioned avoidance responses to represent 
an indictment against behavior therapies in 
general. I do not so consider them.

(b) I agree wholeheartedly that the terms 
“adaptive” or "adaptiveness" should not be 
used to describe some inherent characteristic 
of behavior. That is why I referred only to 
avoidance responses in the usual avoidance 
situation. Avoidance responses, it seems, may 
persist under circumstances where they are 
punished or where free noncontingent shocks 
are administered. Whether or not avoidance 
responses under these circumstances provide an 
adequate experimental analogue for phobias is 
another matter altogether. Similarly, the dis- 
ruptive effects of conditioned emotional re- 
sponses have no direct bearing on the adaptive- 
ness of conditioned avoidance responses in 
usual or unusual situations. If self-punitive 
masochisticlike behaviors of the hooded rat or 
if conditioned emotional responses are to be 
proposed as the analogues for phobias, the ade- 
quacy of the proposals could possibly be evalu- 
ated as I evaluated the conditioned avoidance 
response analogue in my original paper. Again, 
discussion of the adaptiveness or unadaptive- 
ness of behavior that is being acquired in the 
noncontingent stage of learned helplessness 
studies, though perhaps of general interest, has 
no direct bearing on the adaptiveness or
unadaptiveness of conditioned avoidance responses.

Powell and Lumia attempt to save two-factor 
theory by suggesting that the situation in which
avoidance responses are punished contains the
element of conflict that I indicated was
characteristic of phobias. It is not convincing. I
was clearly referring to an approach-avoidance
conflict. The self-punitive situations may
involve, as Brown (1969) notes, a weak
avoidance-avoidance conflict.

(c) In presenting evidence against the
two-factor theory of avoidance learning, neither
Herrnstein's (1969) paper nor my original 
paper relied solely on experimental data in 
which CS offset was not a necessary condition 
for avoidance to occur. Presentation, there- 
fore, by Powell and Lumia of a study ( Brown 
& Jacobs, 1949) showing that CS termination 
may reinforce an instrumental response has 
little impact, particularly since Herrnstein 
(1969) has ably demonstrated that interpre- 
tation of such findings is not by any means an 
easy matter. It should nevertheless be pointed 
out that there are a number of other studies 
presenting findings similar to those of Brown 
and Jacobs (for a recent review see Bitterman 
& Schoel, 1970). 

(d) My suggestion that phobias may be the 
result of “a failure to develop coping
behaviors . . .” (not the result of a “loss of
coping reactions" as Powell and Lumia wrongly 
quote me) cannot be developed here but has 
been further elaborated elsewhere (Costello, 
1971). One example of a coping response is 
the conditioned avoidance response as it is ac- 
quired and maintained in the usual avoidance 
situation. To quote Miller and Weiss (1969):


Thus, we suggest that having a coping response 
makes avoidance animals less afraid because re- 
laxation (or safety) becomes conditioned to the 
response, and then becomes increasingly anticipa- 
tory. Perhaps some of the various hypotheses we 
have discussed may provide the primitive basis for 
what, at its cognitively more complex human level, 
we call hope [1969, pp. 360-361]. 


But then to develop this coping response one 
has to be a rat, preferably hooded and not 
albino, and one has to be a lucky enough rat 
to be assigned to the right condition. Phobic 


patients appear to be far more helpless than 
hopeful.


A REPLY TO WOLPE


BERNARD WEITZMAN 1


New School for Social Research, New York 


In a reply to Wolpe's criticism of Weitzman's critique of behavior therapy, a
number of points are developed. It is asserted that behavior therapists do,
in fact, speak of the existence of a “learning theory.” It is argued that
operational definitions do not, as Wolpe seems to think, supersede theoretical
analyses. It is maintained that the issue of the stability of visualizations
in desensitization therapy is unresolved. An attempt is made to clarify
the basis of apparent misunderstandings of the theoretical arguments presented
by Weitzman.


1 Requests for reprints should be sent to Bernard
Weitzman, Graduate Faculty, New School for
Social Research, 66 West 12th Street, New York,
New York 10011.


Wolpe (1971) answers criticisms by Weitzman
(1967) of the behavioristic view of neurosis.
First, he seems to agree that no “learning
theory” exists which could be said to be the
source of behavior therapy. He suggests that
Weitzman is in error in thinking there is an
issue here to debate. He accounts for the
apparent misunderstanding by referring to “a
misleading impression” given by the way in which
certain behavior therapists speak. However,
in illustrating the position of behavior
therapists, Weitzman (1967) quoted Eysenck (1960):


behavior therapy is an alternative type of
treatment to psychotherapy . . . it is a superior type
of treatment, both from the point of view of
theoretical background and practical effectiveness . . .
in so far as psychotherapy is at all effective, it is
so in virtue of certain principles which can be
derived from learning theory . . . psychotherapy
itself, when shorn of its inessential and irrelevant
parts, can usefully be considered as a minor part
of behavior therapy (p. ix).


The position taken by Eysenck seems plainly
and clearly stated and is difficult to “write off”
as giving a “misleading impression.”

Second, Wolpe argues that “if parallel
procedures in cats and men cure the neuroses of
both, the species differences must be judged
irrelevant in the particular context.” Here
Wolpe supplies his own context. The context
in Weitzman’s (1967) discussion was a
theoretical consideration of the nature of the process
through which desensitization is accomplished.
In that context, that is, a discussion of the
process, species differences have crucial
relevance.

Third, Wolpe argues that the criticisms of
his use of the term “stimulus” are not valid.

In support of this assertion he quotes Lund- 
berg's definition of "stimulus." This definition 
merely denies the existence of the problem. It 
does not cogently defend Wolpe’s position. 
Wolpe also defends the validity of his use of 
the term "stimulus" by pointing to parallel 
results in desensitization comparing what he 
calls “extrinsic” versus “imaginal” stimuli. It
seems to this writer that the fact that a person 
may reduce his fear of his boss by confronting 
his boss and by imagining his boss is not ade- 
quate grounds for holding that the confronted 
boss and the imagined boss are identical stim- 
uli. To argue that stimuli are equivalent which 
elicit similar responses may have engineering 
logic but, at this level of analysis, is not
satisfying to theoretical reason.

Fourth, Wolpe states that “Patients seem almost
invariably able to maintain constant visualization
for the standard short spells [5-7 seconds].”
He intends, thereby, to undo the force of
observations indicating that desensitization
[...] by a process involving [...] cognitions.
Wolpe’s report [...] experience of the writer
which [...] that most patients need at least five
seconds to form a clear visualization, although
anxiety reactions to the verbal presentation [...]
are not uncommon, and simply continues [...]
argument. Since this point is crucial to the
condition[...]tion, it w[...]

[...] of “Weitzman’s defense of psychotherapy,”
Wolpe makes [...]

(a) Weitzman do[...] symptomatic be[...]
[...]ered by Freud
(1936). The present writer made no such
claim for desensitization. The argument pre- 
sented may be summarized as follows: If treat- 
ment by desensitization does not entail symp- 
tom substitution, then we must consider, as 
Freud did, the possibility that we were treat- 
ing a habit not maintained by “old desires and 
impulses.” In other words, it was being
pointed out that psychoanalytic thought could
entertain a “habit” conception of certain symptoms.

(b) Wolpe understands Weitzman as pro- 
posing that “desensitization may be a repres- 
sion-lifting mechanism.” In this section of the 
paper, Weitzman is, in fact, exploring the op- 
posite possibility, that is, that desensitization 
may be a repression-augmenting mechanism. 
That is, the possibility is presented that any 
ego-syntonic alteration of object relations, by 
any therapeutic means, may increase the repres- 
sive potential of the ego. In another section 
of the paper, Weitzman describes an application 
of the desensitization technique to the analysis 
of dreams. This use of desensitization is
described as facilitating the overcoming of
resistance and might, thus, accurately be
characterized as “repression lifting.” Behavior
therapists, however, are not typically in a position
to replicate this observation.

(c) Finally, Wolpe appears to have missed 
the main point of the analysis of the symbolic 
function of the symptom. It was argued there 
that the term “symptom treatment” is a mis- 
nomer. It implies a compartmentalization of 
man which is antagonistic to psychoanalytic 
thought. In fact, from the point of view of 
psychoanalysis, there can be no “symptom 
treatment.” This assertion is based on the 
postulated symbolic relationship between symp- 
tom and dynamic cause which is basic to psy- 
choanalytic thought. Once this relationship is 
grasped, it becomes clear that in treating a 
symptom one is, of necessity, affecting the 
status of the “internal danger.”


LETTERS TO THE EDITOR


A NONMEDIATIONAL IDEOMOTOR 
THEORY 


Greenwald's ideomotor theory (Greenwald, 
1970) makes two claims on page 93, the first 
of which is incorrectly interpreted, the second 
of which is logically questionable, if not im- 
possible. 

He postulates that “ideomotor analysis assumes
that responses are represented in the
form of images of their consequences . . .
[p. 93].” This point is substantiated by
spectrographic analyses of speech production, in
that the initial /s/ in /stru/ shows the
contextual influence of the vowel which follows
three phonemes later (Halwes & Jenkins,
1971). However, as Halwes and Jenkins
demonstrate, the “machinery” that is required to
generate such a context effect must be more
powerful than merely “subgoals . . . as images
of sensory feedback . . . [Greenwald, 1970, p.
93].”

In discussing symbolic mediators, Greenwald
claims that “ ‘ball’ is assumed to abstract the
common properties of a variety of stimuli . . .
[p. 93].” However, the number of features
common to all instances of a conceptual
mediator, such as “ball,” rapidly approaches an
infinitesimal, if not null, set. And to assert that
a ball is any type of an object that is set in
motion during a game will place the mediator
into the category of “ball-relevant responses,”
which merely begs the question. Greenwald's
claim would have been more accurate had he
used the verb “subtract” instead of “abstract.”
But a truly abstractive theory of concepts would
be generative (deriving a situation-specific
response from rules operating on abstract
entities), and not mediational (pairing a specific
abstracted [sic] feature with a specific
response).

SPENCE’S PREDICTION ABOUT 
REVERSAL-SHIFT BEHAVIOR 


A simulation (Wolford & Bower, 1969) of
Spence’s original discrimination-learning model
(Spence, 1936) demonstrated that in some
cases subjects should execute a reversal shift
more rapidly than an extradimensional
(nonreversal) shift. This finding [...]
be inconsistent with interpretations offered by
mediational theorists (e.g., Kendler & Kendler,
1962) that Spence’s single-unit formulation
predicted that a reversal should always
be slower than an extradimensional shift.
The latter interpretation is correct, however,
where reversal and extradimensional shifts
are compared for a discrimination-shift study
involving a counterbalanced design with two
orthogonal dimensions (e.g., brightness and
shape), each of which contains two values
(e.g., black, white, triangle, circle). Regardless
of what start vectors were used, two
independent simulations, by two of the
correspondents (MAH and GW), found that
Spence’s model always predicted a faster
extradimensional than reversal shift when preshift
training was carried to a criterion of 10
successive correct responses. In [...]
theory, depending on the initial start vectors,
can predict faster reversal or extradimensional
shifts. [...] counterbalanced
designs, [...] overtraining, the model is
unequivocal; reversals should be slower than
extradimensional shifts.


INTEGRATION THEORY AND ATTITUDE CHANGE 1

NORMAN H. ANDERSON 2


University of California, San Diego 


A theory of information integration is applied to attitudes and social judgments,
based on a principle of information integration. For quantitative analysis,
a simple but general algebraic model of judgment is used, in which each
informational stimulus is characterized by two parameters, scale value and weight.
Functional measurement procedures are employed to derive equal-interval
scales of parameter values. Exact tests based on analysis of variance are given
for four applications of the model, and these applications are reconsidered
under the further restriction imposed by the averaging hypothesis. Qualitative
comparisons are made to several other theories of attitude change.
Tendencies toward balance and congruity are shown to be consequences of the
principle of information integration. Critical tests between integration
theory and balance and congruity theories are also suggested. Similar
comparisons are made to summation theory, logical-consistency theory,
assimilation-contrast theory, and similarity-attraction theory. Molar and molecular
[...]s of communication structure are considered briefly and the analysis
of inconsistency resolution within integration theory is also discussed. Finally,
it is noted that integration theory has had reasonable success in the areas of
learning, perception, judgment, decision making, and personality impressions,
as well as attitude change. It may thus provide a beginning to a unified
general theory.


Attitude change stands out from most
areas of experimental psychology in the
nature of its stimuli. In even the simplest
investigations of attitudes and opinions,
the stimuli typically carry information at a
cognitive level not often reached in other
areas of research. Informational stimuli
continually impinge on the person, in life
or in the laboratory, and he must integrate
them with one another as well as with his
prior opinions and attitudes. Social judgments
are typically based on a cumulation
of various pieces of information, sometimes
of the most diverse nature. Factual and
hearsay evidence, rumors, prestige associations,
gesture and appearance, may all
bear on the final attitude. Information
integration is thus fundamental in attitude
change.


1 This work was supported by grants from the
National Science Foundation (GB-6666) and the
Office of Education (Project No. 9-0399) to the
author, and from the National Institute of Mental
Health to the Center for Human Information
Processing, University of California, San Diego. This
paper owes much to many people. I wish to express
my enduring appreciation to David A. Grant and
Carl I. Hovland who stimulated my interest in the
method and content of this work. I owe special
thanks to Sam Himmelfarb and Rhoda Lindner for
detailed, insightful criticism, and to Robyn Dawes
and Robert Wyer for many helpful comments. I am
also indebted for comments, not always heeded, on
particular points by Donn Byrne, Don Campbell,
Gerald Clore, Rudolf Cohen, Harry Gollob, Philip


In previous research, the writer has been
working on a general theory of information 
integration. Experimental applications 
have ranged from personality impression 
formation to decision making and psycho- 
physical judgment. The present paper 
gives a more detailed exposition of integra- 
tion theory in attitude change, developing 
further the theme of some of the first re- 
ports in this research program (Anderson, 
1959a; Anderson & Hovland, 1957) and of 
more recent work (Anderson, 1968a). 


INTEGRATION THEORY 
Integration Model 


A basic distinction in the present formu- 
lation is that between weight and value. 
Each piece of information is represented by 
two parameters: a scale value, s, and a 
weight, w. The value is the location of the 
informational stimulus along the dimension 
of judgment. The weight represents the 
psychological importance of the informa- 
tion. It is important to note that both s 
and w will depend on the dimension of judg- 
ment as well as the individual. For sim- 
plicity, only a single dimension of attitude 
or judgment will be explicitly considered. 
The same piece of information may have 
quite different value and importance on 
different dimensions, or for different indi- 
viduals on the same dimension. 

The theoretical model used in most of this 
work can be written as a weighted sum:


R = C + \sum_i w_i s_i.    [1]


Usually, R is taken to be the overt response,
measured on a numerical scale. In some
cases, however, R might be considered as
the attitude underlying an overt yes-no
decision. The summation in Equation 1 is
over all relevant informational stimuli.
The contribution of Stimulus i is just its
weight w_i times its value s_i. The constant,
C, allows for an arbitrary zero in the response
scale and will not be explicitly considered
here. The first term in the sum,
w_0 s_0, represents the initial opinion, prior to
receiving the informational stimuli.




Two basic algebraic operations, adding 
and multiplying, are involved in the inte- 
gration model. In simplest form, the total 
effect of any one informational stimulus is 
the product of its weight and its scale value. 
The total effect of two or more informa- 
tional stimuli is the weighted sum of their 
scale values. These two algebraic opera- 
tions correspond to two classes of experi- 
mental manipulations in attitude experi- 
ments as noted below. 
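
The weighted sum of Equation 1 can be sketched in a few lines of Python. This is only an illustration of the arithmetic: the function name `integrate` and every numerical weight and scale value below are hypothetical, chosen for the example, and are not taken from the article. Index 0 is assumed to hold the initial opinion.

```python
from typing import Sequence


def integrate(weights: Sequence[float], values: Sequence[float], c: float = 0.0) -> float:
    """Return R = C + sum_i w_i * s_i, the weighted sum of Equation 1."""
    if len(weights) != len(values):
        raise ValueError("weights and values must have the same length")
    # Multiplying rule: each stimulus contributes weight * scale value.
    # Adding rule: the contributions are summed across stimuli.
    return c + sum(w * s for w, s in zip(weights, values))


# Hypothetical numbers: position 0 is the initial opinion (w_0, s_0),
# positions 1 and 2 are two informational stimuli.
weights = [1.0, 2.0, 0.5]
values = [4.0, 7.0, 2.0]
response = integrate(weights, values)
```

With these made-up numbers the three contributions are 4, 14, and 1, so the response is dominated by the heavily weighted second term.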

Despite its simple nature, the algebraic 
judgment model has considerable flexibil- 
ity. It includes adding, averaging, sub- 
tracting, dividing, and multiplying models, 
as well as proportional-change and linear 
operator models. Special cases have been 
studied by numerous investigators in varied 
areas as cited here and elsewhere (Ander- 
son, 1970a; S. Rosenberg, 1968). 


Empirical Support 


The writer's program of research in
information integration has been based on a
systematic exploitation of the algebraic
judgment model. A substantial portion of 
the work has dealt with personality impres- 
sions (see Anderson, 1968a), including the 
study of stimulus interaction and configural 
use of cues (Anderson, 1965b; Anderson & 
Jacobson, 1965; Lampel & Anderson, 1968; 
Sidowski & Anderson, 1967). Other appli- 
cations include decision making (Anderson 
& Shanteau, 1970; Shanteau, 1970a), psy- 
chophysical judgment (Parducci, Thaler, 
& Anderson, 1968; Weiss & Anderson, 
1969), illusions (Anderson, 1970b; Massaro 
& Anderson, 1971), learning (Anderson, 
1969b; Friedman, Carterette, & Anderson,
1968; Himmelfarb, 1970), and attitude 
change (Sawyers & Anderson, 1971). 

The integration model has done well, 
both qualitatively and quantitatively. It 
rests upon a substantial data base. Although
a major part of the supporting
experimentation has been done with personality
impressions, the same principles
may be expected to apply to the more traditional
communication-persuasion paradigm.
Indeed, personality impression formation
involves attitude change toward a particular
individual and can be viewed as a miniature
attitude change situation. Each
trait adjective corresponds to a communi- 
cation whose weight can be controlled by 
attributing it to various sources, for in- 
stance, or by manipulating its reliability. 
The development of personality impres- 
sions and attitudes both involve the inte- 
gration of information into evaluative judg- 
ments that have social relevance. From 
the present standpoint, an a priori distinc- 
tion between them would be artificial. 


Valuation and Integration


The two fundamental operations in inte- 
gration theory are valuation and integra- 
tion. Valuation, in model terms, involves 
the determinants and the measurement of 
the w and s parameters. Integration com- 
prises the ways in which the several stimuli 
are combined. The judgment task may 
affect the integration rule and certainly will 
affect the valuation process. In general, 
however, valuation and integration may 
conveniently and effectively be kept sepa- 
rate as long as the stimuli do not interact. 

When the stimuli do interact, valuation 
and integration will become interlinked. 
As a working rule, it will be assumed here 
that stimulus interaction primarily affects 
the valuation process, and that the changed 
parameter values are then employed in the 
integration. 

Most of the experimental work to date 
has used the assumption that the w and s
values for an informational stimulus remain 
constant, regardless of what other stimuli 
it may be combined with. This is conven- 
ient in tests of the model, particularly in the 
early stages of development. Fortunately, 
the assumption that the stimuli do not in- 
teract seems to be justified in an important 
group of situations. 

More interesting though more difficult 
problems arise in the many situations in 
which context-induced changes may occur. 
Primacy effects (Anderson, 1965b; Asch, 
1946), positive context effects (Anderson, 
1966a; Kaplan, 1971), discounting effects 
(Anderson & Jacobson, 1965; Lampel & 
Anderson, 1968; Schümer & Cohen, 1968), 
and differential weighting (Oden & Anderson,
1971) have all been studied in personality
impressions with some theoretical success.
Such problems of configural cue usage
are even more important in the general
study of attitude change, as the later
discussion of the source-communication
relation will show.

The problem of stimulus configurality 
also bears directly on the problem of cogni- 
tive structure. A personality impression, 
for example, or even a trait adjective, ob- 
viously has a more complex cognitive 
structure than can be represented on a 
single dimension. A multidimensional rep- 
resentation would do better, of course, and 
the present development may be considered 
as applying to each of several dimensions in 
turn. 

However, the undoubted success of the 
dimensional view in psychology can lead to 
preoccupation with a restricted set of prob- 
lems, and to a possibly artificial view of 
cognitive structure. Information integra- 
tion, for example, may itself occur at a more 
basic level than the judgment, and the lat- 
ter may be merely the valuation of the 
structure along the specified task dimension 
(Anderson, 1967). The standard dimen- 
sional view will be used in this paper, but 
its probable limitations need particular at- 
tention in the study of stimulus interaction. 
For example, the data suggest that affective 
and semantic inconsistency produce about 
equal discounting (Anderson & Jacobson, 
1965). In inconsistent combinations, the 
response must be based on the stimulus 
configuration, and it is problematic how 
far standard dimensional representations 
of the stimulus will go. 


Application to Attitude Change 


In attitude change, the source-communi- 
cation-issue schema developed most exten- 
sively by Hovland and his associates (e.g., 
Hovland, 1957; Hovland, Janis, & Kelley, 
1953) provides a useful way to organize 
many problems. A communication from 
some source provides information that is 
relevant to the person's attitude or opinion 
on some issue. The effective stimulus is 
then the source-communication combina- 
tion. The present use of two parameters, 
weight and scale value, to represent the
source-communication parallels the approach
of Anderson and Hovland (1957).

The scale value of the source-communica- 
tion corresponds to its position on the atti- 
tude dimension. For many purposes, this 
can be taken as the position advocated by 
the source-communication. This value will
be primarily determined by the semantic
properties of the communication, as they
are interpreted within the person's value
system. Conceptually, it is sometimes useful
to consider the scale value as the attitude
that would be left unchanged by the com- 
munication. 
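
The remark that the scale value is the attitude a communication would leave unchanged can be checked numerically. The sketch below assumes a two-term averaging form of the model, in which the weights are normalized to sum to one. Averaging models are named in this article as a special case, but the formula is not written out in this section, so the function `averaged_response` and all numbers are illustrative assumptions only.

```python
def averaged_response(w0: float, s0: float, w1: float, s1: float) -> float:
    """Weighted average of the initial opinion (w0, s0) and one communication (w1, s1)."""
    total = w0 + w1
    if total <= 0:
        raise ValueError("weights must sum to a positive number")
    return (w0 * s0 + w1 * s1) / total


# Initial attitude already at the advocated position: nothing moves.
unchanged = averaged_response(w0=2.0, s0=5.0, w1=3.0, s1=5.0)

# Initial attitude below the advocated position: the response shifts toward it.
shifted = averaged_response(w0=2.0, s0=3.0, w1=3.0, s1=8.0)
```

Under this assumed form, a person whose prior attitude equals the scale value of the communication gives the same response afterward, whatever the weights; any other prior attitude is pulled part of the way toward that value.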

The weight parameter is more important
theoretically than the scale value because
of the many determinants of the weight. A
primary determinant of w_i will be characteristics
of the source. Thus, source status,
reliability, and expertise will all affect w_i.
Moreover, w_0, the weight parameter of the
initial opinion, will reflect personality
characteristics, such as persuasibility, as well as
ego involvement and strength of prior
opinion. In addition, the evidence indicates
[...] recency effects are [...]
in terms of the weight
parameter, as discussed later.

Because of [...], [...] emphasizes [...]
processes [...] to communication-persuasion
[...] than has been usual in attitude theory.
Traditional approaches have been more concerned
with qualitative analysis of valuation
than with integration per se. Since
functional measurement opens up possibilities
for direct measurement of the stimuli,
the present approach is less exclusively
concerned with qualitative analysis. Moreover,
because the theory rests on a principle
of information integration, the experimental
program has emphasized the study of
information variables in the stimulus. As
will become apparent, some of the
traditional problems of attitude theory are
particularly amenable to analysis within a
conceptual framework based on information
integration.


Functional Measurement


The power of the algebraic judgment
model has depended on the concomitant
development of a theory of functional
measurement. This scaling theory makes it
possible to get the equal-interval scales
that are needed in the quantitative tests
of the model. The basic ideas of functional
measurement were given in two early
papers (Anderson, 1962a, 1962b), and a
more systematic exposition in psychophysical
judgment is given in Anderson (1970a).
[...] here; the later
theoretical applications will exhibit functional
measurement in action.

A central idea of functional measurement
is that measurement scales are derivative
from substantive theory. Measurement
thus involves three simultaneous problems
(Anderson, 1970a, Figure 1): (a) Problem 1:
to measure the subjective stimulus values
on equal-interval scales; (b) Problem 2: to
measure the subjective response value on an
equal-interval scale; and (c) Problem 3: to
find the psychological law that relates the
subjective values of stimuli and response.
All three problems are to be solved together.
That this can sometimes be done quite
simply is shown in previous experimental
work using functional measurement procedures
(e.g., Anderson, 1962a). These
procedures are illustrated in the next
section.


Nearly all the experimental work in this research
program has used numerical response measures such
[...]. But rating scales may be only ordinal,
[...] tests of the model require an equal-interval
scale. Functional measurement allows
transformation of an ordinal scale into an equal-interval
scale on the response side. Functional scales
on the stimulus side can then be obtained. The
algebraic judgment model is used as the scaling frame so
that the validity of the scales depends on the validity
of the model. Substantive theory and measurement
theory are thus cofunctional in development.

Functional measurement may be contrasted with
Thurstonian scaling in several respects, two of which
should be noted here. First, the Thurstonian
approach is largely concerned with sti[...]
[...] traditionally seen as a preliminary to the
substantive inquiry. Such an approach has its
attractions but it bases the substantive [...]
separate scaling theory of uncertain validity
[...] as a consequence, [...]
to develop “measurement free”
methods that use only rank-order information. The
present approach, in contrast, treats scaling and 
theory as inherently interlinked at every level of 
consideration. 

A second difference between functional measure- 
ment and Thurstonian scaling is in the treatment of 
individual differences, a problem of some import- 
ance in attitudes and opinions. As is well known, 
Thurstonian paired-comparison scales of attitudinal 
stimuli almost necessarily rely on pooled group data. 
Most people will have clear preferences, and the re- 
sultant 0-1 probabilities disallow scaling at the indi- 
vidual level. Group scales (Bock & Jones, 1968, 
pp. 1-2) have normative and sociological value, as in 
Coombs's (1967) replication of Thurstone’s (1927)
scale of seriousness of offenses. However, their role 
in the study of psychological processes is severely 
limited. Functional measurement has the important 
advantage that it can operate directly at the level 
of the individual. 

Conceptually, the response scaling feature (Ander- 
son, 1962b, 1970a; Bogartz & Wackwitz, 1970, 1971) 
is essential to functional measurement. The logic of 
this approach rests on “using the postulated behav- 
ior laws to induce a scaling on the dependent vari- 
able" (Anderson, 1962b). Indeed, it would not be 
possible to consider this approach as a general theory 
of measurement without systematic methods for 
transforming ordinal response measures to an in- 
terval scale. Experimental illustrations are given 
in Anderson and Jacobson (1968) and Weiss and 
Anderson (in press). 

For the kinds of situations considered in this 
article, however, there is increasing evidence that 
ordinary rating procedures can provide interval 
scales simply and directly, at least with reasonable 
experimental precautions (e.g., Anderson, 1967; 
Himmelfarb & Senn, 1969; Shanteau & Anderson, 
1969). In this article, therefore, it will be assumed 
that the observed response is on an interval scale.
This assumption is directly testable since an inade- 
quate scale would tend to disconfirm the model pre- 
dictions. Conversely, if the model does fit the data, 
response scale and model are jointly validated. 

From the experimental standpoint, an important 
methodological feature of functional measurement is 
the use of stimulus combinatorics. Each informa- 
tional stimulus is systematically placed in several 
different combinations. This experimental strategy 
embodies a direct approach to the substantive prob- 
lem of stimulus integration. Happily, it also allows 
the effect of the given stimulus to be factored out 
and scaled. Extensive use is made of factorial de- 
signs, therefore, with analysis of variance a principal 
tool. 
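
The role of factorial designs can be illustrated with a small sketch. If the row and column stimuli combine by simple addition and do not interact, the cell means of a two-way design should show no row-by-column interaction; this is a standard property of additive tables rather than a procedure quoted from the article. The helper `interaction_residuals` and all cell values are hypothetical.

```python
from typing import List


def interaction_residuals(table: List[List[float]]) -> List[List[float]]:
    """Residuals of a two-way table after removing row and column effects."""
    n_rows, n_cols = len(table), len(table[0])
    grand = sum(sum(row) for row in table) / (n_rows * n_cols)
    row_means = [sum(row) / n_cols for row in table]
    col_means = [sum(table[r][c] for r in range(n_rows)) / n_rows
                 for c in range(n_cols)]
    # Cell minus row mean minus column mean plus grand mean.
    return [[table[r][c] - row_means[r] - col_means[c] + grand
             for c in range(n_cols)]
            for r in range(n_rows)]


def max_abs(table: List[List[float]]) -> float:
    """Largest absolute entry in a table."""
    return max(abs(x) for row in table for x in row)


# Hypothetical additive design: each cell is row effect + column effect.
additive = [[a + b for b in (0.0, 3.0)] for a in (1.0, 2.0, 4.0)]
additive_dev = max_abs(interaction_residuals(additive))

# Hypothetical multiplicative design: each cell is row effect * column effect.
multiplicative = [[a * b for b in (1.0, 2.0)] for a in (1.0, 2.0, 3.0)]
multiplicative_dev = max_abs(interaction_residuals(multiplicative))
```

In the additive table the residuals vanish up to rounding, while the multiplicative table leaves a clear residual pattern. An analysis of variance interaction test asks whether observed residuals of this kind exceed what error variability alone would produce.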

Conjoint measurement (Krantz & Tversky, 1970; 
Luce & Tukey, 1964) has certain similarities to func- 
tional measurement, but as a matter of principle re- 
stricts itself to rank-order data. Its principal virtue 
seems to be that it can provide rank-order tests of 
goodness of fit for certain models. That virtue by- 
passes the response scaling problem, which could 
certainly be advantageous, though practical difficulties
seem to have limited the use of these techniques
(Tukey, 1969; Zinnes, 1969). Functional measurement,
in contrast, enables one to use whatever metric
information is in the overt response. In that sense, 
the difference between functional and conjoint mea- 
surement parallels that between parametric and non- 
parametric tests (Anderson, 1961a). At a more 
fundamental level, the functional measurement ap- 
proach makes measurement theory an integral part 
of substantive psychological theory. As one conse- 
quence, it has the flexibility needed to cope with non- 
additivities caused by averaging processes and cer- 
tain kinds of stimulus interactions. 


MATHEMATICAL APPLICATIONS 


This section details several applications 
of the integration model, and much of it can 
be skipped initially. After the next two 
subsections, therefore, and the initial re- 
marks on the averaging hypothesis, it may 
be desirable to go to the main section of the 
paper, Relations to Other Theories. 


Basic Integration Rules 


The applications below all rest on two 
kinds of experimental manipulations: of 
the weight parameter and of the scale value. 
These correspond to three basic integration 
rules. The first is a multiplying rule which 
specifies the total effect of a communication 
to be the product of its weight and its value. 
The rationale for this multiplying rule can 
be illustrated by considering any method of 
manipulating weight. Source credibility, 
for example, does not of itself produce 
opinion change, but only affects degree of 
acceptance of the communication. A de- 
crease in source credibility would thus act 
by attenuating the effect of the communi- 
cation. 

The second basic integration rule specifies 
that two communications will follow some 
kind of adding rule, as in Application 1 be- 
low. An adding-type rule is intuitively 
attractive and has been employed in some 
form in almost all attempts at quantitative 
analysis. 

However, there is considerable evidence 
for an averaging rule instead of an adding 
rule. Mathematically, the two rules have 
considerable similarity, though the aver- 
aging rule tends to be more difficult to work 
with. In many cases, the two rules make 
the same prediction, but they disagree
in some important situations as
discussed under Averaging Theory.

Psychologically, adding and averaging
are quite different. In a strict adding
model, the informational stimuli can be
completely independent in their action.
In contrast, averaging always implies some
degree of cognitive interaction because the
weight of any one informational stimulus
necessarily depends on all the others.
Accordingly, averaging is considered as a third
basic rule and considered separately in the
next section.

A comment on terminology may help 
avoid some ambiguity in the use of the 
terms "adding" and "averaging." Because 
they make the same predictions under cer- 
tain conditions, there may be neither need 
nor basis to make a distinction. In many 
of the published papers that have employed 
one or the other model, no distinction has 
been necessary. In line with mathematical
usage, therefore, the term adding will be
used generically to include both strict adding
and averaging models. When the discussion
is to be restricted to one or the
other, this will be indicated. The results
of the following four applications do not all
hold for averaging models as discussed in
the next main section on averaging theory.


Application 1: Variation in s 


This application is appropriate when two
(or more) communications that have different
scale values on a given issue are combined.
In the experiment cited below, for
example, the communications were paragraphs
that described good and bad actions
of United States Presidents.

It is assumed that the communications
are presented according to a Row × Column
factorial design as illustrated in Table
1. The row and column factors each correspond
to one source, and the levels within
each factor correspond to communications
attributed to the row source and column
source, respectively. The source may be
only implicit, when the communications are
presented as historical fact, for example.

Under an adding model, the theoretical
entry in each cell of the design is

$$R_{ij} = w_0 s_0 + w_R s_{Ri} + w_C s_{Cj}. \qquad [2]$$


Norman H. ANDERSON 


Here $w_R$ and $w_C$ are the communication
weights associated with the row and column
sources; $s_{Ri}$ is the scale value of the
communication coming from Source R in row i;
$s_{Cj}$ is the scale value of the communication
from Source C in column j. Each opinion
is thus considered as the resultant of three
informational stimuli: the two communications
and the initial opinion. The initial
opinion will be assumed equal to zero, a
simplification that does not affect the
conclusions.

Equation 2 imposes two restrictions.
The first is that the communications do not
interact or change scale value when combined.
The second is that the row weight,
$w_R$, is constant over rows and similarly
for the column weight. Since $w_R$ represents
the joint effect of the row source and
the row communication, this requires
that each row communication have
the same natural weight and that source
and communication do not interact.

These restrictive assumptions should be
kept in mind when designing an experiment.
They are, however, more restrictive than
necessary for an adding model. Application 4,
below, allows the weights to vary
across rows and/or columns and still yields
exact predictions.
Test of fit. For this design, the model
makes a very simple prediction: any two
rows differ by a constant. For example, the
difference between rows 1 and 2 in
Table 1 is $w_R(s_{R1} - s_{R2})$. If the
model is correct, then the data
should show a very simple pattern. Graphically,
the three rows of data should plot as
three parallel curves except for sampling
error. An example based on Table 1 is
given in Figure 1 in the next section.
Statistically, this means that the Row × Column
interaction is zero in principle and
nonsignificant in practice. Thus, analysis of
variance provides a simple
test of goodness of fit.

It should be emphasized that this parallelism
test can be made directly on the raw
data. The person receives the two informational
stimuli and then makes his response
on a numerical scale. The test is made
directly on these responses. Separate estimates
of the scale values of the communications
are not required. This is invaluable
in attitude research because prior scaling
of the communications is often undesirable
or infeasible.


TABLE 1

INTEGRATION MODEL ILLUSTRATED FOR TWO-COMMUNICATION FACTORIAL DESIGN
WITH VARIOUS COMMUNICATIONS ATTRIBUTED TO EACH OF TWO SOURCES

A. Formal model expressions

Source R ($w_R$) | Source C ($w_C$): Communication C1 | Communication C2 | Communication C3 | Communication C4 | M
Communication R1 | $w_R s_{R1} + w_C s_{C1}$ | $w_R s_{R1} + w_C s_{C2}$ | $w_R s_{R1} + w_C s_{C3}$ | $w_R s_{R1} + w_C s_{C4}$ | $w_R s_{R1} + w_C \bar{s}_C$
Communication R2 | $w_R s_{R2} + w_C s_{C1}$ | $w_R s_{R2} + w_C s_{C2}$ | $w_R s_{R2} + w_C s_{C3}$ | $w_R s_{R2} + w_C s_{C4}$ | $w_R s_{R2} + w_C \bar{s}_C$
Communication R3 | $w_R s_{R3} + w_C s_{C1}$ | $w_R s_{R3} + w_C s_{C2}$ | $w_R s_{R3} + w_C s_{C3}$ | $w_R s_{R3} + w_C s_{C4}$ | $w_R s_{R3} + w_C \bar{s}_C$
M | $w_R \bar{s}_R + w_C s_{C1}$ | $w_R \bar{s}_R + w_C s_{C2}$ | $w_R \bar{s}_R + w_C s_{C3}$ | $w_R \bar{s}_R + w_C s_{C4}$ |

B. Numerical example

Source R ($w_R = 1$) | Source C ($w_C = 2$): $s_{C1} = 1$ | $s_{C2} = 2$ | $s_{C3} = 3$ | $s_{C4} = 6$ | M
$s_{R1} = 1$ | 3 | 5 | 7 | 13 | 7
$s_{R2} = 2$ | 4 | 6 | 8 | 14 | 8
$s_{R3} = 6$ | 8 | 10 | 12 | 18 | 12
M | 5 | 7 | 9 | 15 |
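
The parallelism prediction can be checked numerically on the example of Table 1B. The following Python sketch is offered only as an illustration and is not part of the original analysis; the function names are hypothetical, and the initial opinion is taken as zero, as assumed in the text.

```python
# Illustrative sketch (not from the article): adding model of Equation 2
# evaluated with the Table 1B parameters, initial opinion taken as zero.
W_R, W_C = 1, 2          # row and column weights
S_R = [1, 2, 6]          # row scale values s_Ri
S_C = [1, 2, 3, 6]       # column scale values s_Cj


def adding_table(w_r, w_c, s_r, s_c):
    """Predicted responses w_r * s_Ri + w_c * s_Cj for every cell."""
    return [[w_r * sr + w_c * sc for sc in s_c] for sr in s_r]


def interaction_residuals(table):
    """Cell value minus row mean minus column mean plus grand mean."""
    n_rows, n_cols = len(table), len(table[0])
    row_means = [sum(row) / n_cols for row in table]
    col_means = [sum(row[j] for row in table) / n_rows for j in range(n_cols)]
    grand = sum(row_means) / n_rows
    return [[table[i][j] - row_means[i] - col_means[j] + grand
             for j in range(n_cols)] for i in range(n_rows)]


table = adding_table(W_R, W_C, S_R, S_C)
# Rows 1 and 2 differ by w_R * (s_R1 - s_R2) in every column.
row_diffs = [a - b for a, b in zip(table[0], table[1])]
residuals = interaction_residuals(table)

print(table)      # [[3, 5, 7, 13], [4, 6, 8, 14], [8, 10, 12, 18]]
print(row_diffs)  # [-1, -1, -1, -1]
print(all(r == 0 for row in residuals for r in row))  # True
```

With real data the residuals would not vanish exactly; the analysis of variance then asks whether they exceed what sampling error would produce.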

Application 1 has special interest because 
it holds when the communications are pre- 
sented in strictly serial order as is typical 
of attitude change situations. The row 
factor then corresponds to the first com- 
munication, the column factor to the sec- 
ond communication, etc. Applications of 
such serial-factor designs have been made 
in a number of other areas (e.g., Anderson, 
1964d; Shanteau, 1970a; Weiss & Anderson,
1969). Under certain conditions, it is
possible to construct the serial-position 
curve knowing only the response at the end 
of the sequence. 

Functional scales. A striking property of 
the integration model is that it provides a 
method for scaling the communications. In 
Table 1, the column means are linear functions
of the $s_{Cj}$,

$$\bar{R}_{\cdot j} = w_R \bar{s}_R + w_C s_{Cj}, \qquad [3]$$

where the mean row value, $\bar{s}_R$, is a constant.
The observed column means, therefore, are 


estimates of the communication values on 
an equal-interval scale. 

This is illustrated in the numerical ex- 
ample in the lower half of Table 1. The 
column means are twice the column scale 
values plus 3; the row means are the row 
scale values plus 6. These interval scales 
have arbitrary zero and unit and are unique 
only up to a linear transformation. This 
needs to be kept in mind for scaling pur- 
poses; more than two row stimuli would 
ordinarily be needed to estimate any scale 
values for the row stimuli. Of course, no 
more than two rows are needed for the test 
of fit, and two rows suffice for scaling under 
certain conditions when the same stimuli 
are used in each factor of the design (e.g., 
Weiss & Anderson, 1969). 
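
The scaling property of the marginal means can be illustrated on the same numerical example. The sketch below is an illustration under the assumptions of Table 1B; the helper `rescale` is a hypothetical name, introduced only to remove the arbitrary zero and unit before comparing the means with the scale values.

```python
# Illustrative sketch: marginal means of the Table 1B numerical example.
table = [
    [3, 5, 7, 13],     # s_R1 = 1
    [4, 6, 8, 14],     # s_R2 = 2
    [8, 10, 12, 18],   # s_R3 = 6
]
s_r = [1, 2, 6]        # row scale values used to build the table
s_c = [1, 2, 3, 6]     # column scale values used to build the table

col_means = [sum(row[j] for row in table) / len(table) for j in range(4)]
row_means = [sum(row) / len(row) for row in table]


def rescale(values):
    """Map values linearly onto [0, 1]; removes arbitrary zero and unit."""
    lo, hi = min(values), max(values)
    return [(v - lo) / (hi - lo) for v in values]


print(col_means)  # [5.0, 7.0, 9.0, 15.0], i.e. 2 * s_Cj + 3
print(row_means)  # [7.0, 8.0, 12.0], i.e. s_Ri + 6
# After rescaling, the means and the scale values coincide.
print(rescale(col_means), rescale(s_c))
```

The agreement after rescaling is what is meant by saying that the scales are unique only up to a linear transformation.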

These results are not restricted to two- 
way designs but hold for any number of 
factors. The first experimental application 
of functional measurement (Anderson, 
1962a) used a three-way design for person- 
ality impressions. 

Experimental example. Figure 1 shows
an experimental test of integration theory.
Subjects judged general statesmanship of
various United States Presidents, each described
by two paragraphs based on historical
sources (Sawyers & Anderson, 1971).
Very positive, mildly positive, mildly negative,
and very negative paragraphs are denoted
by H, M+, M−, and L. Each panel
shows a 2 × 2 design which should obey
the parallelism prediction. Parallelism
seems to hold fairly well except that the
discrepancy was significant in the top
center panel.

FIG. 1. Mean attitude toward United States Presidents produced by pairs of
informational paragraphs. (Equal weighting model predicts parallelism for
the 2 × 2 design in each panel. Mean squares for interaction and error denoted
by I and E. H, M+, M−, and L denote paragraphs of highly favorable, moderately
favorable, moderately unfavorable, and very unfavorable information.)
(Reprinted from an article by Barbara K. Sawyers and Norman H. Anderson
published in the May 1971 Journal of Personality and Social Psychology.
Copyrighted by the American Psychological Association, Inc., 1971.)

Adding and averaging models make the
same parallelism prediction in this experiment
and are not distinguishable from these
data. However, subsequent work appears
to rule out the adding model for President
judgments.

The parallelism test is made directly on
the raw response and does not require actual
estimation of the scale values. Normative
ratings were used to class the paragraphs as
H, M+, M−, or L, but the only function of
these categories was to separate the data
curves in order to allow potential nonparallelism
to appear. It should be noted that
the 2 × 2 design is frequently not convenient
or usable for functional scaling. However,
the primary purpose of this experiment
was to test the model rather than scale
the stimuli.

Application 2: Variation in w


This application is similar to the first,
but with manipulation of the weight parameter
instead of the scale value. This application
would be appropriate for studying
the combined effects of two different sources.
Since it can provide an interval scale of the
weight parameter, it may be especially useful
in the analysis of the many determinants
of source effects.

A factorial design is employed, as in Application 1,
but with the roles of w and s
interchanged. Each factor corresponds to
one communication; the levels within each
factor correspond to different sources to
which the communication is attributed.
For a two-way design, the model becomes

$$R_{ij} = w_0 s_0 + w_{Ri} s_R + w_{Cj} s_C, \qquad [4]$$

parallel to Equation 2.

The assumptions of Equation 4 are analogous
to those of Equation 2, though perhaps
more easily satisfied in practice. The
row communication is assumed to have the 
same scale value regardless of which source 
it is combined with, and similarly for the 
column communication. The weight of 
each communication depends on the source 
and will in general be different in each row 
and column. Finally, as in all these ap- 
plications, it is assumed that the combining 
of two stimuli within any cell of the design 
does not change the parameter values. 

For a strict adding model, the analysis is 
identical to that of Application 1. If the 
data satisfy the model, therefore, they pro- 
vide equal-interval scales of the weight 
parameters. This application may thus 
be useful in the analysis of source effects, 
though it should be recognized that it as- 
sumes a strict adding model. 


Application 3: Variation in both w and s 


Application 1 was concerned with how 
two communications are combined, and 
Application 2 with how two sources com- 
bine. Application 3 is concerned with how 
one source and one communication com- 
bine. It may be particularly useful when 
only a single source-communication can be 
given. 

Mathematically, Application 3 is inter- 
esting since it rests on a multiplying model 
instead of an adding model. As a conse- 
quence, the model analysis requires some 
new developments. A two-way factorial 
design is used again, but with the row factor 
representing different sources and the col- 
umn factor representing different communi- 
cations. The model then becomes 


$$R_{ij} = w_0 s_0 + w_{Ri} s_{Cj}, \qquad [5]$$


where $w_{Ri}$ is the weight of the source in row
i, and $s_{Cj}$ is the value of the communication
in column j. In contrast to Applications 1
and 2, there is only one communication in
each cell of this design. The assumptions
required by Equation 5 are that the weight
parameters depend only on the row and the
scale values depend only on the column.
Equation 5 is a multiplying model, since
the initial opinion term is just an additive
constant. Functional measurement for
multiplying models has been discussed for
psychophysical judgment (Anderson,
1970a), and it has been used to scale subjective
probability and utility in decision-making
experiments (Anderson & Shanteau,
1970; Shanteau, 1970b). The same
approach may be used here.


TABLE 2

HYPOTHETICAL DATA FOR SOURCE × COMMUNICATION
DESIGN TO ILLUSTRATE FUNCTIONAL
MEASUREMENT FOR A MULTIPLYING MODEL

Source | Communication 1 | 2 | 3 | 4 | 5 | M
1 | 9 | 14 | 29 | 24 | 44 | 24
2 | 8 | 12 | 24 | 20 | 36 | 20
3 | 6 | 8 | 14 | 12 | 20 | 12
4 | 5 | 6 | 9 | 8 | 12 | 8
M | 7 | 10 | 19 | 16 | 28 |
Est(s) | 0 | 1 | 4 | 3 | 7 |

The hypothetical data of Table 2 will 
illustrate the procedure. Each entry is to 
be considered as the attitude resulting from 
one combination of source and communica- 
tion. The first step is to test whether these 
data follow the multiplying model of Equa- 
tion 5. 

Test of fit. Testing the model would be
straightforward if the communication scale
values were known. Plotting the entries in
any one row as a function of the $s_{Cj}$ would,
if Equation 5 was correct, yield a straight
line with slope $w_{Ri}$. Although the $s_{Cj}$ are
not known, this idea provides the basis for
solving the problem.

A simple graphic solution may be obtained
as follows. First, two communications
are assigned arbitrary values. In
Table 2, the first two communications have
been assigned the values 0 and 1 in the last
row of the table. The observed means for
these communications, 7 and 10 in the table,
are plotted as a function of these assigned
scale values. These two points define a
straight line, and each remaining communication
mean is then marked on this line at
its proper elevation. The downward projection
of this point on the value axis is a
provisional estimate of the scale value of
that communication. Thus, if 19 is marked
on the line joining 7 and 10, its projection
on the value axis, namely 4, is a provisional
estimate of the scale value of Communication 3.
In the same way, the scale values of
Communications 4 and 5 are estimated as
3 and 7 in the last row of the table.

The final and critical step is to plot the 
data of each separate row as a function of 
these provisional scale values. If the model 
is correct, these row curves will form a 
family of diverging straight lines. Devia- 
tions from this pattern would infirm the 
model. 
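
The graphic procedure just described can also be carried out arithmetically. The following Python sketch applies it to the hypothetical data of Table 2; it is meant only as an illustration, and the helper `line_fit` is a hypothetical name for an ordinary least-squares line.

```python
# Illustrative sketch: provisional scaling for the hypothetical Table 2 data.
data = [
    [9, 14, 29, 24, 44],   # Source 1
    [8, 12, 24, 20, 36],   # Source 2
    [6, 8, 14, 12, 20],    # Source 3
    [5, 6, 9, 8, 12],      # Source 4
]
n_rows, n_cols = len(data), len(data[0])
col_means = [sum(row[j] for row in data) / n_rows for j in range(n_cols)]

# Assign the arbitrary values 0 and 1 to Communications 1 and 2; the line
# through their means then converts every column mean to a scale value.
unit = col_means[1] - col_means[0]
est_s = [(m - col_means[0]) / unit for m in col_means]


def line_fit(x, y):
    """Least-squares slope and intercept of y on x."""
    mx, my = sum(x) / len(x), sum(y) / len(y)
    slope = (sum((a - mx) * (b - my) for a, b in zip(x, y))
             / sum((a - mx) ** 2 for a in x))
    return slope, my - slope * mx


# Plot (here: fit) each row against the provisional scale values.
fits = [line_fit(est_s, row) for row in data]
slopes = [m for m, _ in fits]
max_dev = max(abs(row[j] - (b + m * est_s[j]))
              for row, (m, b) in zip(data, fits) for j in range(n_cols))

print(est_s)    # [0.0, 1.0, 4.0, 3.0, 7.0]
print(slopes)   # [5.0, 4.0, 2.0, 1.0]
print(max_dev)  # 0.0
```

The four rows fall exactly on straight lines of different slope, which is the diverging fan that the multiplying model requires; real data would show deviations to be judged against sampling error.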

To supplement this graphical test, an 
exact statistical test is also available by 
appropriate use of analysis of variance. 
Since the model predicts nonparallel curves, 
the Row X Column interaction is nonzero. 
However, if the model is correct, all the 
interaction is concentrated in the Linear 
X Linear component, and the residual in- 
teraction should be nonsignificant (Ander- 
son, 1970a, p. 157). 
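
One way to see what concentration in the Linear × Linear component means is to compute it for the hypothetical data of Table 2. The sketch below uses the marginal means, taken as deviations from the grand mean, as comparison coefficients; this is an assumed, simplified version for a single score per cell and is not a transcription of the computing procedure in Anderson (1970a).

```python
# Illustrative sketch: split the Row x Column interaction of Table 2 into a
# Linear x Linear (bilinear) component and a residual.
data = [
    [9, 14, 29, 24, 44],
    [8, 12, 24, 20, 36],
    [6, 8, 14, 12, 20],
    [5, 6, 9, 8, 12],
]
n_rows, n_cols = len(data), len(data[0])
row_means = [sum(row) / n_cols for row in data]
col_means = [sum(row[j] for row in data) / n_rows for j in range(n_cols)]
grand = sum(row_means) / n_rows

# Comparison coefficients: marginal means as deviations from the grand mean.
row_coef = [m - grand for m in row_means]
col_coef = [m - grand for m in col_means]

ss_interaction = sum(
    (data[i][j] - row_means[i] - col_means[j] + grand) ** 2
    for i in range(n_rows) for j in range(n_cols))
contrast = sum(row_coef[i] * col_coef[j] * data[i][j]
               for i in range(n_rows) for j in range(n_cols))
ss_bilinear = contrast ** 2 / (
    sum(r * r for r in row_coef) * sum(c * c for c in col_coef))
ss_residual = ss_interaction - ss_bilinear

print(ss_interaction, ss_bilinear, ss_residual)  # 300.0 300.0 0.0
```

For these error-free data the whole interaction sum of squares lies in the bilinear component, leaving a residual of zero. With observed data the residual would be tested against an error term, which this sketch omits.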

Two extensions of this application deserve
mention. The first adds on a second
two-way design so that each cell contains
two such source-communication combinations.
The analysis would then be similar
to the duplex bets studied in Anderson and
Shanteau (1970). Another extension adds
on a third factor representing a second
source variable so that each communication
is attributed to two sources. This would
be useful in the theoretical analysis of
source combinations.

Functional scales. Verification of the
model simultaneously validates the provisional
scale values used in testing it.
Indeed, Equation 5 implies directly that
the column means of the Source × Communication
design are a linear function of
the communication values, $s_{Cj}$. Similarly,
the row means are a linear function of the
source-communication weight, $w_{Ri}$. The
marginal means of the factorial design thus
provide equal-interval scales of the stimulus
parameters. The multiplying model is
just like the adding models in this respect.


Application 4: Variation in both w and s


The last application is a generalization of
Application 1 in which a different source-communication
is allowed in each row and
in each column. It may thus be applicable
to situations in which the communication
interacts with the source, as in studies of
source expertise.

A two-way factorial design is used again.
The strict adding model then implies that

$$R_{ij} = w_0 s_0 + w_{Ri} s_{Ri} + w_{Cj} s_{Cj}, \qquad [6]$$

in which different weights and values are
allowed in each row and in each column.
The only restrictive assumption is that the
row and column stimuli do not interact
when combined in any cell. However,
source and communication in any given row
or column may interact in any manner.

Even in this general case, the adding
model still makes the parallelism prediction.
Analysis of variance can thus be applied
just as in Application 1. There is a loss in
precision since the functional scales do not
separate the weights and the scale values.
Nevertheless, the row means still measure
something useful, namely, $w_{Ri} s_{Ri}$, which is
just the total effect of the given communication.

Applications 1 and 4 bear a close relation.
In practice, the row weight in Equation 2
might depend on the scale value of the
communication. It could not then be considered
as a constant, $w_R$, but would need
to be taken as a variable, $w_{Ri}$. The adding
model predicts parallelism in either case. In
an averaging model, however, the analysis
becomes more difficult as will now be seen.
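
That the strict adding model keeps the parallelism prediction even with a separate weight in every row and column can be verified with a small numerical sketch. All parameter values below are made up for illustration and do not come from the article.

```python
# Illustrative sketch with made-up parameter values (not from the article):
# strict adding model of Equation 6, each row and column having its own
# weight and scale value.
W0_S0 = 0.0                     # initial-opinion term w_0 * s_0
W_ROW = [0.5, 1.0, 2.0]         # hypothetical row weights w_Ri
S_ROW = [1.0, 2.0, 6.0]         # hypothetical row scale values s_Ri
W_COL = [1.2, 1.4, 1.6, 2.2]    # hypothetical column weights w_Cj
S_COL = [1.0, 2.0, 3.0, 6.0]    # hypothetical column scale values s_Cj

table = [[W0_S0 + wr * sr + wc * sc for wc, sc in zip(W_COL, S_COL)]
         for wr, sr in zip(W_ROW, S_ROW)]

# Parallelism: the gap between two rows is the same in every column.
gap_1_2 = [table[1][j] - table[0][j] for j in range(len(S_COL))]

# Row means equal w_Ri * s_Ri plus a constant, so they scale the total
# effect of each row stimulus without separating weight from value.
row_means = [sum(row) / len(row) for row in table]
total_effects = [wr * sr for wr, sr in zip(W_ROW, S_ROW)]
offsets = [m - e for m, e in zip(row_means, total_effects)]

print(gap_1_2)   # each entry is 1.5, up to rounding error
print(offsets)   # the same constant in every row, up to rounding error
```

The row means recover the products of weight and value only, which is the loss in precision mentioned in the text.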


AVERAGING THEORY 


There is increasing evidence for the importance
and ubiquity of averaging processes
in information integration. This
finding has fundamental importance in integration
theory. It affects every aspect of
an experiment, from general design and
procedure to theoretical interpretation.

The critical evidence will be briefly noted,
and then the four applications discussed
above will be considered from the averaging
standpoint.


Averaging versus Adding 


Two main experimental paradigms have 
been used for critical comparisons between 
averaging and adding. First, adding mildly 
polarized to highly polarized stimulus in- 
formation decreases response polarity (An- 
derson, 1965a; Anderson & Alexander, 
1971; Hendrick, 1968; Lampel & Anderson, 
1968; Manis, Gleason, & Dawes, 1966; 
Levin & Schmidt, 1970; Oden & Anderson, 
1971). This result eliminates a direct 
adding formulation, though it might be 
saved by postulating a contrast effect. 
Thus, an adding model would predict a 
decrement if mildly favorable information 
received a negative evaluation in the con- 
text of highly favorable information. It 
seems doubtful that such contrast effects 
play an effective role with verbal stimuli 
(e.g., Anderson, 1971a, 1971b; Anderson 
& Lampel, 1965; Kaplan, 1971; Tesser, 
1968; Wyer & Dermer, 1968), but the 
possibility cannot yet be ruled out. 

The second result is that adding informa- 
tion of the same value typically makes the 
response more extreme; this is the familiar 
set-size effect. Qualitatively, the set-size 
effect accords with an adding process, 
though no adding formulation has yet given 
a quantitative account. More seriously,
the set-size effect eliminates a simple aver- 
aging model in which a constant stimulus 
mean would produce a constant response. 

The present averaging formulation ac- 
counts for the set-size effect by the use of 
an initial opinion. The set-size function is 
then a growth curve of somewhat different 
form for serial presentation (Anderson, 
1959a) than for simultaneous presentation 
(Anderson, 1965a). In this form, the aver- 
aging model can give a good quantitative 
account of the set-size effect (Anderson, 
1967). 

This last point bears emphasis in two 
respects since it has sometimes been mis- 
interpreted. First, the averaging formula- 
tion can account for both incremental and 
decremental effects of added information, 
so there is no need to postulate two pro- 
cesses. Second, the averaging formulation 
has given exact, quantitative accounts
of both the incremental and decremental
effects.
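
Both effects can be reproduced with a single averaging rule that includes an initial opinion. The sketch below is an illustration only: the scale values, the equal weights, and the initial opinion of zero with unit weight are all assumed for the example.

```python
# Illustrative sketch with assumed values: one averaging rule with an
# initial opinion yields both the set-size effect and the decrement.
def averaged_opinion(values, weights, s0=0.0, w0=1.0):
    """Weighted average of the stimuli and an initial opinion (s0, w0)."""
    numerator = w0 * s0 + sum(w * s for w, s in zip(weights, values))
    return numerator / (w0 + sum(weights))


# Set-size effect: more items of the same value give a more extreme response.
set_size = [averaged_opinion([3.0] * k, [1.0] * k) for k in (1, 2, 4)]

# Decrement: mild items added to extreme items pull the response back.
extreme_only = averaged_opinion([3.0, 3.0], [1.0, 1.0])
extreme_plus_mild = averaged_opinion([3.0, 3.0, 1.0, 1.0], [1.0] * 4)

print(set_size)                          # [1.5, 2.0, 2.4]
print(extreme_only, extreme_plus_mild)   # 2.0 1.6
```

A strict adding rule applied to the same stimuli would give 6 for the two extreme items and 8 once the mild items are added, an increase where the averaging rule gives a decrease.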

An interesting appearance of the set-size 
effect can be seen in an argument of M. J.
Rosenberg (1968), who makes the plausible 
argument that a given piece of information 
that could produce considerable opinion 
change if the issue was new and unfamiliar 
would have little effect if the issue was old 
and familiar. Rosenberg interpreted this 
argument against averaging as well as add- 
ing. However, it accords with the averag- 
ing set-size function on which the old 
opinion, based on a considerable accumula- 
tion of information, would be near asymp- 
tote. More precisely, the old opinion, being 
based on much more evidence, would have 
a much greater weight parameter than the 
new opinion. In the averaging formulation,
the weight of the given piece of information
must then be much less relative to the old 
opinion, since the weights must sum to one. 


Averaging Assumptions 


Mathematically, the difference between 
adding and averaging models is simple. 
Adding models impose no constraints on 
the weight parameters, whereas averaging 
models require the weights to sum to one. 

Averaging always implies stimulus inter- 
action because of this constraint on the 
weight parameters. In an adding model, 
the effect of any one stimulus may be inde- 
pendent of the other stimuli. But in an 
averaging model, the effect of any one 
stimulus is inherently dependent on the 
whole set of information. In a certain 
sense, therefore, an averaging model has 
gestalt qualities, since the role of each part 
depends on the whole. 

For present purposes, therefore, the as- 
sumption of no interaction will be used to 
mean that each stimulus has a natural or 
absolute weight which, together with its 
scale value, is constant across stimulus 
combinations. The effective weight of each
stimulus is then its absolute weight divided
by the sum of the absolute weights of all the
stimuli in the given combination. In this way,
each application considered above is di- 
rectly convertible to an averaging model 


by dividing by the sum of the weights, and 
this procedure will be followed here. 
These effective or normalized weights sum 
to one for each stimulus combination. 
Mathematically, it is often convenient to 
use unnormalized weights at intermediate 
stages in the calculations or derivations. 
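
The normalization can be written out in a few lines. In this sketch the function name and the weights are hypothetical; it shows only that the effective weight of a stimulus changes with its partners even though its absolute weight does not.

```python
# Illustrative sketch with assumed weights: effective (normalized) weights.
def effective_weights(absolute_weights):
    """Divide each absolute weight by the sum over the combination."""
    total = sum(absolute_weights)
    return [w / total for w in absolute_weights]


# The same stimulus (absolute weight 2) in two different combinations.
with_light_partner = effective_weights([2.0, 1.0])
with_heavy_partner = effective_weights([2.0, 6.0])

print(with_light_partner[0])  # two thirds of the total weight
print(with_heavy_partner[0])  # 0.25
```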


Application 1 


For this application, the weight $w_R$ was
assumed constant across rows, and the
weight $w_C$ was assumed constant across
columns. Equation 2 then implies that the
sum of the weights, $w_0 + w_R + w_C$, is
constant, the same in each cell of the design.
Dividing each predicted attitude by this
constant is equivalent to changing the unit
or range of the response scale. Since the
scale unit is arbitrary, adding and averaging
are not distinguishable under the given
assumptions. Both predict parallelism.

A difference appears as soon as the weights vary across
rows or columns. Not much is lost under
the adding model; it still predicts parallelism
as discussed under Application 4 above.
But since the sum of the weights will now
vary over the design, the
averaging model predicts systematic deviations
from parallelism.

Because of this complication in the aver- 
aging analysis, experimental precautions to 
help ensure equal weighting may be worth- 
while. Of course, this may not be possible 
when the stimuli vary naturally in weight. 
In the personality-impression task, for in- 
stance, the response to a combination of 
moderate and extreme negative adjectives 
is displaced toward the extreme (Anderson, 
1965a, 1968b; Anderson & Alexander, 1971).
This directional result is consistent with
the hypothesis that the extreme adjectives
have greater natural weight and is one piece
of evidence that favors the averaging
hypothesis.


Application 2 


The averaging model is quite different 
from the strict adding model in this design.
Written as an averaging model, Equation 4
becomes

$$R_{ij} = (w_0 s_0 + w_{Ri} s_R + w_{Cj} s_C)/(w_0 + w_{Ri} + w_{Cj}). \qquad [7]$$

Since the denominator varies from cell to
cell of the design, the parallelism prediction
will not hold. The deviations from parallelism
will still show a systematic pattern,
however, depending on the pattern of
weight parameters. The expected form of
the nonparallelism can be studied by using
Equation 7 to determine how the difference
between any two rows varies with column
weight. The mathematical analysis is
fairly straightforward but too detailed to
include here.


To illustrate the matter, consider a square design
for which the weights in row i and column i are equal.
Suppose that the weights are arranged in increasing
order of magnitude, from left to right across columns,
and from top to bottom down rows. Finally, assume
that $w_0$, the weight of the initial opinion, is zero.
(The design provides a partial check on this last assumption,
since it implies that the entries in every
cell along the main diagonal are constant, equal to
the simple mean value of the row and column communications.
If $w_0$ is not zero, then averaging theory
predicts a trend along the main diagonal from $s_0$ toward
the mean of $s_R$ and $s_C$.)

In this design, the global shape of the set of predicted
curves is similar to a correlation ellipse, truncated
at either end. Above the diagonal, the row
curves converge to the right; below the diagonal, the
row curves converge to the left. The overall trend
is upward if the column communication has greater
value, downward if the row communication has
greater value. The magnitude of the trend depends
on the difference between the values of the row and
column communications. If these are equal, then
the predicted response is the same in all cells.
An experimental realization of this illustrative
design would be available with the personality-impression
task, using sets of personality-trait
adjectives that describe a single person. Weight
could be manipulated by specifying the occupations
of the sources (Rosenbaum & Levin, 1968,
[...]) or by otherwise varying the
[...] information.
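
The diagonal check described above can be made concrete with a small numerical sketch of Equation 7. The weights and scale values here are hypothetical; only the structure of the square design follows the text.

```python
# Illustrative sketch with assumed values: square design for Equation 7 in
# which row i and column i carry the same source weight.
WEIGHTS = [1.0, 2.0, 4.0]            # hypothetical weights, increasing
S_ROW_COMM, S_COL_COMM = 2.0, 6.0    # hypothetical scale values s_R, s_C


def response(w_row, w_col, w0=0.0, s0=0.0):
    """Averaging model: weighted mean of initial opinion and both stimuli."""
    return ((w0 * s0 + w_row * S_ROW_COMM + w_col * S_COL_COMM)
            / (w0 + w_row + w_col))


table = [[response(wr, wc) for wc in WEIGHTS] for wr in WEIGHTS]

# With w0 = 0 every diagonal cell equals the simple mean of s_R and s_C.
diagonal = [table[i][i] for i in range(len(WEIGHTS))]
# With w0 > 0 the diagonal trends from s0 toward that mean.
diagonal_w0 = [response(w, w, w0=1.0) for w in WEIGHTS]

print(diagonal)      # [4.0, 4.0, 4.0]
print(diagonal_w0)   # increasing values below 4.0
```

Because the column communication has the greater value in this example, each row of `table` rises from left to right, in line with the overall trend described in the text.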


Application 3


Under the averaging hypothesis, Equation 5
becomes

$$R_{ij} = (w_0 s_0 + w_{Ri} s_{Cj})/(w_0 + w_{Ri}). \qquad [8]$$

If $w_0$ is zero, then $R_{ij}$ equals the scale value
of the communication regardless of the
source weight. This might seem odd, but
with no other basis for opinion, the communication
must be taken at face value.
Since this case is mathematically trivial, it
will be assumed that $w_0$ is not zero.
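
Equation 8 is easy to explore numerically. In the sketch below the source weights, the communication value, and the initial opinion are assumed values chosen for illustration; the function names are hypothetical.

```python
# Illustrative sketch with assumed values: Equation 8 for one source and
# one communication.
def response(w_source, s_comm, w0, s0=0.0):
    """Averaging form of the source-communication model."""
    return (w0 * s0 + w_source * s_comm) / (w0 + w_source)


def slope(w_source, w0):
    """Change in response per unit change in communication value."""
    return response(w_source, 1.0, w0) - response(w_source, 0.0, w0)


source_weights = (0.5, 1.0, 3.0)

# Without an initial opinion the source weight has no effect at all.
no_initial = [response(w, 5.0, w0=0.0) for w in source_weights]
# With an initial opinion the slope is w / (w + w0).
slopes = [slope(w, w0=1.0) for w in source_weights]

print(no_initial)  # [5.0, 5.0, 5.0]
print(slopes)      # one third, one half, three quarters
```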

The analysis under the averaging hy- 
pothesis parallels that of Application 3 
above except in one respect. If Equation 
8 is averaged over rows, the marginal col- 
umn means are a linear function of the com- 
munication values. In practice, this means 
that the observed column means may be 
used as provisional estimates of the scale 
values in the bilinear test of fit. To the 
degree that the model is validated, the col- 
umn means may be considered as interval 
scales of the communication values. 

The one change required by the averag- 
ing constraint comes in the estimation of 
the weight parameters. In the simple 
multiplying model, the slopes of the bilinearized
curves are proportional to the
weights, $w_{Ri}$; with the averaging constraint,
they are proportional to $w_{Ri}/(w_{Ri} + w_0)$.


Application 4


Only one case of Application 4, one that 
has considerable importance, will be con- 
sidered in detail. The model for this case 
may be written, 


$$R_{ij} = (w_0 s_0 + w_R s_{Ri} + w_{Cj} s_{Cj})/(w_0 + w_R + w_{Cj}). \qquad [9]$$


This is the averaging form of Equation 6 
under the added restriction that the row 
weight is constant. 

Figure 2 illustrates the difference be- 
tween the adding and averaging models. 
These plots are for a 3X 4 design for 
which the communication scale values are 
those listed in Table 1B. The left panel is 
calculated on the assumption that the row 
and column weights both equal one. These 
three curves are parallel. The right panel 
is calculated on the assumption that the 
row weight is one, but that the column 
weight increases with scale value as speci- 
fied in the figure legend. These curves are 
not parallel. 
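
The two panels can be recomputed from Equation 9. The sketch below uses the Table 1B scale values and the column-weight formula given for the figure; the weight of the initial opinion is not stated in this passage, so it is set to zero here purely as an assumption, and the function names are hypothetical.

```python
# Illustrative sketch: predictions of Equation 9 for a 3 x 4 design with
# the Table 1B scale values. The initial-opinion weight w0 = 0 is assumed.
S_R = [1.0, 2.0, 6.0]        # row scale values
S_C = [1.0, 2.0, 3.0, 6.0]   # column scale values


def averaging_table(w_r, col_weight, s0=0.0, w0=0.0):
    """Averaging predictions; col_weight maps s_Cj to the weight w_Cj."""
    return [[(w0 * s0 + w_r * sr + col_weight(sc) * sc)
             / (w0 + w_r + col_weight(sc)) for sc in S_C] for sr in S_R]


def row_gap(table, i, k):
    """Difference between rows k and i in each column."""
    return [table[k][j] - table[i][j] for j in range(len(S_C))]


equal = averaging_table(1.0, lambda sc: 1.0)                # equal weights
unequal = averaging_table(1.0, lambda sc: 1.0 + 0.2 * sc)   # w_Cj = 1 + .2 s_Cj

gap_equal = row_gap(equal, 0, 2)
gap_unequal = row_gap(unequal, 0, 2)
print(gap_equal)     # constant gap of 2.5: parallel curves
print(gap_unequal)   # gap shrinks from left to right: convergence
```

With equal weights the gap between rows is constant, while with column weight increasing in scale value the gap narrows toward the right.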

Fig. 2. Illustration of parallelism prediction with equal weighting in left
graph, nonparallelism with unequal weighting in right graph. (Each point is
the predicted opinion resulting from two communications whose scale values
are given in Table 1B. In the right graph, the weight of the row communication
is 1, while the weight of the column communication increases with scale value
according to the formula, $1 + .2s_{Cj}$.)

Despite this nonparallelism, the right
panel of Figure 2 shows two regularities.
The first is qualitative: the curves converge
toward the right, that is, as the scale value
increases. This illustrates a general
convergence prediction of the differential
weighting model: Whenever the weights of
the informational stimuli are directly related
to their magnitudes, this same convergence
pattern will be obtained. If the
design were extended to include stimuli
with negative values, this pattern would
reappear as a convergence from the middle
toward the left. Bilateral asymmetry
would reflect differential weighting of positive
and negative stimuli.
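
The parallelism and convergence patterns can be checked numerically. The
following is a minimal sketch of Equation 9, not the computation behind
Figure 2: the scale values are hypothetical (they are not those of Table 1B),
and the initial opinion is assumed to carry zero weight so that each point
reflects only the two communications.

```python
# Sketch of Equation 9 for a 3 x 4 design. All numeric values are hypothetical.

def averaging_response(s_row, s_col, w_col, w_row=1.0, w0=0.0, s0=0.0):
    """Equation 9: weighted average of initial opinion and two communications."""
    total_weight = w0 + w_row + w_col
    return (w0 * s0 + w_row * s_row + w_col * s_col) / total_weight

ROW_VALUES = [1.0, 2.0, 3.0]        # assumed row scale values
COL_VALUES = [1.0, 2.0, 3.0, 4.0]   # assumed column scale values

def predicted_table(col_weight):
    """Build the table of predictions; col_weight maps s_Cj to w_Cj."""
    return [[averaging_response(s_r, s_c, col_weight(s_c)) for s_c in COL_VALUES]
            for s_r in ROW_VALUES]

def row_gaps(table):
    """Distance between the top and bottom row curves at each column."""
    return [table[-1][j] - table[0][j] for j in range(len(table[0]))]

# Left panel: equal weighting. Right panel: column weight 1 + .2 * s_Cj.
equal_gaps = row_gaps(predicted_table(lambda s_c: 1.0))
unequal_gaps = row_gaps(predicted_table(lambda s_c: 1.0 + 0.2 * s_c))

print("equal weighting:  ", [round(g, 3) for g in equal_gaps])
print("unequal weighting:", [round(g, 3) for g in unequal_gaps])
```

With equal weights the gap between row curves is the same in every column
(parallelism); with the column weight increasing in scale value the gap
shrinks from left to right (convergence).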

The second regularity in the right panel 
of Figure 2 is quantitative: the difference
between any two row curves is a constant
multiple of the difference between any
other two row curves. This can be seen
more readily by reference to Equation 9.
The difference between the first two rows is


$$R_{1j} - R_{2j} = \frac{w_R (s_{R1} - s_{R2})}{w_0 + w_R + w_{Cj}}. \tag{10}$$


If the difference between the first and third 
rows is expressed similarly, the ratio of 
these two differences is 


$$\frac{R_{2j} - R_{1j}}{R_{3j} - R_{1j}} = \frac{s_{R2} - s_{R1}}{s_{R3} - s_{R1}}, \tag{11a}$$


which is constant, independent of columns.
Equation 11a can thus be rewritten,


$$R_{2j} - R_{1j} = a\,(R_{3j} - R_{1j}), \tag{11b}$$


where $a$ is a constant. This result holds in
general so that any two sets of row differences
should plot as straight-line functions
of each other. This property could be used
as a rough graphical test of fit.
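
The constant-ratio property can be illustrated with error-free values
generated from the model. This is a sketch under assumed parameters only:
the scale values, the weight of the initial opinion, and the weighting rule
$1 + .2s$ are hypothetical choices made for the illustration.

```python
def response(s_row, s_col, w_col, w_row=1.0, w0=1.0, s0=0.0):
    """Equation 9 with a constant row weight."""
    return (w0 * s0 + w_row * s_row + w_col * s_col) / (w0 + w_row + w_col)

row_values = [1.0, 2.0, 4.0]             # hypothetical row scale values
col_values = [1.0, 2.0, 3.0, 4.0]        # hypothetical column scale values
col_weights = [1.0 + 0.2 * s for s in col_values]

table = [[response(s_r, s_c, w_c) for s_c, w_c in zip(col_values, col_weights)]
         for s_r in row_values]

# Ratio of row differences, column by column (left side of Equation 11a).
ratios = [(table[1][j] - table[0][j]) / (table[2][j] - table[0][j])
          for j in range(len(col_values))]

# Right side of Equation 11a: depends on the row scale values only.
predicted_ratio = (row_values[1] - row_values[0]) / (row_values[2] - row_values[0])

print([round(r, 6) for r in ratios], round(predicted_ratio, 6))
```

Every column yields the same ratio, which is the constant $a$ of the
rewritten form of Equation 11a. With real data the ratios would agree only
within sampling error.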

Another graphical test, somewhat simpler,
can be obtained by plotting the entries
in each column as a function of the
row means. The model implies that each
column should plot as a straight line except
for sampling error.


An interesting extension of bilinearity analysis
(Anderson, 1970a, p. 157) provides an exact statistical
test. The computing procedure given there may
be followed exactly with these modifications. First,
a reduced data matrix is obtained by subtracting
the column mean from all entries in each column.
This has the effect of reducing Equation 9 to the
form of a simple multiplying model. All further
calculations are done in terms of this reduced matrix.
The comparison coefficients for rows are just the row
means, since the grand mean is zero. To get the
comparison coefficients for columns requires a detour,
since all column means are zero in the reduced matrix.
For this purpose only, the sign of all cell entries
is ignored; the column comparison coefficients
are then obtained as in the cited reference as column
deviations from the grand mean in this temporary
matrix of absolute values. These comparison coefficients
allow calculation of the Linear × Linear
component of the interaction. The residual interaction
should then be nonsignificant. Its df equals
$(r - 2)(c - 1) - 1$, where $r$ and $c$ are the number of
rows and columns; the initial reduction of the data
matrix loses one degree of freedom per column
through estimating an additive constant in each
column.
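
The reduction step and the two sets of comparison coefficients can be
sketched as follows. This covers only the data reduction described above,
not the significance test itself, and it uses error-free values generated
from Equation 9 with hypothetical parameters. The helper names are
illustrative and do not come from the cited reference.

```python
def response(s_row, s_col, w_col, w_row=1.0, w0=1.0, s0=0.0):
    """Equation 9 with a constant row weight (hypothetical parameters)."""
    return (w0 * s0 + w_row * s_row + w_col * s_col) / (w0 + w_row + w_col)

def reduce_by_column_means(table):
    """Subtract each column mean from every entry in that column."""
    n_rows, n_cols = len(table), len(table[0])
    col_means = [sum(row[j] for row in table) / n_rows for j in range(n_cols)]
    return [[table[i][j] - col_means[j] for j in range(n_cols)]
            for i in range(n_rows)]

def comparison_coefficients(reduced):
    """Row means of the reduced matrix; column deviations from the grand
    mean of the temporary matrix of absolute values."""
    n_rows, n_cols = len(reduced), len(reduced[0])
    row_coeffs = [sum(row) / n_cols for row in reduced]
    abs_table = [[abs(x) for x in row] for row in reduced]
    grand_abs = sum(sum(row) for row in abs_table) / (n_rows * n_cols)
    col_coeffs = [sum(abs_table[i][j] for i in range(n_rows)) / n_rows - grand_abs
                  for j in range(n_cols)]
    return row_coeffs, col_coeffs

def max_minor(matrix):
    """Largest absolute 2 x 2 minor; zero for an exactly multiplicative table."""
    n_rows, n_cols = len(matrix), len(matrix[0])
    return max(abs(matrix[i][j] * matrix[k][l] - matrix[i][l] * matrix[k][j])
               for i in range(n_rows) for k in range(n_rows)
               for j in range(n_cols) for l in range(n_cols))

row_values = [1.0, 2.0, 4.0]
col_values = [1.0, 2.0, 3.0, 4.0]
table = [[response(s_r, s_c, 1.0 + 0.2 * s_c) for s_c in col_values]
         for s_r in row_values]

reduced = reduce_by_column_means(table)
row_coeffs, col_coeffs = comparison_coefficients(reduced)
print(max_minor(reduced))   # essentially zero: each entry is (row term) * (column term)
```

The vanishing 2 × 2 minors show that the reduced matrix has the form of a
simple multiplying model, which is what allows the bilinearity analysis to
be applied.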

The error term requires a brief comment. The
within-cells error would be used for both the Linear
× Linear component and the Residual interaction
for the case of independent groups in each cell and,
typically, for single subject analysis. Repeated-measurements
designs, with each subject in
all conditions, are more complicated. If the analysis
is to be made over subjects, the above procedure
needs to be applied to each subject. The overall
Row × Column interaction is split into Linear
× Linear and Residual components using the means
of the individual comparison coefficients. In parallel,
the Subject × Row × Column interaction is
split into Subjects × Linear × Linear, and Subjects
× Residual. Each component of the overall interaction
is then tested against its interaction with
subjects.

Two functional scales are also available.
If Equation 9 is averaged over columns, it
becomes a linear function of the row scale
value, $s_{Ri}$. As in Application 1, the row
marginal means provide an equal-interval
scale of the values of the row communications.


Furthermore, Equation 10 indicates that
the row difference is an inverse measure of
the column weight, $w_{Cj}$. Thus, the reciprocals
of the row differences can provide an
interval scale of the column weight parameter
as shown in Equation 12:


$$w_{Cj} = -(w_0 + w_R) + \frac{w_R (s_{R1} - s_{R2})}{R_{1j} - R_{2j}}. \tag{12}$$


The optimal statistical procedure for estimating
these weights is not known, however.
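
Equation 12 can be illustrated with error-free values. The sketch below
assumes that $w_0$, $w_R$, and the two row scale values are known, which
would not ordinarily be the case with real data; without them, the
reciprocals of the row differences still provide the column weights up to a
linear transformation. All numbers are hypothetical.

```python
def response(s_row, s_col, w_col, w_row, w0, s0):
    """Equation 9 with a constant row weight."""
    return (w0 * s0 + w_row * s_row + w_col * s_col) / (w0 + w_row + w_col)

# Hypothetical parameters, treated as known for the illustration.
W0, S0, W_ROW = 1.0, 0.0, 1.0
S_R1, S_R2 = 3.0, 1.0
col_values = [1.0, 2.0, 3.0, 4.0]
true_col_weights = [1.0 + 0.2 * s for s in col_values]

row1 = [response(S_R1, s, w, W_ROW, W0, S0)
        for s, w in zip(col_values, true_col_weights)]
row2 = [response(S_R2, s, w, W_ROW, W0, S0)
        for s, w in zip(col_values, true_col_weights)]

def column_weight(r1, r2):
    """Equation 12: column weight from the difference between two rows."""
    return -(W0 + W_ROW) + W_ROW * (S_R1 - S_R2) / (r1 - r2)

estimated = [column_weight(a, b) for a, b in zip(row1, row2)]
print([round(w, 6) for w in estimated])
```

The recovered values agree with the weights used to generate the table,
as they must for error-free data.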


Differential Weighted Averaging


In the application just considered, the
weight parameter was allowed to vary arbitrarily
as a function of the scale value.
There is increasing evidence that such differential
weighting is more the rule than the
exception (Lampel & Anderson, 1968;
Oden & Anderson, 1971), and differential
weighting seems especially likely in attitude
change experiments. It is fortunate, therefore,
that some direct analyses and estimation
methods are possible.

The importance of this result can be il- 
lustrated by the date ratings of Lampel 
and Anderson (1968) which illustrate that 
a cognitive interaction of some complexity 
can be handled simply. Boys described by 
a photo and two personality-trait adjec- 
tives were rated as coffee dates by college 
girls. The data provided a critical test of 
averaging-adding in favor of averaging. 
In addition, the Adjective X Photo inter- 
action was very large, with the adjectives 
having a greater effect with a positive than 
with a negative photo. This was inter- 
preted as a discounting effect: girls do not 
want to be seen dating unattractive boys 
no matter how nice they are; as their physi- 
cal appearance improves, their personality 
characteristics become more important. 
In theoretical terms, then, the weight of 
the photo is inversely related to its value. 

In this experiment, the adjectives corre- 
spond to the rows, the photos to the col- 
umns of Application 4. Figure 1 of the 
cited article plots two difference curves 
analogous to Equation 10. The upper 
curve is almost exactly a constant multiple 
of the lower curve, in agreement with Equa- 
tions 11a and 11b. 


Conjunctive and disjunctive models (e.g., Coombs,
1964; Torgerson, 1958) can also be represented as
differential weighted averaging models. The standard
conjunctive model sets a criterion on each of
several dimensions. Only if the criterion is exceeded
on all dimensions simultaneously is a positive decision
made. The present approach would replace the
criterion by a weighting function that had high
weights below some threshold region, diminishing to
lower weight at higher levels of the scale value.
Because the averaging model requires the weights
to sum to one, any stimulus variable that fell below
the criterion could dominate the response and prevent
a positive decision. The averaging model thus
provides a more general and flexible analysis. In
particular, it replaces the somewhat artificial point
criterion by a band or region.

It should also be noted that scaling is an integral
part of psychological theory in the present approach.
The standard treatments of conjunctive and disjunctive
models, in contrast, assume that stimulus
scale values are already at hand. But if weight and
scale value are inversely related, as in the date-rating
experiment, this critically affects the scaling
analysis.

It has been suggested that a strict adding model
should have been applied to the main data of the
date-rating experiment. The observed interactions
then would be interpreted as an artifact of a merely
ordinal response scale, to be eliminated by appropriate
monotone transformation (Anderson, 1962b;
Bogartz & Wackwitz, 1970; Kruskal, 1965). This
same argument holds more generally for the averaging
analysis of Application 4. A convergence
interaction could reflect a nonlinear output function
(Bogartz & Wackwitz, 1970, 1971) rather than a real
averaging effect. Because of their simplicity, strict
adding models have been favorites in abstract measurement
theory, and they have many mathematically
convenient properties. However, the empirical
evidence summarized under Averaging versus Adding
above raises considerable doubt about the validity
of strict adding models.


RELATIONS TO OTHER THEORIES 


This section compares integration theory 
with several other theories of attitude 
change, mainly those that have attempted 
some degree of quantitative analysis. The 
main purpose of these comparisons is to 
show how integration theory handles vari- 
ous traditional problems in attitude theory. 


Balance Theory 


As an organizing concept, a principle of
information integration has certain ad- 
vantages over the principles of balance and 
congruity (Anderson, 1968a). Critical 
comparisons by Lindner (1970, 1971) have 
supported integration theory. This section 
will illustrate the theoretical comparison in 
more detail. Since the most extensive at- 
tempt to relate balance theory to attitude 
change is that of Feather (1964, 1967), only 
his development will be considered here. 
A recent review of balance theory is given 
by Zajonc (1968). Mention should also be 
made of the work of Wiest (1965), who has 
attempted to quantify the strengths of the 
relations, and of Newcomb (1968), who 
has attempted to give a more realistic definition
of balanced states.

Fig. 3. Communication graphs, with S, C, I, and R denoting Source,
Communication, Issue, and Receiver. (Solid lines denote a positive
association or attitude; broken lines denote a negative attitude.)


Feather bases his analysis on the graph
diagrams studied by Cartwright and Harary
(1956) and illustrated in Figure 3. The
solid arrow connecting C to I means that
the communication is favorable toward the
issue. Similarly, the solid arrow connecting S to I
means that the receiver (R) considers the
source (S) to be favorable toward the issue
prior to receiving the communication. The
solid line connecting S to C indicates
that the communication comes from the
source. The initial attitudes of the receiver
toward the issue and the source are denoted
by $s_0$ and Ip, respectively. These features
are the same in all four diagrams.

Attitude toward issue. For each diagram
in Figure 3, the receiver gets two pieces of
information about the issue: the initial
stand of the source and the favorable communication
from the source. For simplicity,
it will be assumed that the prior stand
of the source has already been integrated
and become part of $s_0$. The integration
equation for the attitude after receipt of
the favorable communication is then


$$S = w_0 s_0 + w_1 s_C, \tag{13}$$


a weighted sum or average of the initial
attitude, $s_0$, and the scale value of the communication,
$s_C$. The effective weight, $w_1$,
of the communication will depend on properties
of the source and communication as
discussed elsewhere.

The four cases in Figure 3 differ according
as initial attitudes of the receiver toward
the source and issue are positive
(solid arrow) or negative (broken arrow).
These will now be discussed in turn.

Case 1: Here the receiver's initial attitude
is positive toward the source, negative toward
the issue. Figure 3 then represents
an unbalanced structure, and balance
theory postulates a tendency to restore
balance. In integration theory this tendency
is not a postulate but a deduction. In
Equation 13, $s_0$ is negative, because the
initial attitude toward the issue is negative,
but both $w_1$ and $s_C$ are positive. Accordingly,
Equation 13 implies directly that
the postcommunication attitude will be
less negative, perhaps even positive.

This is no more than common sense, of
course, but integration theory differs from
balance theory in two important ways.
First, the final state need not be balanced;
the postcommunication attitude may be
predictably negative. Second, integration
theory allows a quantitative analysis of the
magnitude of attitude change and its dependence
on various stimulus variables.
It thus takes into account the strengths of
the relations which balance theory is unable
to handle satisfactorily.

Case 2: Here the receiver's initial attitudes
toward source and issue are both
positive. Since there is no imbalance in the
diagram, balance theory lacks a basis on
which to predict a change in attitude.
Integration theory does predict a change
under the averaging model. Moreover, the
changed attitude will be more positive or
less positive, according as $s_C$, the
communication value, is more or less positive
than $s_0$, the initial attitude.

Case 3: Here the receiver is initially
negative toward source and issue both, as
indicated by the broken arrows in Figure 3.
Since the SRI triad is balanced, balance
theory again lacks a basis on which to predict
a change in attitude.

Under integration theory, the theoretical
analysis depends on whether $w_1$ is positive
or negative. That $w_1$ may be positive even
with a negative source is indicated by the
positive attitude change found with low
credible sources (see Hovland et al., 1953;
McGuire, 1969). If $w_1$ is positive, then the
receiver's attitude toward the issue will become
less negative. Indeed, it may even
become positive, thus converting a balanced
state to an imbalanced state.

Case 4: Here the receiver has a negative 
attitude toward the source but is positive 
toward the issue. This unbalanced state 
could become balanced if either the receiver 
adopted a positive attitude toward the 
source, or else a negative attitude toward 
the issue. Balance theory does not predict 
which will occur. 

For integration theory, the theoretical
analysis again depends on whether $w_1$ is
positive or negative. If $w_1$ is negative, the
receiver's attitude toward the issue will become
less positive, and hence more balanced.
However, if $w_1$ is positive, then the
receiver's attitude toward the issue becomes
even more positive, increasing the imbalance.
As in Case 3, this allows a critical
qualitative test between balance theory
and integration theory.
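
The four cases can be traced through Equation 13 with illustrative numbers.
The values below are hypothetical and are chosen only to show the direction
of change. For positive $w_1$ the weights are taken to sum to one (the
averaging form) with a communication more positive than the initial
attitude; for negative $w_1$ the initial weight is simply set to one, an
assumption made here since the text does not specify it.

```python
def integrate(s0, s_c, w0, w1):
    """Equation 13: weighted sum or average of initial attitude and communication."""
    return w0 * s0 + w1 * s_c

# (initial attitude, postcommunication attitude) for each case; values hypothetical.
cases = {
    "Case 1: source +, issue -":          (-2.0, integrate(-2.0, 3.0, 0.75, 0.25)),
    "Case 2: source +, issue +":          (1.0, integrate(1.0, 3.0, 0.75, 0.25)),
    "Case 3: source -, issue -, w1 > 0":  (-2.0, integrate(-2.0, 3.0, 0.875, 0.125)),
    "Case 4: source -, issue +, w1 < 0":  (2.0, integrate(2.0, 3.0, 1.0, -0.125)),
    "Case 4: source -, issue +, w1 > 0":  (2.0, integrate(2.0, 3.0, 0.875, 0.125)),
}

for label, (before, after) in cases.items():
    print(f"{label}: {before:+.3f} -> {after:+.3f}")
```

In Cases 1 and 3 the attitude becomes less negative; in Case 2 it moves
toward the more positive communication; in Case 4 the direction depends on
the sign of $w_1$, which is the basis of the qualitative test mentioned in
the text.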


Attitude toward source. Integration theory ana- 
lyzes attitude toward the source in much the same 
way as just illustrated for attitude toward the issue. 
In the diagrams of Figure 3, there are three relevant 
informational stimuli, including the receiver's prior 
attitude toward the source. Thus, the communica- 
tion attributed to the source constitutes a piece of 
information about the source; as such it has a cer- 
tain weight and scale value. The same holds for the 
initial stand attributed to the source. Both these 
pieces of information are then to be integrated into 
the receiver's initial opinion about the source, and 
the integration process is assumed to be essentially 
the same as for the issue. 

There are, however, important differences in de- 
tail, especially in the valuation process, that reflect 
the asymmetry of the source-issue relation. That
the source favors the issue does not mean that the
issue favors the source, for instance. In addition, 
the valuation process will depend on the dimension 
of judgment. In particular, the parameters of the 
informational stimuli will be different if the receiver 
judges the position of the source on the issue than 
if he makes an evaluative judgment of the source. 

Evaluation of the communication. Judgments of 
the communication, such as its position on the issue, 
or its fairness, will of course depend in detail on the 
content and structure of the communication. They 
will also, as is well known, depend on the attitude of 
the receiver and his valuation of the source, and this 
dependence will be discussed briefly. 

If the receiver is asked to judge the position of the 
communication on the issue, he has two main pieces 
of information to rely on. One is his opinion, ex- 
plicit or inferred, about the source’s stand on the 
issue independent of the communication. The other
is the content of the communication per se. Under 
the simplest theoretical analysis, the judgment of 
the communication will then be a weighted average 
of these two pieces of information. Loosely speak- 
ing, the judgment might be considered a compromise 
between what the source actually said and what the 
source was expected to say; certainly this is a rea- 
sonable strategy from the subject's view. 

From this standpoint, the theoretical analysis of 
the communication parallels that just given for 
judgments of the issue. For instance, if the initial 
attitude toward the source is negative, the communi- 
cation will ordinarily be seen as less positive when 
attributed to the source than when considered alone. 
Here as before, a tendency toward balance arises as 
a deduction from integration theory. 

This simple averaging analysis does not allow for 
the seeming fact that the judged position of a com- 
munication may also depend on the receiver's atti- 
tude toward the issue. This is a complicated prob- 
lem (see Kiesler, Collins, & Miller, 1969, pp. 264 ff.) 
that will be briefly considered under Assimilation- 
Contrast Theory, below. 

Judgments of the fairness or plausibility of the 
communication have also been considered (see, e.g., 
Brigham & Cook, 1970; McGuire, 1969). The gen- 
eral finding is that communications more discrepant 
from one's own position tend to be judged as less 
fair. This commonsense result merits a common- 
sense explanation: if the receiver is at all attached to 
his own position and considers it correct, he will 
naturally judge a discrepant communication as in- 
correct, hence misleading, and hence unfair. At 
the same time, many experimental studies of opinion 
change do not involve firm opinions, and in such 
cases fairness would not necessarily be affected. 
Thus, it seems more promising to look for a social 
learning base for such evaluations rather than a
principle of balance. 


Summary comments. The main conclu- 
sion from the preceding analysis is that a 
tendency toward balance may be derived 
from a principle of information integration.
Further, the integration principle is more
general since it also accounts for opinion
change in balanced situations. Only certain
illustrative situations have been considered,
of course, but much the same analysis
would seem to apply to the numerous
other graph structures considered by
Feather (1967).

Integration theory has additional advantages.
Feather (1964, 1967) has noted
that major unresolved problems in balance
theory include allowing for the strengths of
relations, for the importance of the issue,
as well as for individual differences. These
problems are handled in a natural way, in 
terms of the w and s parameters of integra- 
tion theory. Moreover, functional mea- 
surement technique is specifically oriented 
toward measurement at the level of the 
individual. 

Heider (1958) has stated that “Formulation
in terms of equifinality [balanced end
state] is more parsimonious than formulation
in terms of single conditions and
effects [p. 207].” Integration theory takes
the opposite road to parsimony, stressing
the primary importance of detailed analysis,
both theoretical and experimental, of
stimulus conditions and their effects.


Congruity Theory 


Congruity theory (Osgood & Tannen- 
baum, 1955; Tannenbaum, 1967, 1968) 
postulates an averaging model of the form 
of Equation 1 (S. Rosenberg, 1968). It is, 
however, a very specialized averaging 
model, since it requires the weight param- 
eters to be specific functions of the scale 
values or polarities of the stimuli. For two
stimuli of values $s_1$ and $s_2$, congruity theory
postulates that $w_1 = |s_1|/(|s_1| + |s_2|)$,
$w_2 = |s_2|/(|s_1| + |s_2|)$. This formulation
makes the weight increase with the scale
value, which would be reasonable in many
situations. It also requires a neutral stimulus
to have zero weight, which is clearly
wrong.
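
The congruity weighting rule, and the zero weight it assigns to a neutral
stimulus, can be made concrete in a few lines. This is a minimal sketch;
the function names and the numerical values are illustrative only.

```python
def congruity_weights(s1, s2):
    """Weights proportional to polarity: |s1| and |s2| over their sum."""
    total = abs(s1) + abs(s2)
    if total == 0:
        raise ValueError("both stimuli are neutral; the weights are undefined")
    return abs(s1) / total, abs(s2) / total

def congruity_prediction(s1, s2):
    """Weighted average of the two scale values under the congruity weights."""
    w1, w2 = congruity_weights(s1, s2)
    return w1 * s1 + w2 * s2

print(congruity_weights(3.0, 1.0), congruity_prediction(3.0, 1.0))
print(congruity_weights(0.0, 2.0), congruity_prediction(0.0, 2.0))
```

In the second call the neutral stimulus receives zero weight, so the
predicted response equals the value of the other stimulus: the neutral
information has no effect whatever. This is the implication criticized in
the text.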


In the present formulation, the weight
parameter can depend on factors other than
stimulus value. For instance, one can experimentally
manipulate the credibility or
reliability of the information without
thereby affecting its scale value. This indicates
that congruity theory has an inadequate
conceptual base. Moreover, congruity
theory has not been able to handle
the set-size effect, in which the addition
of information of equal value increases the
extremity of the response (e.g., Anderson,
1959a, 1965a, 1967). An extensive discussion
of this and related problems is given
by Smith (1970).

Tannenbaum (1967) has recently dis- 
cussed two experiments that he interprets 
to support congruity theory and to infirm 
an information-processing approach. However,
both experiments can be treated as
2 × 2 designs for which integration theory
predicts zero interaction, in agreement with 
the data. An earlier experiment by Tann- 
enbaum and Norris (1965) also showed a 
nonsignificant interaction, though there the 
theoretical analysis is less clear. 

In one experiment (Tannenbaum, 1967, 
Table VI), each condition contained one 


source. The four
conditions differed in that they contained
two positive messages, two negative messages,
or one of each polarity. The interaction
term would apparently not have approached
significance, in agreement with
integration theory. The data imply a negative
weight for the negative source, though
this may reflect the covariance adjustment.

In the other experiment, the four experimental
conditions received either the same
or different messages, from either the same
or different sources. In an informational
analysis, redundant information requires a
redundancy parameter, in this case for
same-versus-different message as well as for
same-versus-different source. These redundancy
parameters need not be evaluated,
but can be incorporated directly in the
formal model. The adding model then predicts
zero interaction, and the averaging
model makes approximately the same prediction.
Tannenbaum's data (1967, Table
V) show a very small interaction term that
would evidently not have been significant.
Neither experiment, therefore, poses any
difficulty for information integration theory.
Indeed, integration theory goes beyond
congruity theory to make quantitative predictions
in both cited experiments.


Integration theory implies that congruity,
like balance, will generally increase.
A congruity principle, or at least a congruity
tendency, is thus a deduction from a
more general principle of information integration.
The detailed supporting analysis
for this claim has already been given in the
discussion of balance theory. Here it may
be noted that integration theory avoids a
fundamental criticism raised by Abelson
(1963). If a positive source makes a positive
statement about some issue, the evaluation
of source and issue will approach each
other. The cause, according to congruity
theory, is a pressure toward congruity 
which exists as long as the source and issue 
evaluations are unequal. This makes it 
awkward to explain why complete congruity 
is seldom if ever obtained. For integration 
theory, there is no difficulty. The change 
is effected by the information in the com- 
munication; once this is absorbed, no 
further change is implied and there is no 
reason to expect complete congruity be- 
tween source and issue. 


Assimilation-Contrast Theory 


The present theory of opinions and atti- 
tudes is in large part a theory of social 
judgment. Although this section is meant 
to compare integration theory with other 
quantitative formulations, comparison with 
assimilation-contrast theory (Sherif & Hov- 
land, 1961; Sherif, Sherif, & Nebergall, 
1965) is appropriate since it also emphasizes
judgmental processes. Summaries
and evaluations of their theory have been 
given by Insko (1967) and Kiesler et al. 
(1969). Here the two approaches will be 
compared in the context of an interesting 
recent experiment by Rhine and Severance 
(1970). 

Rhine and Severance studied three vari- 
ables: source credibility, ego involvement, 
and discrepancy between the person's ini- 
tial attitude and the position advocated by 
the communication. For a single communication
with weight $w_1$ and value $s_1$,
the postcommunication attitude can be
written as


$$R = w_0 s_0 + w_1 s_1, \tag{14}$$


where $s_0$ is the initial opinion, with weight
$w_0$. In an averaging model, the weights are
required to sum to one. This effect can be
achieved by letting $w_1$ and $w_0 = 1 - w_1$ be
the relative weights. Then Equation 14
can be rewritten as


$$R = s_0 + w_1 (s_1 - s_0). \tag{15}$$


From Equation 15, the change in attitude is
seen to be just $w_1(s_1 - s_0)$. Accordingly
the theoretical analysis devolves on the
determinants of $w_1$ and of $(s_1 - s_0)$.

Source credibility will affect $w_1$. A more
credible source would correspond directly
to a larger value of $w_1$ and would produce
more attitude change.

Ego involvement was defined by choice
of issue, either the proposed tuition increase
in the University of California, Riverside,
where the experiment was run, or the desired
size of the city park in Allentown,
Pennsylvania. For this particular manipulation,
ego involvement corresponds to
$w_0$, the strength of the initial opinion.
Thus, $w_1$ would be lower for the higher
level of ego involvement and less change
would occur.

The discrepancy variable, $(s_1 - s_0)$, was
manipulated by using three different dollar
or acre values in the communications given
to different groups. With no further qualification,
Equation 15 predicts that amount
of opinion change will be a linear function of
the discrepancy, $(s_1 - s_0)$. This was found
to hold for the parks issue though not on
the tuition issue.

Integration theory includes a process of 
inconsistency discounting (Anderson & 
Jacobson, 1965) in which inconsistent in- 
formation is given decreased weight. For 
example, a communication attributed to a 
highly credible source would tend to be dis- 
counted if it was inconsistent with the per- 
son's previous opinion about the source's 
position. In the same way, a communica- 
tion will tend to be discounted when it is 
inconsistent with fact or with the person's 
own previous opinion on the given issue.

Accordingly, the predicted effect of the
discrepancy variable needs to be considered
in the light of the experimental task. For
the parks issue, the subjects were told that
the average park size was 20 acres for a
city with a little less than half the 106,400
population of Allentown. The highest advocated
figure was 240 acres which would


relation between advocated change and obtained
change is in accord with expectation
(Anderson, 1959a; Anderson & Hovland,
1957). Theoretically,
consequence of the averaging hypothesis.
The highest advocated increase in tuition


ministration. For the tuition issue, $w_1$
would decrease as $(s_1 - s_0)$ increased. The
predicted change, the product of
these two factors, would accordingly increase
at first and later decrease as $s_1$ became
more discrepant from $s_0$.

The inverted-U relation obtained for the
tuition issue by Rhine and Severance, as
well as similar results obtained by others
(e.g., Aronson, Turner, & Carlsmith, 1963;
Bochner & Insko, 1966), is thus consistent
with integration theory. It should be
noted, however, that a firm prediction of
the later decrease depends on having sufficiently
large values of the discrepancy variable.
Rhine (personal communication,
1970) has suggested that this was the case
on the ground that the discrepancies extended
well into the latitude of rejection.
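
The contrast between the linear and the inverted-U relation can be
illustrated numerically. The sketch below uses the change term of
Equation 15, $w_1(s_1 - s_0)$. The discounting rule is purely hypothetical:
the text states only that the weight decreases with discrepancy, not the
form of the function, and the parameter names are invented for the
illustration.

```python
def attitude_change(discrepancy, w1):
    """Change implied by Equation 15: w1 * (s1 - s0)."""
    return w1 * discrepancy

def discounted_weight(discrepancy, w_max=0.5, half_point=4.0):
    """Hypothetical discounting rule: the weight falls as discrepancy grows."""
    return w_max / (1.0 + (discrepancy / half_point) ** 2)

discrepancies = [1.0, 2.0, 4.0, 8.0, 16.0]

# Constant weight (no discounting): change is linear in discrepancy.
constant = [attitude_change(d, 0.5) for d in discrepancies]

# Discounted weight: change rises at first and later falls.
discounted = [attitude_change(d, discounted_weight(d)) for d in discrepancies]

print("constant weight:  ", constant)
print("discounted weight:", [round(c, 3) for c in discounted])
```

Under this assumed rule the change peaks where the discrepancy equals the
half-point parameter. If all discrepancies in an experiment fell below that
point, only the rising limb would be observed, which is why a firm
prediction of the later decrease requires sufficiently large discrepancies.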

In their Table 5, Rhine and Severance
list five predictions about attitude change
and conclude that assimilation-contrast
theory accounts for three of these while
dissonance theory accounts for two. Integration
theory, on the analysis just given,
agrees with the data in all five cases. This
analysis is post hoc, of course, but it is not
ad hoc; no special assumptions were made
in applying the integration model. Rhine
and Severance also provide data on source
and message evaluation that may be analyzed
in the manner indicated in the section
on balance theory.

Inconsistency discounting is central in
the present analysis of the discrepancy variable.
Almost all other theories of attitude
change postulate some process that has the
same general effect (see Insko, 1967, Ch.
3). None of these is too satisfactory, since
their application depends more or less on
common sense. Integration theory has one
advantage, since it can make definite
quantitative predictions about the effects of
combining two communications.


The heavy dependence of the Sherif-Hovland approach
on concepts of assimilation and contrast is of
theoretical concern for several reasons (see Insko,
1967; Kiesler et al., 1969; Upshaw, 1969). Assimilation-contrast
is considered again later, and only two
points will be made here. Both bear on a problem
mentioned in the discussion of balance theory;
namely, the effect of a person's own opinion on his
judgment of the position of a communication.

Sherif and Hovland claim that people with strong
opinions tend to see communications that portray
markedly different opinions as even more discrepant
than they "really" are, a contrast effect. Such
are beset by interpretational difficulties. 
Defining the “real” position of the communication in
terms of responses of other people is a dubious procedure.
Within functional measurement theory,
the scale value of the communication is unique to
each person; that one person's evaluation differs
from that of another does not mean that either has
distorted the position of the communication. This
view Upshaw's (1965, 1969) 
explanation in terms of a variable zero point in the 
j In general, comparisons between 
different populations can be most difficult (Anderson,
1963), and this is no less true when the populations
are single people.

It is also possible that such displacement effects
For example,
the overt judgment of the communication
may be a weighted average of its position on the
issue, and of the felt position, or perhaps of the social
evaluation of the person to whom the statement is
attributed. For many political and social issues,
such a judgmental process would appear to produce
"contrast" for positions far from that held by the
judge, and "assimilation" for positions near that
On this basis, apparent contrast
in evaluation of discrepant communications would
be expected from Dawes' (1971) finding that persons
with opposite points of view are thought to be more
A related model for
an assimilation effect has been applied to component
judgments in impression formation with some success
(Anderson, 1966a; Kaplan, 1971).


Similarity-Attraction Model


An extended series of experiments by
Byrne and his associates (see Byrne, 1969)
has supported, under certain experimental
conditions, the hypothesis that interpersonal 
attraction is a linear function of proportion 
of similar attitudes. In the typical experi- 
ment, the subject receives a form purport- 
ing to describe the attitudes of a stranger on 
political affiliation, drinking, and similar 
issues. These forms are constructed to have 
specified numbers of attitudes that are 
similar and dissimilar to the attitudes of the 
subject. The subject's response is a rating 
of how much he would like the stranger. 

This interpersonal attraction task is 
quite similar to the personality-impression 
task studied by the writer (Anderson, 1967;
Byrne, Lamberth, Palmer, & London, 
1969). It is appropriate, therefore, to ask 
how integration theory might apply to the 
interpersonal-attraction task. Of the num- 
erous interesting results, only one or two 
can be considered here. 

From the view of information integration 
theory, each item on the form constitutes a 
piece of information about the stranger, 
much the same as a trait adjective. As 
such, it has a weight and a scale value along 
the dimension of judgment. Let $w_S$ and $s_S$
be the weight and value of an item marked
similar, and let $w_D$ and $s_D$ be the weight and
value of an item marked dissimilar. Sup-
pose there are $N$ items altogether of which
$k$ are marked similar. Under the adding
model, the theoretical expression for the
attraction response is then


$R = k w_S s_S + (N - k) w_D s_D$. [16]


If the number of items marked similar is
increased by one, $k$ would be replaced by
$k + 1$ in this equation. The change in
response is then the difference,


$\Delta R = w_S s_S - w_D s_D$. [17]

Since $\Delta R$ does not depend on $k$, it follows
that $R$ is a linear function of $k$, and hence
also of $k/N$, the proportion of similar items.
Under the given assumptions, therefore, 
Byrne's linearity result is consistent with 
prediction from integration theory. More 
direct empirical support has been obtained 
by Kaplan and Olczak (1970) who em- 
ployed a factorial integration design as in 
Application 1. 


The adding model of Equation 16 rests 
on the assumption that the similar items 
have equal value and weight, and so also 
for the dissimilar items. If these assump- 
tions do not hold, the linearity prediction 
does not follow. However, it would still 
be possible to use Application 4 of the adding 
model to get a parallelism prediction. 

If the averaging model holds, the right
side of Equation 16 must be divided by the
sum of the weights, $k w_S + (N - k) w_D$.
The linearity prediction then holds only if
$w_S = w_D$. Deviations from linearity would
then be expected to the degree that similar 
and dissimilar items are differentially 
weighted. This possibility deserves further 
consideration since the personality-trait 
studies argue for an averaging model. 
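
The contrast between the two models can be checked numerically. The following is a minimal sketch, not part of the original analysis: the function names and all parameter values are hypothetical, and the subscripts S and D are taken to denote similar and dissimilar items. It computes the response for every possible number of similar items and tests linearity by comparing first differences.

```python
def adding_response(k, n, w_s, s_s, w_d, s_d):
    """Adding model (Equation 16): sum of weight * value over all n items."""
    return k * w_s * s_s + (n - k) * w_d * s_d


def averaging_response(k, n, w_s, s_s, w_d, s_d):
    """Averaging model: the adding expression divided by the sum of the weights."""
    return adding_response(k, n, w_s, s_s, w_d, s_d) / (k * w_s + (n - k) * w_d)


def first_differences(values):
    """Change in response produced by each additional similar item."""
    return [b - a for a, b in zip(values, values[1:])]


def is_linear(values, tol=1e-9):
    """A sequence is linear in k when all of its first differences are equal."""
    diffs = first_differences(values)
    return all(abs(d - diffs[0]) < tol for d in diffs)


N = 8  # hypothetical number of attitude items on the form
ks = range(N + 1)

# Hypothetical weights and scale values, chosen only for illustration.
adding = [adding_response(k, N, 1.0, 2.0, 1.5, -1.0) for k in ks]
avg_equal = [averaging_response(k, N, 1.0, 2.0, 1.0, -1.0) for k in ks]
avg_unequal = [averaging_response(k, N, 1.0, 2.0, 3.0, -1.0) for k in ks]

print(is_linear(adding), is_linear(avg_equal), is_linear(avg_unequal))
```

With these illustrative values the adding model is linear in k even with unequal weights, whereas the averaging model is linear only in the equal-weight case.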

Two important differences between inte- 
gration theory and Byrne's formulation 
can be illustrated in the Byrne and Rhamey 
(1965) experiment. In addition to the in- 
formation about the stranger's own atti- 
tude, the subjects also received information 
about the stranger's evaluation of the sub- 
ject's own intelligence, adjustment, etc. 
To handle two sources of information, the 
attraction equation was modified so that 


attraction toward X is a positive linear function 
of the sum of weighted positive reinforcements 
(Number X Magnitude) received from X divided 
by the total number of weighted positive and 
negative reinforcements received from X [Byrne & 
Rhamey, 1965, p. 887]. 


This differs from the integration model in 
several respects. For instance, the scale 
values, which correspond to their "rein-
forcements," do not appear in the denomin-
ator of the integration equation.

The Byrne and Rhamey experiment has 
special interest, since it employed a two- 
way design in which the interaction was 
significant, contrary to the simple integra- 
tion model. Inspection of their data 
suggests that the interaction results from a 
discounting of the attitude-similarity infor- 
mation in the context of the favorable eval- 
uation of the subject. The value of their 
control condition in supporting this inter- 
pretation deserves particular mention as a 
useful methodological device.

It should be noted that Byrne and Rha-
mey also predict no interaction (see Table
5, Byrne & Rhamey, 1965). That they got 
a close fit to the linear function despite the 
interaction illustrates the limitations of 
regression analysis discussed in the next 
section. 

The second difference between integra- 
tion theory and Byrne's formulation is more 
fundamental. As illustrated in the above
quotation, Byrne considers the attitude- 
similarity items to be reinforcing stimuli. 
Integration theory considers them to be 
informational stimuli, a view that is sup- 
ported by the work of Himmelfarb and 


present view, giving praise or blame does 
not necessarily act as a reinforcing stimulus
for the attraction response to the source;
it may simply provide an informational
stimulus which is integrated into the opin-
ion about the source. In applications
to learning (Anderson, 1969b; Anderson &
Hubert, 1963; Friedman et al., 1968), inte-
gration theory thus appears as a cognitive,
information-processing theory somewhat in
the sense of Tolman rather than an S-R
reinforcement theory in the sense of Hull
or Skinner.


Summation Theories


Certain kinds of integration tasks evi- 
dently require strict adding rather than 
averaging models. For example, the value 
of a commodity bundle might be taken as
the sum of the values of its components
(e.g., Bock & Jones, 1968; Gulliksen, 1956),
possibly with a law of diminishing returns
(Shanteau, 1970b). The addition of one
commodity to the bundle would thus in-
crease the total value. For attitude change
also, an adding process is intuitively at-
tractive, on the analogous argument that
the addition of favorable information
should increase the favorableness of the
response.

The most vocal attempt to support an
adding model for attitude change has been
made by Fishbein and his associates.


Triandis and Fishbein (1963) used a per-
sonality-impression task to compare a
summation model with congruity theory.
Mean correlations between observed and
predicted were .65 for the summation model,
.33 for congruity theory. Two later ex-
periments (L. R. Anderson & Fishbein,
1965; Fishbein & Hunter, 1964), also on
personality impression formation, showed
similar support for a regression model over
the congruity model.

For some reason, this disconfirmation
of congruity theory was generalized to in-
clude all averaging models on the one hand
and all balance-consistency theories on the
other. But congruity theory is a very
specialized averaging model as already
noted. Moreover, general balance theory
carries no commitment to adding or aver-
aging (Abelson, 1968a, p. 123; Anderson
19683). M. J. Rosenberg (1968) also
discusses this matter.

One further piece of evidence was given
by Fishbein and Hunter (1964). With
serial presentation of personality adjectives,
they found that the addition of adjectives
of essentially equal value increased the ex-
tremity of the response. This set-size
effect contradicts the simplest form of an
averaging model. However, essentially
the same result had been obtained in previ-
ous work on attitude change (Anderson,
1959a) and was interpreted to result from
the averaging in of an initial opinion.
Later work has supported this interpreta-
tion (e.g., Anderson, 1965, 1967; Anderson
& Alexander, 1971; Hendrick, 1968; Oden
& Anderson, 1971).

It should be emphasized that a quantita-
tive model can be used to good purpose in
an essentially qualitative way as exempli-
fied by the summation formulation used by
Rosenberg and by Peak. Their
means-ends formulation looks at the con-
sequences or properties of an issue or object.
Each consequence is considered to have a
certain value for the person (analogous to
the present $s$ parameter), and also a "per-
ceived instrumentality" (Rosenberg, 1960)
or "judged probability" of occurrence
(Peak, 1955) (analogous to the present $w$
parameter). Attitude toward the issue or
object is then simply $\sum ws$, an adding form-
ulation that seems to be conceptually
similar to that used by Fishbein (see Insko, 
1967, pp. 136, 197). Rosenberg tested this 
formulation by using posthypnotic sug- 
gestion to change attitudes on certain issues, 
and found associated changes in the corre- 
sponding w and s values. Although this 
work has been criticized on the ground that 
hypnosis is even less understood than at- 
titude change, there is no doubt about its 
great interest and potential. 


Regression-Correlation Formulations 


Much work on information integration, 
especially in clinical judgment, has rested 
on multiple regression and correlation 
analysis (e.g., Goldberg, 1968; Hammond, 
Hursch, & Todd, 1964; Hoffman, 1960; 
Meehl, 1954, 1960; Triandis & Fishbein, 
1963; Tucker, 1960; Wishner, 1960; Wyer, 
1969). This approach has suffered from 
two serious shortcomings. 

In the first place, the correlation-regres-
sion analyses that are reported seldom in-
clude a test of fit of the linear regression
model. It might seem that high correla-
tions between predicted and observed are
relevant evidence. However, very high
correlations are virtually guaranteed by
the usual experimental designs, even with 
a seriously defective model (Anderson, 
1962a). Adequate assessment must attend 
to the discrepancies from the model predic-
tions.

That discrepancies from an adding model
can be important despite high correlations
is illustrated in Sidowski and Anderson
(1967). Subjects judged the desirability
of working at certain occupations in cer-
tain cities. An adding model was applied,
and the correlations between predicted and
observed were .986 and .987 in two experi-
ments; yet there was a sizable interaction
localized at the occupation of teacher in
the least desirable city. Similarly, the
correlation for the date ratings
noted above (Lampel & Anderson,
1968) was .985 despite a strong Adjective
× Photo interaction. Even in an experi-
ment designed specifically to produce a
large configural effect (Anderson & Jacob-
son, 1965, Condition 2), the correlation was
.977. In all these experiments, the dis-
crepancy was clearly visible as a nonparallel-
ism, and was readily picked up by the
analysis of variance. 
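
The point that a high correlation can coexist with a real interaction can be illustrated numerically. The sketch below is only an illustration under stated assumptions: the 4 x 4 table, its row and column effects, and the size of the single-cell discrepancy are all hypothetical, and the additive prediction is the ordinary least-squares fit (row mean plus column mean minus grand mean).

```python
from statistics import mean


def additive_fit(table):
    """Least-squares additive prediction: row mean + column mean - grand mean."""
    row_means = [mean(row) for row in table]
    col_means = [mean(col) for col in zip(*table)]
    grand = mean(row_means)
    return [[r + c - grand for c in col_means] for r in row_means]


def pearson_r(xs, ys):
    """Product-moment correlation between two equal-length sequences."""
    mx, my = mean(xs), mean(ys)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / (sxx * syy) ** 0.5


# Hypothetical 4 x 4 design built from additive row and column effects ...
rows = [1.0, 3.0, 5.0, 7.0]
cols = [0.0, 2.0, 4.0, 6.0]
observed = [[r + c for c in cols] for r in rows]
# ... plus one interaction localized in a single cell.
observed[0][0] -= 3.0

predicted = additive_fit(observed)
flat_obs = [v for row in observed for v in row]
flat_pred = [v for row in predicted for v in row]
residuals = [[o - p for o, p in zip(orow, prow)]
             for orow, prow in zip(observed, predicted)]

r = pearson_r(flat_obs, flat_pred)
largest = max(abs(v) for row in residuals for v in row)
print(round(r, 3), round(largest, 3))
```

In this constructed case the correlation between observed and predicted exceeds .98, yet the residual table shows a discrepancy of more than 1.5 scale units concentrated in one cell, which is what an interaction test would pick up.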

Thus, the present approach leads to a 
different view from that of Goldberg’s 
(1968) review of linear models in clinical 
judgment. Goldberg considers three inter- 
pretations of the general finding that hu- 
man judges are seldom better than linear 
prediction models, and suggests the most 
attractive interpretation to be that judges 
really behave configurally but that the 
linear regression model is so powerful that 
it obscures the real configural processes. 
The results cited in the previous paragraph, 
as well as related work, point to a fourth 
interpretation: A linear-type integration 
model holds in many situations, but there 
are certain conditions that produce configu- 
ral response; further, analysis of variance 
has ample power to detect configurality 
when it occurs (Anderson, 1969a; Slovic, 
1969). 

The second difficulty with regression-cor- 
relation is more fundamental. Regression 
analysis typically proceeds under two 
scaling assumptions: that the stimulus 
values are known, and that the overt re- 
sponse is on an equal-interval scale. These 
assumptions naturally come under suspi- 
cion if the linear regression model does not 
fit well. Nonlinearity may be nothing
more than an inappropriate stimulus scale. 
For if the ostensible stimulus scale is not 
equal interval, then a linear function of the 
true subjective values will appear to be 
nonlinear. Similarly, apparent configural- 
ity may not represent true interaction but 
only an inappropriate response scale (An-
derson, 1961a, 1962b; Bogartz & Wackwitz,
1970). Without a theory of measurement 
to support the scaling assumptions, inter- 
pretations in terms of nonlinearity or con- 
figurality rest on uncertain ground. The 
advantage of functional measurement the- 
ory is that it gives a unified approach to 
these problems. 
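
How an inappropriate response scale can mimic configurality may be seen in a small numerical sketch. The values and the squaring transformation below are hypothetical, chosen only because squaring is strictly monotone on positive numbers; the sketch is not drawn from any of the cited experiments.

```python
def interaction_contrast(table):
    """2 x 2 interaction contrast; zero means the two rows plot as parallel."""
    return (table[1][1] - table[1][0]) - (table[0][1] - table[0][0])


def rescale(table, transform):
    """Apply a response transformation to every cell of the table."""
    return [[transform(v) for v in row] for row in table]


# Hypothetical subjective values that are strictly additive in a 2 x 2 design.
subjective = [[1.0, 3.0],
              [4.0, 6.0]]

# Hypothetical overt response scale: monotone but not equal interval.
overt = rescale(subjective, lambda v: v ** 2)

print(interaction_contrast(subjective), interaction_contrast(overt))
```

The subjective values show no interaction, while the overt responses show a sizable one, although the underlying integration rule is additive in both cases.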

This brief discussion should not obscure 
the great usefulness of regression analysis 
in certain practical prediction problems. 


As Yntema and Torgerson (1961) indicate, 
systematic discrepancies from a simple
additive rule of combination may not be 
serious in certain man-machine systems. 
Much of the work on clinical judgment can 
be viewed from a similar standpoint of 
practical prediction. Unfortunately, there 
seems to be an overwhelming temptation 
to generalize from product to process, that
is, from prediction to understanding. Good
prediction does not imply good under-
standing, as the correlation coefficients
quoted in the third paragraph of this sub-
section show. When the main concern is
to understand the psychological processes 
that underlie the judgment, any systematic 
discrepancy may be meaningful and impor- 
tant. 

It should also be emphasized that it is 
possible to apply regression analysis to the 
question of goodness of fit, as well as to the 
measurement question. Analysis of vari-
ance and multiple regression are both ap-
plications of the general
of mathematical statistics and have con-
siderable similarity. However, the
regression design corresponds to
confounded factorial design; there
ion are generally
difficult to localize and interpret. More-
over, nonindependence of the beta weights
(Darlington, 1968) can complicate the stim- 
ulus scaling. On the other hand, regression 
designs can be very efficient, since they re- 
quire far fewer observations than a com- 
plete factorial design. An attempt to apply 
multiple regression to attitude measure- 
ment has been made by Ramsay and Case 
(1970), though their approach assumes 
that the stimulus values are known and 
fails to supply a test of fit. Some work on
extending functional measurement to re-
gression analysis has been done by Bogartz
and Wackwitz (1970, 1971). 


Logical-Consistency Theory 


That human reasoning does not obey formal syl-
logistic logic is well known. It is also well known
that formal syllogisms are only a small part of logic,
so perhaps human reasoning might accord with
some more general logic model. This interesting
possibility has been studied by McGuire in a series
of articles (e.g., McGuire, 1960, 1968) on a logic
model for probabilistic beliefs. The basic idea
is that subjective probabilities or beliefs must be
consistent with one another in such a way that they
obey the laws of mathematical probability theory,
though they may otherwise have any relation to the
objective probabilities.

McGuire has applied this logic model in both
qualitative and quantitative ways. The qualitative
approach assumes that the tendency toward logical
consistency is complicated by other factors, such as
wishful thinking, and is concerned with Socratic
effects, cognitive structure, and related problems.
The quantitative approach, which has been adopted
by Wyer and Goldberg (1970) and Wyer, as-
sumes the basic postulate, that subjective probabili-
ties obey mathematical probability theory, to be ex-
actly true. Only this quantitative approach will be
considered here.

To facilitate comparison with integration theory,
only the following law of probability theory will be
considered:

$P_B = P_A P_{B|A} + P_{A'} P_{B|A'}$. [18]

Here $P_A$, $P_{A'}$, and $P_B$ are the probabilities of A,
not-A, and B; $P_{B|A}$ and $P_{B|A'}$ are the conditional
probabilities of B, given A and not-A. The logi-
cal-consistency model does not require subjective
probabilities to have any relation to objective prob-
abilities (e.g., expectation of success need not equal
its true probability), but it does require that the sub-
jective probabilities be related
in the manner specified by Equation 18.
"hat one's b 1 
issues ordinari] 
is a matter of comme 
Such a set oj 


logic model is not easy to determine. _As ap 
tion, the investigation might be limited to 


' fact or argument Hired das 

» the belief system may be a 
2 nged. M Sarl 

Naturalistic observation, Testing the logic m 


: r on the mea” 
Setting rests completely on 2 he five 


| as ob- 
It is fairly Straightforward to test between Fa sment 
served and „Predicted. An absolute meast t the 


Scale is required, however, and it is unclear A 
a at have been used in the expe intent 
yr even equalinterva| scales. Sha abe 
(1970b) has found mete aie between Pons 
ability rating words such as probably, rotation 
k eir functional scale values. oe i i 

o igs n analysis has the same shortcomings 
a that were discussed previously. — stive 
abili use of Tatings as measures of subite he co 
h ms trpecially propleditis p^ 
icatio S that occur in the ogic ! n 
sho oR, "If a bortion is infanticide, the 

then de Prosecuted like other munis 
ium 2) Presumably ought to get a » strongl/ 
g; but Probably such ratings would be 5 e 


^ ri 
affected in many people by their belief in the pre-
mise or the conclusion (Lefford, 1946; McGuire,
1968).
Equation 18, unfortunately, may be insensitive
to such biases when the beliefs are extreme. If $P_A$,
the belief in the premise, is near zero, bias in $P_{B|A}$
will have little effect. This illustrates that the logic
model may yield reasonably good predictions even
when it is seriously wrong. Of course, even if the
ratings of the conditional events are not valid, that
does not necessarily infirm the logic model. The
person's internal cogitation may be perfectly con-
sistent with Equation 18 without his being able to
assign valid ratings to the events.

Two brief comments should perhaps be added on
the use of such probability ratings. First, the pro-
cesses that underlie such ratings may be of consider-
able interest and worth closer study in their own
right. Second, there are many purposes for which
an equal-interval scale is not needed. Random as-
signment allows comparison between different ex-
perimental treatments even with an ordinal scale
(Anderson, 1961a, 1963). Similarly, McGuire's
(1960) interesting work on the Socratic effect does
not necessarily require an interval scale.

Experimental analysis. If integration theory can
be applied to these probability judgments, a radical
simplification of the measurement problem is possi-
ble. Accordingly, the blanket assumption is made
that all five terms in Equation 18 follow this formu-
lation.


Since $P_A$, $P_{A'}$, and $P_B$ presumably follow an
averaging rather than an adding model, some care
would be desirable to ensure equal weighting of the
communications. For the two conditionals, $P_{B|A}$
and $P_{B|A'}$, a multiplying model would seem appro-
priate, although the overt ratings themselves might
follow an averaging model.

The question to be considered is whether the
logic model and the integration formulation are mu-
tually consistent. Application 1, in which two com-
munications are given in a factorial design, will be
used for this purpose. This reflects a key experi-
mental change from the work of McGuire (1960) and
of Wyer and Goldberg (1970); the use of two com-
munications in a factorial design provides the lever-
age for a critical comparison.

Two cases need to be considered. The first case
rests on the assumption that the communications
have no effect on the conditionals, $P_{B|A}$ or $P_{B|A'}$, but
only affect the belief in A, A', and B. By assumption,
these beliefs follow the integration model, and sub-
stitution of the corresponding theoretical expressions
into Equation 18 yields the following predictions
for $P_B$:


$P_{Bij} = (w_0 P_{A0} + w_R s_{Ri} + w_C s_{Cj}) P_{B|A} + (w_0' P_{A'0} + w_R' s_{Ri}' + w_C' s_{Cj}') P_{B|A'}$. [19]

This expression is additive in the manipulated
scale values. The joint application of the logic
model and integration theory thus implies zero in-
teraction in the predicted values of $P_B$. Since this
agrees with the assumption that the judgments of
$P_B$ obey the integration model, the logic model is
consistent with integration theory in this case.

This test, it should be emphasized, can be made
directly on the raw ratings of $P_B$. The problems of
measuring the four terms on the right of Equation 18
have been completely bypassed. Furthermore, be-
cause of the response-scaling feature of functional
measurement, only an ordinal response scale is
strictly necessary for judgments of $P_B$. This test
does depend on getting communications that do not
affect the conditional probabilities, but that should
be straightforward, at least if the logic model is
correct.

As the second case, suppose that one or both
communications also affect belief in $P_{B|A}$ and/or
$P_{B|A'}$. This could be arranged experimentally by an
appropriate mixture of arguments. Equation 19
must then be changed by substituting in these theo-
retical expressions as well. This done, the predicted
values of $P_{Bij}$ no longer obey the parallelism predic-
tion. This contradicts the original assumption that
the averaging model applies to $P_B$. In this second
case, therefore, integration theory and the logic
model are inconsistent: if $P_B$ obeys the averaging
model, the logic model cannot hold.

Both cases are needed for a proper critical test. 
Although the first case does not distinguish between 
the models, it is necessary to show that at least one 
of the models may be correct. The personality
adjective task has obvious advantages as an experi-
mental situation. The conditional probabilities
would then depend on the implicational similarity of 
the traits. 
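
The two cases can be illustrated with a small numerical sketch. Everything in it is an assumption made for illustration: the belief in A is taken to be additive in hypothetical row and column communication effects, the belief in not-A is taken as one minus the belief in A, and the numerical values are arbitrary. The belief in B is then computed from the law of total probability (Equation 18), and the 2 x 2 interaction contrast is examined.

```python
def belief_in_b(p_a, p_b_given_a, p_b_given_not_a):
    """Law of total probability, assuming P(not-A) = 1 - P(A)."""
    return p_a * p_b_given_a + (1.0 - p_a) * p_b_given_not_a


def interaction_contrast(table):
    """2 x 2 interaction contrast; zero corresponds to parallelism."""
    return (table[1][1] - table[1][0]) - (table[0][1] - table[0][0])


# Hypothetical additive effects of the row and column communications on P(A).
base = 0.3
row_effect = [0.0, 0.2]
col_effect = [0.0, 0.3]
p_a = [[base + r + c for c in col_effect] for r in row_effect]

# Case 1: the communications leave both conditionals unchanged.
case1 = [[belief_in_b(p_a[i][j], 0.9, 0.2) for j in range(2)]
         for i in range(2)]

# Case 2: the column communication also changes P(B|A) (hypothetical values).
p_b_given_a_by_col = [0.6, 0.9]
case2 = [[belief_in_b(p_a[i][j], p_b_given_a_by_col[j], 0.2) for j in range(2)]
         for i in range(2)]

print(round(interaction_contrast(case1), 6), round(interaction_contrast(case2), 6))
```

Under these assumptions the first case yields a zero interaction contrast and the second a nonzero one, in line with the parallelism argument in the text.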

One essential difference between the two ap- 
proaches is in the conceptualization of the attitude 
change process for $P_B$. In the integration model, the
informational stimuli are considered to bear directly
on $P_B$. In the logic model, these changes occur in-
directly, mediated by changes in $P_A$, $P_{B|A}$, etc. In
the logic model, the exact analysis of Equation 19
depends on whether $P_{B|A}$ and $P_{B|A'}$ follow an aver-
aging, adding, or compound averaging-adding model,
but some simple predictions about matrix rank are
available.

Bayesian statistics. Modern developments in 
mathematical statistics have emphasized personal 
probability and Bayesian statistics (Edwards, Lind- 
man, & Savage, 1963; Savage, 1957). The Bayesian 
approach has close parallels with logical-consistency 
theory. Part of its interest lies in its emphasis on 
the revision of statistical probability in the light of 
new evidence, analogous to opinion change. Al- 
though the Bayesian development is a strictly statis- 
tical theory, considerable experimental work has 
been done using it as a normative model of how men 
ought to process probabilistic information (see 
Edwards, 1968; Peterson & Beach, 1967; Slovic & 
Lichtenstein, 1970). 

It is the fate of normative theories that men do not
behave as they normatively ought. Aside from the
ubiquitous order effects, the response is usually
much more "conservative" than is statistically cor-
rect. Several explanations of "conservatism" have
been given (Edwards, 1968), including misperception,
misaggregation, and response bias.

In terms of integration theory, misperception and
misaggregation would both correspond to weight 
parameters different from the statistically optimal 
values. Since integration theory is descriptive 
rather than normative, it is directly concerned with 
the determinants of the weights (Shanteau, 1970a).
Nonoptimal weighting in the Bayesian binomial 
inference task would be no more surprising than in 
number averaging (Anderson, 1964d, 1968d), but 
deviations from a statistical standard, however im- 
portant they might be in practical decision prob- 
lems, seem unlikely to say much about psychological 
processes. In the present approach, accordingly, 
the emphasis is on getting a theoretical framework 
that is psychologically congruent to the behavior. 

The response bias interpretation has recently been 
supported by DuCharme (1970). Of itself, response 
bias would not necessarily be serious, since the 
methods of functional measurement could be used 
to rectify a distorted response scale. However, the 
work of Shanteau (1970a) suggests a more funda- 
mental response problem. The standard bookbag- 
and-poker-chip task used in the Bayesian research 
asks for inference judgments of the probability that 
there are more white than red chips in a population 
sampled one chip at a time. Shanteau compared 
such inference judgments with estimation judgments 
about the proportion of white chips in the popula- 
tion. No difference was found between these two 
conditions, and Shanteau's careful experimental 
work indicated that both instruction conditions 
produced estimation judgments. Shanteau's re- 
sults may thus invalidate the Bayesian interpreta- 
tion of the binomial inference experiments. 
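
The distinction between inference and estimation judgments can be made concrete with a short sketch. The task parameters are assumptions made only for illustration: two equally likely populations, one with proportion .7 white chips and the other with proportion .3, and a hypothetical sample of five white and three red chips. The Bayesian inference response is the posterior probability of the predominantly white population; the estimation response is taken here to be simply the sample proportion of white chips.

```python
def posterior_mostly_white(white, red, p=0.7, prior=0.5):
    """Posterior probability of the predominantly white population (Bayes' rule)."""
    like_white = p ** white * (1 - p) ** red   # likelihood under the white-heavy bag
    like_red = (1 - p) ** white * p ** red     # likelihood under the red-heavy bag
    return prior * like_white / (prior * like_white + (1 - prior) * like_red)


def sample_proportion(white, red):
    """Estimation response assumed here: observed proportion of white chips."""
    return white / (white + red)


white, red = 5, 3  # hypothetical sample
print(round(posterior_mostly_white(white, red), 3),
      round(sample_proportion(white, red), 3))
```

With these values the Bayesian posterior is about .84, whereas the proportion estimate is .625. A subject who gives an estimation judgment under inference instructions would therefore appear "conservative" relative to the Bayesian standard.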


Proportional-Change Model 


The proportional-change model for attitude 
change (Anderson, 1959a; Anderson & Hovland, 
1957) postulates that the amount of attitude change 
is proportional to the advocated change: 


$s_1 = s_0 + w(s_C - s_0) = w s_C + (1 - w) s_0$, [20]

where $s_0$ and $s_1$ are the opinions before and after re-
ceipt of a communication with value $s_C$ and weight
$w$. An analogous postulate was used in learning
theory by Hull (1943). An extensive mathematical 
development was given by Estes and Burke (1953) 
and Bush and Mosteller (1955) for discrete choice 
tasks and by Anderson (1961b, 1964a, 1964b) and
Rouanet and Rosenberg (1964) for continuous re-
sponse tasks. In these models, the weight parameter
represents the learning rate. 
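
The proportional-change rule of Equation 20 can be sketched in a few lines. The function names and the numerical values are hypothetical; the sketch assumes a constant weight across communications, which is the simplest case and not a claim about any particular data set. It also shows the effective weight that each communication carries in the final opinion when the rule is applied repeatedly.

```python
def proportional_change(s0, s_c, w):
    """Equation 20: move the opinion a fraction w of the way toward s_c."""
    return s0 + w * (s_c - s0)


def apply_sequence(s0, communications, w):
    """Apply the rule to each communication in turn; return all opinions."""
    opinion = s0
    history = [opinion]
    for s_c in communications:
        opinion = proportional_change(opinion, s_c, w)
        history.append(opinion)
    return history


def effective_weights(n, w):
    """Weights of the initial opinion and of communications 1..n in the final opinion."""
    initial = (1 - w) ** n
    later = [w * (1 - w) ** (n - k) for k in range(1, n + 1)]
    return [initial] + later


# Hypothetical case: initial opinion 0, three communications of value 10, w = .5.
history = apply_sequence(0.0, [10.0, 10.0, 10.0], 0.5)
weights = effective_weights(3, 0.5)
print(history)
print(weights)
```

With a constant weight the effective weights sum to one and increase toward the most recent communication, so the final opinion is a weighted average of the initial opinion and the communications.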

In its original form, the proportional-change 
model has not been very successful. In learning, the 
critical tests are the sequential dependencies (An- 
derson, 1959b; Atkinson & Estes, 1963). The fail- 
ure of stimulus sampling theory of probability 
learning has resulted from its inability to account 
for these dependencies (see e.g., Anderson, 1960,
1964c, 1966b; Anderson & Grant, 1957, 1958; Ander-
son & Whalen, 1960; Friedman et al., 1968; Jones,
1971).

Attitude change experiments, in contrast to learn-
ing experiments, typically involve only a few “trials.”
As a consequence, the weight parameter will change 
over successive communications (Anderson, 1959a). 
A related difficulty arises in Abelson's (1964) gen- 
eralization of the Anderson-Hovland model to group 
interaction situations which, under fairly general 
conditions, predicts that everyone in a given group 
will eventually reach the same opinion. This
difficulty can be avoided if the communications are 
treated as informational stimuli. A given communi- 
cation then loses its effectiveness over successive 
repetitions since it conveys no new information. 

The present serial integration model includes the 
proportional-change model as a special case and has 
considerably greater flexibility. A principal ad- 
vantage is its ability to allow for changes in the 
weight parameter over a sequence of informational 
stimuli. Complete serial position curves, which 
may include both primacy and recency components,
thus become available (e.g., Anderson, 1964d,
1965b; Shanteau, 1970a; Weiss & Anderson, 1969). 


SOURCE AND COMMUNICATION 
PARAMETERS 


The weight and value parameters are 
properties of the source-communication 
combination. Much work on attitude 
change is not concerned with source effects 
and there may not even be an explicit 
source. When paragraphs about United 
States Presidents are given as historical 
fact (Sawyers & Anderson, 1971), the con- 
cept of source is relevant, but it resides at 
large in the cultural-experimental context.
At the other extreme, the source may be 
explicit and its nature an integral part of 
the communication. 

The valuation problem is interesting in
its own right, and it is also relevant to the
various design applications that have been
considered. The problem is complex, de-
pending heavily on the situational details,
and a brief discussion necessitates somewhat
cavalier treatment. The following rough
analysis should be useful in many cases and
is given to illustrate some of the main
points without explicit concern for its
evident limitations.


Molar Analysis


To begin, it seems reasonable to expect
that within limits the scale value will be in-
dependent of the source. There are im-
portant exceptions, of course, especially
when the source is an integral part of the
communication. In oral presentations, for 
example, warmth of voice and gesture 
could become attributes of the communica- 
tion and affect its scale value. However, 
the scale value corresponds to the position 
of the communication on the issue. In a 
substantial part of attitude change research,
this position is conveyed by the semantic
content of the communication and should
be independent of the source.

Determination of the weight parameter
is markedly more complex as shown by
numerous experimental studies summarized
in McGuire's canonical chapter on attitude
change (McGuire, 1969, pp. 177 ff.).
Some communications, based on consis-
tency or affective arguments, can have a
completely source-free weight, but in most
cases the weight will depend importantly
on the source. The source may be merely
implicit, defined by the cultural-experi-
mental situation, or source weight may be
explicitly varied. The weight will also de-
pend on specific properties of the communi-
cation such as clarity and redundancy.
Both the source and the communication
thus contribute to the weight param-
eter. If these contributions are indepen-
dent, as might be expected in some situa-
tions, it would be of interest to know how
they combine. The most plausible hy-
pothesis would seem to be a multiplying
model, though a composite adding-multi-
plying model might prove necessary.

In general, source and communication
will interact with each other, and with the
issue as well, in determining the weight
parameter. A communication at variance
with the source's known position on the issue
might be partially discredited or discounted,
that is, given a lower weight. Source ex-
pertise would also interact with the issue;
d Ne cd would presumably be more
on pollution control than on nar-
cotics addiction.

Application 4 of the averaging model may
be especially useful in the analysis of source
effects and interactions.


For example,
a given communication could be attributed
to different sources in different columns of
the design. In each case, source and com-
munication could interact in any way. The 
interval scale of the weight parameter 
would then provide an assessment of the 
source-communication-issue interactions as 
reflected in the weight parameter. 


Molecular Analysis 


Thus far, the communication has been 
treated as a unit with a single weight and 
scale value. However, it will usually have 
a more or less complex structure, contain- 
ing various separate statements and argu- 
ments, and its molar effect will itself result 
from information integration. The follow- 
ing classification is neither novel nor com- 
plete, but it indicates how a molecular 
analysis might proceed. 

Means-ends assertions constitute one 
great class of persuasive arguments (see 
McGuire, 1969, pp. 153 ff.). The source 
asserts that a certain action or belief will 
produce a good or avoid an evil, thereby 
modifying the acceptance of that action or 
belief. Each such argument is a subcom- 
munication, and indeed a source-communi- 
cation-issue instance. The issue is the good 
or evil end, and is the primary determinant 
of the value of the subcommunication. The 
weight parameter will reflect the impor- 
tance of the end, and also the persuasive 
properties of the source and communication 
that link means and end. On this analysis, 
which is similar to that of Peak (1955) and 
Rosenberg (1956), each means-end argu- 
ment counts as one piece of information to 
be integrated into the overall opinion. 

The other class of persuasive arguments 
can be called inferences. Letters of refer- 
ence, for example, are usually amplified 
lists of personality adjectives. To judge 
the intelligence or motivation of someone 
described as level-headed requires an un- 
certain inference about the parameters of 
the given information with respect to the 
specified dimension of judgment. Both 
weight and scale value would be determined 
primarily by the semantic-actuarial rela- 
tions between level-headedness and intelli- 
gence or motivation. Source characteristics 
would also affect the valuation process, of
course.

Bare assertions by the source, “I disliked
the movie,” and “I consider Harry
Truman one of the greater United States
Presidents,” have received considerable
study as prestige suggestions. These may
be considered inferences even though the
inference may be a conditioned response of 
acceptance or rejection. Such statements 
have their weight determined by the source 
and their scale value by the semantic con- 
tent of the assertion. With a design based 
on two prestige sources, Application 1 
would provide a straightforward test of the 
integration model. 

Consistency arguments may fall into 
both classes. “If you consider self-reliance 
a virtue, then you should allow your chil- 
dren more freedom” requires an assessment 
of the force of the implication. A means- 
end evaluation is also needed. Some con- 
sistency arguments are simple direct in- 
ferences, however. 

This classification bridges the molar to
the molecular. It is thus one approach to
the detailed processing and integration of
complex communications. Many communications
will not stand a simple dissection,
of course, but the total effect of those that
do may be simple functions of the components.
Suppose that $w_1, w_2, \ldots$ are the
weights and $s_1, s_2, \ldots$ the scale values of the
components of some communication. Averaging
theory then implies that the scale
value of the whole is the mean value of the
parts,

$$s = \frac{\sum_i w_i s_i}{\sum_i w_i} \qquad [21]$$

and that the weight of the whole is the sum
of the weights of the parts,

$$w = \sum_i w_i \qquad [22]$$

Equation 22 has interest as an adding rule 
within averaging theory. Inconsistency or 
redundancy among the components could 
affect their weights, of course. Theoreti- 
cally, Equations 21 and 22 should still apply 
with these altered weights. 
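
The two rules can be illustrated with a minimal sketch. The function name and the numerical weights and scale values below are hypothetical, chosen only to show the arithmetic of Equations 21 and 22; they are not data from any experiment.

```python
def integrate_components(weights, scale_values):
    """Combine the components of one communication.

    Returns (scale value of the whole, weight of the whole), following the
    averaging rule of Equation 21 and the adding rule of Equation 22.
    """
    if len(weights) != len(scale_values) or not weights:
        raise ValueError("need one scale value per weight, and at least one pair")
    total_weight = sum(weights)  # Equation 22: weights add
    mean_value = sum(w * s for w, s in zip(weights, scale_values)) / total_weight  # Equation 21
    return mean_value, total_weight


# Hypothetical communication with three arguments.
s_whole, w_whole = integrate_components([1.0, 2.0, 1.0], [4.0, 7.0, 2.0])
```

If inconsistency or redundancy altered the component weights, the same function would simply be called with the altered weights.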


COGNITIVE CONSISTENCY


Cognitive consistency is central in attitude
structure, and many current theories
of attitude change take some consistency
principle as their basic postulate and point
of departure. The popularity of this approach
is to be seen among the 84 chapters
of Theories of Cognitive Consistency: A
Sourcebook (Abelson, Aronson, McGuire,
Newcomb, Rosenberg, & Tannenbaum,
1968). The variety, interest, and importance
of the questions that have been raised
by the consistency theorists are impressive.

But as the basis for a general theory of
attitudes, a consistency postulate seems to
be inherently inadequate. Much attitude
change does not involve inconsistency resolution
but only straightforward integration
of information (Anderson, 1968a;
Lindner, 1971). New information, including
another's opinions and arguments, may
alter attitudes and actions in the absence
of any imbalance, incongruity, dissonance,
or any other inconsistency. If this is correct,
then a consistency principle is inherently
too limited to support a general theory
of attitudes. Just this limitation seems to
be reflected in the conceptual insufficiencies
of the various consistency principles, as
well as in the problems that have arisen in
attempts at quantitative treatment. Much
of the difficulty seems to result from forcing
the consistency principle into situations
that do not involve inconsistency.

The present approach has developed
differently, beginning with situations that
do not require inconsistency resolution.
The logic of this attack is straightforward.
Inconsistency among the informational
stimuli will affect their parameter values.
The test of a simple averaging or adding
model, though it requires some care, is
straightforward if the stimulus parameters
are constant. But if they vary across context,
the analysis is markedly more difficult.
It then becomes difficult to distinguish
parameter changes from alternative integration
rules. And it is also difficult to
distinguish basic defects in the model from
shortcomings in the experimental techniques.

The success of the averaging model with
consistent information makes it possible to
interpret deviations produced by integration
of inconsistent information (Anderson
& Jacobson, 1965). Moreover, success in
the simpler situations helps validate the
experimental techniques and the analytical
procedures. The development of functional
measurement theory, in particular, and the
use of numerical ratings as equal-interval
data have been central in this work. The
support that this methodology has received
in the simpler tasks provides part of
the basis for attributing psychological
meaning to discrepancies from the simple
model in other situations (e.g., Lampel &
Anderson, 1968; Oden & Anderson, 1971;
Sidowski & Anderson, 1967; Slovic, 1969).


Inconsistency Resolution 


Mechanisms for resolving inconsistency
have been studied by numerous writers
(e.g., Abelson, 1959, 1968b; Adams, 1968;
Aronson, 1968; Feather, 1967; Festinger,
1957; Gollob, 1968; Hardyck & Kardush,
1968; Kaplan & Crockett, 1968; Kelman &
Baron, 1968; McGuire, 1966, 1968; Weick,
1968). The present analysis has various
similarities to previous treatments, though
perhaps with some advantages in precision.

The most direct method of resolving inconsistency
is in the valuation operation. If
the informational stimuli are inconsistent,
changes in their meaning or in their
importance could reduce the inconsistency.
In terms of the model, changes in meaning
would be reflected as changes in s;
changes in importance would be reflected as
changes in w.

Attitude change experiments may explicitly
[...] involve judgments of
[...] communication as well as the
[...]. The valuation process then becomes
[...], as may be illustrated by considering
an unpleasant communication attributed
to a respected source. For the
communication itself, the evidence below
[...] that w will decrease but that s will
[...]. The decrease in w may
[...] the source or from inconsistency
discounting, equivalent here to
[...] source and communication.
[...] the communication constitutes
[...] of information about the
[...]. To the extent that it is not discounted,
it will decrease the scale value of
the source according to the analysis given
in the section on balance theory.

Other mechanisms for resolving incon- 
sistency will not be considered here, but it 
should be noted that the relation between 
valuation and integration needs closer 
analysis. The present discussion has im- 
plicitly viewed valuation as preceding inte- 
gration. This is too simple a view: incon- 
sistency can only exist as a consequence of 
an attempted integration. Valuation and 
integration must then proceed concurrently 
and a more molecular analysis is needed to 
delineate the temporal course of the total 
process. 

Within the present framework, however, 
the two main modes of inconsistency resolu- 
tion are change in meaning and change in 
weight. The evidence on these two modes 
will be discussed briefly to illustrate some 
of the problems. 


Change of Meaning 


The hypothesis that words change mean- 
ing as a function of context has great intu- 
itive appeal. Certainly, it is true under 
certain conditions, as for words that have 
distinct alternative meanings (Anderson, 
1968c). However, whether words with a 
single principal meaning change as a 
function of context is doubtful and cer- 
tainly not well supported by existing data. 
Various lines of evidence have been con- 
sidered, but none gives very convincing 
evidence for change of meaning in this 
case. 

Evidence from primacy. One line of 
evidence that has received considerable 
study is the primacy effect in personality 
impressions first noted by Asch (1946). 
If good and bad traits are presented in 
serial order, the earlier traits have greater 
influence on the overall impression. Asch 
interpreted this in terms of change of mean- 
ing, as though the subject was selecting out 
those shades of meaning of the later words 
that would fit better with the earlier words 
(Anderson, 1965b; Asch, 1946). However, 
an extended series of experiments has pro- 
vided little support for this view (see 
Anderson, 1965b, 1971a). Instead, it ap- 
pears that the later adjectives keep a fixed 
scale value but get lower weight. Whether
this results from discounting or attention 
decrement, however, is not yet completely 
clear (see Hendrick & Costantini, 1970). 

In other work, Asch (1948) has argued 
that political statements change their 
meaning when attributed to different 
sources. These results, however, can be 
accounted for directly in terms of integration
theory. It is only necessary to assume
that the subject is attempting to judge the 
position of the source on the issue. For this 
he has two pieces of information: the state- 
ment itself and his prior opinion of the 
source's position. His judgment will be a
composite or weighted average of these two 
informational stimuli and hence will vary 
directly with his opinion of the source. 
In this judgmental interpretation, it is not 
necessary to assume that the statement 
changes in any way. 

Component judgments. Further support 
for a judgmental view comes from work on 
judgments of the single items of a combina- 
tion. If subjects are asked to judge the 
likableness of each separate trait in a person 
description, there is a positive context
effect: the rating of each trait shifts from its
context-free value toward the values of the
other traits (Anderson, 1966a; Anderson &
Lampel, 1965). The obvious interpretation
is that these judgments directly reflect
change in meaning. Beyond plausibility,
however, there is no satisfactory evidence for
change of meaning, and the alternative hypothesis
of a generalized halo effect is
equally plausible. The available evidence
(Anderson, 1971a; Kaplan, 1971) supports 
the judgmental view. 

Contrast and assimilation. The concepts
of contrast and assimilation have been
widely but uncritically used in social psychology.
Such effects, if real, would correspond
to changes in meaning or scale
value. Assimilation, however, may result
from composite judgmental processes as
already suggested, and it is well known that
contrast effects may be artifacts of the response
language rather than changes in
stimulus value. Even with psychophysical
stimuli such as lifted weights, contrast effects
are not always obtained (Anderson,
1971b; Schiffman, Goldstein, & Aroksaar,
1970).

The problem of contrast with verbal
stimuli has been considered by numerous
workers (e.g., Anderson & Lampel, 1965;
Campbell, Lewis, & Hunt, 1958; Dawes,
1971; Manis, 1971; Parducci, 1965; Segall,
1959; Upshaw, 1965). It seems clear that
most observed contrast effects stem [...]
from response language usage, not from a
true change in stimulus value. [...]
(1971) report has special interest in this
respect because it indicates that people with
opposite views are perceived as more extreme
than in fact they are. In [...],
some evidence suggests that true [...]
contrast may be obtained under certain
circumstances. Even if that is true, however,
it does not justify the use of [...]
measures that are severely contaminated
by artifacts.

Judgmental context. The scale value of a
word will certainly depend on the dimension
of judgment. Masculinity is more desirable
in men than women, for instance.
This case is like that of words that have
more than one principal meaning, and
illustrates the role of the judgmental context
in the valuation process. Similarly,
the scale value of lighthearted would be
different in judgments of likableness than
of dependability.

Discounting refers to decreased weight
owing to interaction among the informational
stimuli. If a new piece of information
is inconsistent with the old, the inconsistency
can be reduced by assigning it less
weight or importance. Direct evidence for
inconsistency discounting in the personality-impression
task is given by Anderson
and Jacobson (1965) and Schümer and
Cohen (1968). An important [...], using
more realistic stimuli, that bears on discounting
processes has been made by
Bugental, Kaswan, and Love (1970).
Discounting has also been studied [...]
(Anderson, 1968d). Moreover, the [...]
form of redundancy interaction (e.g.,
Brewer, 1968; Dustin & Baldwin [...];
Schmidt, 1969) would also correspond to
decreases in weight. In contrast to the
change of meaning hypothesis, then, the 
change of weight hypothesis seems to be on 
firm ground. 

Stimulus interaction might possibly increase
weight parameters. No evidence
seems to exist, though increased weighting
might occur when two stimuli form some
natural unit. Chalmers (1969) has argued
for both incremental and decremental
weighting in personality impression formation
by analogy with facilitation and interference
in verbal learning. (It should be
noted that Chalmers used the term "change
of meaning" to refer to changes in the weight
parameter.)

Discounting has received relatively little
study, and that mainly with indirect measures,
though an attempt to extend discounting
analysis has been made by Schümer
and Cohen (1968). Direct subjective
ratings of importance would be most desirable,
and some very tentative evidence
for the usefulness of importance ratings is
given by Anderson and Alexander (1971).
[...] of such importance ratings
[...] especially valuable for configural
analysis, since it is evident that discounting
must depend on the configural pattern
of the stimuli.


CONCLUDING COMMENTS 


An [...] feature of integration theory
is [...] a functional theory of measurement.
Scaling thus becomes an organic
[...] investigation. The
scales are developed and used directly in
[...], and their validity
[...] the validation of the psychological
[...]. In a [...] sense, this must be true of
any [...] theory, but typically the
scaling problem in psychology has been considered
[...] to be accomplished as a
[...] preliminary. In functional
measurement theory, stimulus and response
scaling and the psychological law are
co[...] in their development.

Integration theory is especially concerned
with [...] that require putting together
[...] pieces of information. Such
[...] is not unique to opinions and
attitudes. Learning, perception, judgment,
and decision making also involve
information integration. There should be 
a unified theory that covers all these areas. 
The present formulation has shown some 
initial promise in each of these areas, 
and it may provide a basis for the develop- 
ment of a unified general theory. 
GEOMETRIC FIGURES *
New York University


A model of how humans perceive straight-line two- and three-dimensional
geometric figures, for example, triangles, trapezoids, cubes, is presented. The
major characteristics of the model are: (a) a figure is represented as a hierarchy
of perceptual elements (lines, angles, areas); and (b) the process of perceptually
organizing a figure is portrayed as a stochastic sampling of the elements. In
addition, the total number of trials to sample the elements corrected by a
measure of redundancy is proposed as a measure of the figure's complexity.
The model successfully predicts the judged complexity of a wide variety of
two-dimensional figures; with some additional rationale it also predicts the
perceived three-dimensionality of figures such as Necker cubes which can be
seen in either two or three dimensions. In general, the predictions of perceived
three-dimensionality are supported but there are discrepancies and
other important unresolved problems in the treatment of perceived
three-dimensionality.


This paper outlines a model of how humans
perceive simple geometric figures. By
simple [...] we mean two-dimensional
[...] of figures composed entirely
of straight lines. Familiar examples are
triangles, trapezoids, cubes, etc. The
model has two major characteristics. First,
a figure is represented as a hierarchy of
perceptual elements; second, the process of
perceptually organizing these elements is
portrayed as a stochastic sampling process.
In addition, the total number of sampling
trials [...] for the complete sampling
of a figure, corrected by a measure of the
figure's redundancy, is proposed as a measure
of the complexity of the figure.


THE MODEL: TWO-DIMENSIONAL FIGURES


Stimulus Elements 


The stimulus elements for two-dimensional
figures assumed by the model are
lines, angles, and areas. Any simple geometric
figure can be specified by enumerating
and describing these elements, and the
model assumes that the perception of a
figure is also an evaluation of these elements.
In general terms, these elements
are the same as those of elementary geometry,
although there are some minor modifications
required by the kind of stimulus
figures under consideration and by certain
apparently overriding intuitive considerations.

1 This work was supported by Grant NS-07636-03
from the National Institute of Neurological Diseases and
Stroke [...].
2 Requests for reprints should be sent to Paul C.

Line. A line in our simple patterns is a 
straight line, except that it extends only to 
the intersection with another line and not 
beyond it. For example, in Figure 1 there 
are the following eight lines: AB, AC, AD, 
BC, BE, CD, CE, and DE. Our definition 
excludes the theoretically possible lines AE 
and BD. 

Angle. An angle is created by the intersection
of two lines. Angles are defined by
the model only between lines which are
adjacent within the figure. Again, in
Figure 1, there are 12 angles: BAC, ABC,
BCA, etc. The definition excludes angles
such as BAD and ADE. In addition, an
angle is defined as including the entire
length of both the intersecting lines. This
definition should be distinguished from
definitions or uses of the term “angle”
or "corner" which only include some unspecified
length of each of the intersecting
lines. Such uses, for example in Hebb’s
(1949, Ch. 5) theory of the elements involved
in the perception of a triangle, have
different implications for perception.

Fig. 1. Example of simple straight-line figure.

Area. Similarly, an area is restricted to 
the smallest component areas making up 
the entire figure. It will be seen by an 
examination of the example already given 
that these areas are generated by the lines 
and angles defined previously. In our ex- 
ample, the permissible areas are ABCA, 
ACDA, CDEC, and BCEB, but not 
ABECA or ABEDA.3

The major reason for the exclusion of 
the logically possible elements mentioned 
above is that the excluded elements would 
have overlapped with elements which are 
already included. That is, the included
elements are the smallest set of nonover- 
lapping or independent elements which 
completely characterize the pattern. 

Although the model's elements can be 
taken as assumptions, they do have a logical 
rationale. They are the simplest set of 
elements at each level. For example, the 
unbroken straight lines of a square are
simpler than any other set of lines describing 
the square. We define the simpler of two 
sets of elements as that set with the fewer 
number of different elements; and if two 
sets have the same number of different 
elements, then the set with the least vari- 
ance in element size is the simpler. 

The reader might enquire why points,
another basic geometric element, are not
considered as elements for the model.
Points falling on a line, although theoretically
present, are assumed to have no perceptual
existence. Points defined by the
intersection of two lines are excluded, since
whatever properties they have are assumed
to be represented adequately by the lines
and angles of which they are a part.

3 A fourth element, volumes, will be treated very
briefly in the last section of this paper on
three-dimensional perception.


Sampling Process 


The model represents the sampling of
elements of a given kind as a process of
independent sampling with replacement.
For example, if a figure contains four lines
of equal length, these lines are sampled
with equal probabilities on each trial until
the process stops sampling lines and begins
sampling angles.

The model assumes that the probabilities
of sampling of the elements at a given level
are not necessarily the same, but instead
are a function of the magnitudes of the
elements. For example, consider a triangle
with sides (lines) three, four, and five in
length. The model assumes that the probability
of sampling the side of Length 5 will
be the greatest, and that for Length 3 the
least. Specifically, the probability is equal
to the proportion of the total magnitude
contributed by that element. In the above
example, the probability of sampling the
line of Length 5 is 5/(3 + 4 + 5) = 5/12;
for Length 4, the probability is 4/12; for
Length 3, 3/12. The magnitude or size of
each of the classes of elements is defined as
follows: of lines, it is the sum of the lengths
of the lines; of angles, it is the sum of the
interior angles; and of areas, it is the total
area of the figure. Returning to the example
of the triangle with sides of three,
four, and five, we see that there are three
lines, three angles, and one area. As we
saw before, the line probabilities are 3/12,
4/12, and 5/12. The angles of this triangle
are 37°, 53°, and 90°, so that the corresponding
sampling probabilities are 37/180,
53/180, and 90/180. Finally, since there is
only one area, it is sampled with probability
one.

The perceptual process, then, is assumed
to be a process of sampling different classes
of elements with probabilities assigned as
outlined above. These classes of elements
are arranged in a hierarchy, with the
sampling process beginning with lines and
ending with areas. All elements of a given
class are sampled before the process moves
on to the next class in the hierarchy.

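
The proportional rule for the 3-4-5 triangle can be written out as a short sketch. The function name is hypothetical, and the angles are the rounded degree values used in the text; the area (6 square units) is included only to show that a class with one element is sampled with probability one.

```python
from fractions import Fraction


def sampling_probabilities(magnitudes):
    """Probability of sampling each element of one class: its share of the class total."""
    total = sum(magnitudes)
    return [Fraction(m, total) for m in magnitudes]


# The 3-4-5 right triangle, one class of elements at a time.
line_probs = sampling_probabilities([3, 4, 5])       # lengths of the three sides
angle_probs = sampling_probabilities([37, 53, 90])   # interior angles in degrees (rounded)
area_probs = sampling_probabilities([6])             # the single enclosed area
```

Exact fractions are used so that the values can be compared directly with 3/12, 4/12, and 5/12 after reduction.
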
[...] the mathematical justification for
[...] classes of elements, there is
[...] at least consistent empirical
[...]. The work of Hubel
and Wiesel (1962, 1965) has demonstrated
the [...] of special bar or line detectors
and [...] angle detectors in the visual
[...] of the cat. Studies using stabilized
[...] the human retina suggest that
[...] lines, (angles), and planes
[...] function as independent visual
[...] since they seem to disappear and
then [...] in an all-or-none fashion
(Pritchard, Heron, & Hebb, 1960; Riggs &
[...]). The recent work of Kaufman
(Kaufman & Richards, 1969; Richards
& Kaufman, 1969) has demonstrated that
subjects tend to fixate on the center
of [...] of a geometric figure. This
[...] is consistent with both the concept
of the enclosed area as a higher order
[...] element and with the model's
[...], presented later, that
a figure's symmetry is often
evaluated about its center of gravity.

The [...] assumption of sequential
[...] elements is consistent with the
[...] of perception proposed by
McFarland (1965, 1968). McFarland presents
[...] that human perception of a
geometric figure consists of analysis or
sequential [...] perception and then integration
[...] a perception of the whole
figure. Much of this research also implies
a [...] perceptual coding in which
each level of the hierarchy is a temporally
later and more abstract stimulus property.
Theoretical arguments for a hierarchically
organized model of probabilistically
[...] elements have been presented
by Dodwell (1961). In addition, a
recent [...] of the perception of sequential
[...] by Vitz and Todd (1969) has
[...] the usefulness of the stochastic
sampling of hierarchically coded elements
in [...] the perceptual process.

The complexity of a figure unadjusted
for presumed redundancies in the sampling
process, which will be discussed later, is
defined as the sum of the number of trials,
$T_i$, taken to sample the elements of
each class. In the case of two-dimensional
figures, the unadjusted complexity
equals $\sum_{i=1}^{3} T_i$, where $T_1$ equals the trials to
sample the lines, $T_2$ for angles, and $T_3$ for
areas. The actual value of $T_i$ for any
particular set of probabilities is gotten by 
using a Monte Carlo method embodied 
within a simple computer program.4 Thus
the computer goes through the sampling 
process that the human visual system is 
portrayed as using. This sampling process 
must have a stop rule, since without it the 
process would go on indefinitely. The 
rationale for the stop rule is that the sam- 
pling stops after some fixed number of 
trials without sampling a new element. 
This number, symbolized SRC (stop rule 
criterion), is estimated from the data as 
described below. The perceptual interpre- 
tation of SRC is that the perceiver samples 
the elements of the figure until there has 
been a failure to detect a new element after 
SRC samples. Obviously the perceiver's 
certainty of having "seen" all the elements 
increases as the stop rule criterion increases. 
The major effect of SRC is to stop the sam- 
pling process before it spends inordinate 
numbers of trials "looking" for the last 
one or two extremely small elements. For 
this stop rule to work, it is also assumed 
that each element when sampled must be 
identified uniquely, and we assume this is 
done by tagging each element with its 
orientation and/or specific location. This 
information which is needed for the process 
to know if it has previously sampled an 
element is presumably related to the loca- 
tion of the sampled line in the population 
of line-sensing neural elements. In the 
case of K equally probable elements where
K < 40, $T_i$, with SRC = 75, is approximately
an increasing straight-line function of K;
when SRC is smaller, the slope of the line is
less.
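
The authors' program is not reproduced here, but the sampling process with its stop rule can be sketched as follows. This is an illustration under stated assumptions, not the original implementation: the function names are hypothetical, and $T_i$ is taken to be the trial on which the last new element was found, so the SRC unproductive trials that trigger the stop are not counted. That reading is an assumption; it is suggested by the Table 1 value of 8.31 for four equally probable angles, which is close to the expected 8.33 trials for sampling four equally likely elements.

```python
import random


def trials_to_sample(probs, src=75, rng=None):
    """One sampling run over the elements of a single class.

    Elements are drawn independently with replacement until `src` consecutive
    trials have produced no new element. Returns the trial number on which the
    last new element was sampled (an assumed definition of T_i).
    """
    if rng is None:
        rng = random.Random()
    indices = range(len(probs))
    seen = set()
    trial = 0
    last_new = 0
    # Stopping early once every element has been seen does not change the
    # returned value, because only the trial of the last new element is kept.
    while len(seen) < len(probs) and trial - last_new < src:
        trial += 1
        element = rng.choices(indices, weights=probs)[0]
        if element not in seen:
            seen.add(element)  # "tag" the element so that repeats are recognized
            last_new = trial
    return last_new


def mean_trials(probs, src=75, runs=3000, seed=0):
    """Average T_i over many independent runs (3,000 runs is the figure given in the text)."""
    rng = random.Random(seed)
    return sum(trials_to_sample(probs, src, rng) for _ in range(runs)) / runs


# Four equally probable angles, as for a rectangle.
rectangle_angles = mean_trials([0.25, 0.25, 0.25, 0.25])
```

With four equal probabilities the estimate should land near 8.3. Lowering `src` makes the process more likely to stop before a very improbable element is found, which is the role the text assigns to the stop rule.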

4 Copies of the computer program are available
from the first author.

There is the alternative of assuming the
process knows when all elements have been
sampled and then quits. Although this
assumption is obviously unreasonable, it
does allow one to represent the process by
the expected number of trials, E(T), for
a fixed set of probabilities. In the case in
which a class of elements is composed of
K equally probable elements, the expected
number of trials to sample this class, E(T),
is given by the following simple expression:

$$E(T) = K\left(1 + \frac{1}{2} + \frac{1}{3} + \cdots + \frac{1}{K}\right)$$

(Uspensky, 1937, p. 181).
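
The equal-probability expression is easy to evaluate directly. The sketch below uses a hypothetical function name and exact fractions to avoid rounding.

```python
from fractions import Fraction


def expected_trials_equal(k):
    """E(T) = K (1 + 1/2 + ... + 1/K) for K equally probable elements."""
    if k < 1:
        raise ValueError("k must be a positive integer")
    return k * sum(Fraction(1, j) for j in range(1, k + 1))


four_elements = expected_trials_equal(4)   # exact fraction
as_decimal = float(four_elements)          # about 8.33
```

For K = 4 the result is 25/3, which can be compared with the Monte Carlo value reported in Table 1 for the four equal angles of the medium rectangle.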

In the more common case where the ele- 
ments are not equally probable, obtaining 
the expression for the expected number of 
trials requires some elementary matrix 
algebra and Markov chain theory.5 The 
solution was developed and brought to our 
attention by Michael V. Levine and it is 
satisfactory when the number of elements, 
K, is small, since the number of terms in 
the matrix of the Markov chain for E(T) 
is of the order of $2^K - 1$. For example, in
the case of three elements a, b, and c with
probabilities of p, q, and r, respectively, the
total expected time before sampling all
three, E(T), is:

$$E(T) = 1 + \frac{p}{1-p} + \frac{q}{1-q} + \frac{r}{1-r} + \frac{pq(1+r)}{r(1-p)(1-q)} + \frac{pr(1+q)}{q(1-p)(1-r)} + \frac{rq(1+p)}{p(1-r)(1-q)}$$


Obviously when K is much larger than 5 or 
6, the procedure is too cumbersome for 
hand computation. 
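
As a check on the three-element case, the sketch below evaluates one standard way of writing the expectation from the Markov chain argument (first trial, then the wait in each one-element state, then the wait for the last element) and compares it with a general inclusion-exclusion sum over all nonempty subsets. Both function names are hypothetical, and the inclusion-exclusion form is a swapped-in, well-known alternative rather than the solution credited in the text; it has $2^K - 1$ terms, which illustrates why the computation grows quickly with K.

```python
from fractions import Fraction
from itertools import combinations


def expected_trials_three(p, q, r):
    """Expected trials to sample all of three elements with probabilities p, q, r."""
    singles = 1 + p / (1 - p) + q / (1 - q) + r / (1 - r)
    pairs = (
        p * q * (1 + r) / (r * (1 - p) * (1 - q))
        + p * r * (1 + q) / (q * (1 - p) * (1 - r))
        + r * q * (1 + p) / (p * (1 - r) * (1 - q))
    )
    return singles + pairs


def expected_trials_general(probs):
    """Inclusion-exclusion over all nonempty subsets S: sum of (-1)^(|S|+1) / P(S)."""
    total = 0
    for size in range(1, len(probs) + 1):
        sign = 1 if size % 2 else -1
        for subset in combinations(probs, size):
            total += sign / sum(subset)
    return total


unequal = (Fraction(1, 2), Fraction(3, 10), Fraction(1, 5))
markov_value = expected_trials_three(*unequal)
subset_value = expected_trials_general(unequal)
```

With equal probabilities of 1/3 both functions reduce to the equal-probability value 3(1 + 1/2 + 1/3) = 5.5.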

We prefer the Monte Carlo method be- 
cause it is fast, conceptually simple, based 
on a more plausible perceptual assumption, 
and because the stop rule parameter keeps 
the process from continuing for large num- 
bers of trials just to pick up the last small 
element. 

The procedure for estimating the size of 
SRC is simply to select two values—one 
high, one low, for example, 100 and 25— 
and move in by jumps of 5 or 10 until 
that value is found which gives the best 
fit between the predicted values and the 
data. In all experiments discussed here, 
the best estimation of SRC was 75. We 
expect this value of 75 will be typical for 
experiments in which the subject controls
the length of time the stimulus is viewed
before making a simple judgment. Of
course the best fit SRC value would be
smaller if the presentation were shortened
tachistoscopically and/or if the experimenter
introduced various costs associated
with length of viewing time or with errors
in reproducing or identifying the figure.

5 The general rationale for finding E(T) can be
found in Kemeny and Snell (1960), especially
Chapters II and III.

There is another question raised by the
sampling process: How many sample runs
are needed before the estimate $T_i$ is assumed
to be stable? That is, one sampling
run to criterion can give a very unrepresentative
estimate of $T_i$, especially for
figures with many elements or with very
discrepant probabilities. Throughout this
paper, we used 3,000 runs for all $T_i$ values,
although it is likely several hundred runs
would be sufficient, since with 3,000 runs
the standard deviation of the $T_i$ value was
always less than 1%. In either case,
we must assume that the perceptual system
also makes large numbers of runs. Perhaps
there are large numbers of independent
subsets of line-sensing elements, each of
which can be considered an “urn,” and the
number of trials or time to sample all or
most of them can be considered as the
analog to our estimate of $T_i$. That something
like this is likely is argued by Ganz
(1969). He discusses and presents physiological
evidence that the visual cortex
of the cat must have a great deal of redundancy,
since severe cortical loss still allows
the animal to function effectively on visual
discrimination tasks.

Examples of arrays of probabilities an 
the obtained Monte Carlo trials 7; te 
sample them are shown in Table 1. , 

The Preceding definition of the sampling 
Process assumed that each element Was 
sampled independently of all other elements 
and with replacement. On intuitive groun 
It seems extremely unlikely that such 6" 
assumption of complete independence 2 
Justified. For example, consider a squat 
and a simple trapezoid—in Figure 4, Nd 
and 16. The number of independent Um 
to sample each pattern is about the sam" 
Since both figures have the same num)’ 
of lines and angles, and the elements ar 
hot that different in size, This implies us 


TABLE 1
EXAMPLES OF ARRAYS OF SAMPLING PROBABILITIES AND TRIALS TO SAMPLE,
T_i, FOR THE LINES AND ANGLES OF STIMULUS FIGURES
IN EXPERIMENT I

No. 3, medium rectangle
    Line probabilities:  (.1547, .1547, .3453, .3453)                  T_1 = 10.52
    Angle probabilities: (.25, .25, .25, .25)                          T_2 = 8.31
No. 5, isosceles triangle
    Line probabilities:  (.2062, .3969, .3969)                         T_1 = 6.28
    Angle probabilities: (.1666, .4167, .4167)                         T_2 = 7.04
No. 11, "stretched" hexagon
    Line probabilities:  (.0933, .0933, .0933, .0933, .3134, .3134)    T_1 = 22.39
    Angle probabilities: (.1667, .1667, .1667, .1667, .1667, .1667)    T_2 = 14.75
No. 16, simple trapezoid
    Line probabilities:  (.1372, .2481, .2782, .3365)                  T_1 = 9.99
    Angle probabilities: (.1667, .2083, .2917, .3333)                  T_2 = 9.38
No. 19, arrow
    Line probabilities:  (.1081, .1081, .1081, .1081, .2838, .2838)    T_1 = 19.44
    Angle probabilities: (.0792, .0792, .1389, .1708, .1708, .3611)    T_2 = 21.48
No. 22, building
    Line probabilities:  (.0263, .0368, .0790, .0790, .1263, .1579, .2316, .2631)
                                                                       T_1 = 43.53
    Angle probabilities: (.0833, .0833, .0833, .0833, .0834, .0834, .2500, .2500)
                                                                       T_2 = 29.47
No. 25, "saw edge"
    Line probabilities:  (.0410, .0410, .0410, .0525, .0525, .0702, .1170, .1170,
                          .2047, .2631)                                T_1 = 50.42
    Angle probabilities: (.0312, .0416, .0416, .0416, .0625, .0625, .0950, .2080,
                          .2080, .2080)                                T_2 = 52.81


they are quite close in complexity.
Subjectively, however, the square is considered
simpler, and its simplicity seems
due to the fact that the sides and angles
are identical. This property is closely
related to the symmetry of the pattern.
It seems plausible that identical elements
are sampled with greater efficiency,
and that some sort of redundancy
should be included in the representation of the
sampling process. However, the present
sampling process becomes unwieldy if
element redundancy
is incorporated directly in the
sampling probabilities. Therefore, what
is needed is a plausible rationale for some
simple change in the representation of the
sampling process that adjusts for pattern
redundancy. This has been handled by
the following rationale.

The approach is to evaluate the
redundancy of a figure on the assumption
that its degree of symmetry adequately
reflects its redundancy. Symmetry
is evaluated about a set of vertical
and horizontal axes. (Other axes of symmetry
are not treated.) Since it is assumed
that symmetry is evaluated independently
and in the same way about each axis,
it will suffice to present the procedure for
evaluating vertical symmetry.


Definition 1: Vertical axis of symmetry. 
A figure's vertical axis of symmetry is a 
vertical line drawn through the figure's 
center of gravity which allows all elements, 
that is, all lines, angles, and areas of 
the figure, to map into each other about 
the axis. Common figures with this very 
simple kind of symmetry are shown in 
Figure 4, No. 1-5, 9-11, 13-15, 17, 20, and 
21. Figures (in Figure 4) which also show
horizontal symmetry are No. 1, 3, 4, 9, 11,
13, 15, 19, 20, and 21.

Definition 2a: Degree of symmetry, S, for 
figures with vertical and horizontal symmetry. 
A figure's degree of redundancy, S, is its 
number of axes of symmetry plus one. 
Thus, all two-dimensional figures with 
both a vertical and horizontal axis of 
symmetry have the maximum degree of 
redundancy of three. 

Definition 3a: Two-dimensional figure complexity.
The complexity of a symmetric
two-dimensional figure is the sum of the
number of trials to sample the lines, angles,
and areas of the figure, $\sum_{i=1}^{3} T_i$, divided by S.

This definition of complexity assumes that
the redundancy in sampling a figure can
be taken into account by the simple correction
of dividing by the number of axes of
symmetry plus one. (Adding one to the
number of axes avoids dividing by zero
in the case of completely asymmetric
figures.) Thus, a figure with one axis of
symmetry is divided by two and is assumed
to be 50% redundant.

Fig. 2. Example of double mirror-image symmetry.

Definition 4: Vertical axis of double
mirror-image symmetry. A vertical axis
through the center of gravity of a figure
creates double mirror-image symmetry if
one of the two halves of the figure, when
rotated about the horizontal axis, creates a
new figure with simple or mirror-image
symmetry. The parallelogram, Figure 2,
is a familiar example of this kind of symmetry.
Other examples of this kind of
symmetry shown in Figure 4 are No. 6
(rhombus) and 23. In all cases of which
we have any knowledge, figures with vertical
double mirror-image symmetry also
have horizontal double mirror-image
symmetry.

Definition 2b: Degree of symmetry, S, for
figures with double mirror-image vertical and
horizontal symmetry. Each axis of double
mirror-image symmetry counts as one half
an axis of mirror-image symmetry. For
example, a rhombus or parallelogram each
have an S of 1/2 + 1/2 + 1 or 2.00. The
rationale for the choice of 1/2 is that this
kind of symmetry is obviously less than
mirror-image symmetry. In addition two
folds about the axis of symmetry are needed
to get complete mapping, while only one
fold is needed for mirror-image symmetry.

The two-dimensional complexity of a
double mirror-image symmetric figure is the
same as Definition 3, that is, $C = \sum_{i=1}^{3} T_i / S$.
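Definitions 2a, 2b, and 3a can be illustrated with a short sketch. The function and parameter names are illustrative, not from the paper; the T values for the rhombus are the entries for Figure 4, No. 6 in Table 3, and the area term is left out, as it is in that table.

```python
def degree_of_symmetry(mirror_axes=0, double_mirror_axes=0):
    """S = axes of symmetry plus one; a double mirror-image axis counts one half."""
    return 1 + mirror_axes + 0.5 * double_mirror_axes

def complexity(trials, s):
    """Sum of the trials to sample each class of elements, divided by S."""
    return sum(trials) / s

# A figure with a vertical and a horizontal mirror axis reaches the maximum S.
max_s = degree_of_symmetry(mirror_axes=2)

# Rhombus: two double mirror-image axes, T = 8.31 (lines) and 9.89 (angles).
rhombus_s = degree_of_symmetry(double_mirror_axes=2)
rhombus_c = complexity([8.31, 9.89], rhombus_s)

if __name__ == "__main__":
    print(max_s, rhombus_s, rhombus_c)
```

Dividing the summed T values by S gives about 9.10 for the rhombus; Table 3 lists 9.11 because it adds the two components after rounding each one.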

Many figures which are not perfectly
symmetric about one or both axes nevertheless
have a great deal of redundancy
which will be considered as a kind of symmetry
called partial symmetry. The evaluation
of partial symmetry is more involved,
yet the basic idea is still closely
related to simple symmetry. Again we will
consider only partial vertical symmetry
since both axes are treated in the same
manner.

Definition 5: Partial vertical symmetry.
This symmetry is evaluated by placing a
vertical axis on the figure so as to maximize
the proportion of corresponding elements
which map into each other, or, if no elements
map, then the vertical axis is placed
so as to maximize the proportion of corresponding
elements which are identical with
respect to their size or orientation.

Definition 6: Corresponding elements.
Elements on either side of a vertical axis
are corresponding if they are the same
number of elements in distance from the
same point of intersection of the vertical
axis with the figure. If the distance of an
element can be measured from more than
one point of intersection of the vertical
axis, then it is measured only from the
nearest point of intersection. If an element
should have more than one corresponding
element, then it is tested against all corresponding
elements until one, but no more
than one, identity of size or orientation is
found, if there is any. Corresponding elements
are shown in Figure 3.

Definition 7: Point of intersection. Any
point where the vertical or horizontal axis
intersects the figure is a point of intersection.
Examples are marked by arrows in
Figure 3.

Definition 8: The size of an element. This
was defined above in the discussion of the
sampling probabilities.

Definition 9a: The orientation of a line 
is the angle of intersection of the line with 
the vertical axis. 

Definition 9b: The orientation of an angle 
is the orientation of the line which bisects 
the angle. 

Definition 9c: The orientation of an area
is the orientation of the area considered as
a plane in the third dimension. For all
two-dimensional figures, the orientations of
areas are the same. In the case of two-dimensional
figures with more than one
area, this causes a small problem. In such
cases it is assumed that the correspondence
of two-dimensional areas is evaluated
on the dimension of size and also shape. Adding
shape keeps two characteristics, as is the
case with all other elements, and allows
only perfectly symmetric figures to have
maximum S values of 3.

Evaluation of partial symmetry with special
reference to corresponding elements
is shown in Figure 3.

Fig. 3. Examples of partial symmetry. (Dotted lines are axes of partial symmetry; identical
letters, for example, a and a', designate corresponding lines; arrows denote points of intersection with
axis.)

Definition 10: Degree of correspondence.
On the dimension of element size, the degree
of correspondence is the proportion of
corresponding elements that are identical
in size; the degree of correspondence on
the dimension of element orientation is the
proportion of corresponding elements identical
in orientation.

Definition 2c: Degree of symmetry, S, for
figures with partial symmetry. In this case,
S is not necessarily the same for both vertical
and horizontal symmetry; nor is S
necessarily the same for each class of elements.
For example, the redundancy of a figure's
lines may differ from the redundancy of its
angles. For this reason, S is evaluated
for each class of elements separately in
order to give the trials to sample each
class an appropriate redundancy estimate.
The S_i for the lines is symbolized
S_1, for the angles S_2, etc. For the lines, S_1
equals the degree of correspondence of the
lines about the vertical axis plus
the degree of correspondence of the lines
about the horizontal axis plus one; S_2 is
defined analogously for the angles.

Definition 3b. The two-dimensional complexity
of a figure with partial symmetry
is $\sum_{i=1}^{3} (T_i / S_i)$. The slight difference between
this definition and 3a is that S is
subscripted, symbolizing the measure of
redundancy specific to the T_i for each
class of elements. Because it is more
general, $\sum_{i=1}^{3} (T_i / S_i)$ typically will be used
as the formula for complexity.
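The general formula can be checked against a row of Table 3 with a few lines of code. This is only a sketch: the function name is illustrative, and the area term is omitted, as it is in Table 3.

```python
def figure_complexity(trials, symmetries):
    """Sum of T_i / S_i over the classes of elements (lines, angles, ...)."""
    if len(trials) != len(symmetries):
        raise ValueError("need one S_i for every T_i")
    return sum(t / s for t, s in zip(trials, symmetries))

# Figure 4, No. 16 (simple trapezoid), values from Table 3:
# lines T_1 = 9.99 with S_1 = 1.75, angles T_2 = 9.38 with S_2 = 1.00.
trapezoid = figure_complexity([9.99, 9.38], [1.75, 1.00])

if __name__ == "__main__":
    print(round(trapezoid, 2))
```

The result rounds to the total of 15.09 listed for No. 16 in Table 3.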

Table 2 presents degree of correspondence
and S_i for the partially symmetric
figures shown in Figure 4. A little study of
it along with the examples discussed below
will quickly familiarize the reader with this
part of the model.

Two examples of the computation of S 
for figures with partial symmetry will be 
given. Figure 3A has perfect vertical 
symmetry ; its partial horizontal symmetry 
is evaluated about the dotted horizontal 
axis. This axis has been set up so as to 
bisect the two nonhorizontal sides. These 
bisected or split elements are special cases 
with a special rationale for their treatment 
in computing S. Obviously the two parts 
of a split line always have the same orienta- 
tion and they are always corresponding 
elements. This artifact should not con- 
tribute to a measure of figure redundancy, 
and therefore the model does not include 
split elements in estimating S unless there 
is another line, or element, also split by the 
same axis that has the same orientation. 
This is not the case in Figure 3A where the
two split lines have different orientations.
If they had had the same orientation, then
the lines would have been treated as 2/3
redundant on the dimension of orientation.
In addition, an axis can always be put
through a line so as to bisect it and thus
to automatically set up two parts that correspond
and are identical in length. Again
in evaluating split elements, the model
ignores this artifactual matching unless the
same axis also bisects another line or element.
In Figure 3A this is the case; thus
for lines on the dimension of size, the
figure is 2/3 redundant and its total horizontal
line redundancy, S_1h, is (.00 + .67)/2 or
.33, and therefore the T_1 to sample the
lines of Figure 3A is divided by the S_1 of
1.00 (vertical symmetry) + .33 (horizontal
symmetry) + 1.00, which equals 2.33 (see
Table 2).
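The arithmetic of this example can be sketched in code. The sketch assumes the degrees of correspondence have already been determined by hand, including the special treatment of split elements; it only reproduces the averaging and summing steps, and the function names are illustrative.

```python
def axis_redundancy(size, orientation):
    """Mean of the size and orientation correspondence about one axis."""
    return (size + orientation) / 2

def degree_of_symmetry(vertical, horizontal):
    """S_i = vertical redundancy + horizontal redundancy + 1.

    Each argument is a (size, orientation) pair of proportions between 0 and 1.
    """
    return axis_redundancy(*vertical) + axis_redundancy(*horizontal) + 1.0

# Lines of Figure 3A: perfect vertical mapping; horizontal size
# correspondence of .67 and orientation correspondence of .00.
s_lines_3a = degree_of_symmetry((1.00, 1.00), (0.67, 0.00))

# Lines of Figure 4, No. 25, using the percentages in Table 2.
s_lines_25 = degree_of_symmetry((0.80, 1.00), (0.50, 0.80))

if __name__ == "__main__":
    print(s_lines_3a, s_lines_25)
```

The first value is about 2.335, which the text reports as 2.33; the second matches the 2.55 listed in Table 2.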


In Figure 3B, both axes for evaluating
partial symmetry are shown. None of
Figure 3B's eight lines are of the same size
when matched with their corresponding
elements about either axis. (Note: The
split elements do not count here since no
axis bisects two lines.) However, all of
the lines match their corresponding line in
orientation about both axes. (Note: The
split elements count since both lines split
by each axis have the same orientation.)
Thus, both S_1v and S_1h = (.00 + 1.00)/2
= .50; and S_1 = .50 + .50 + 1.00 = 2.00
(see Table 2).

When evaluated about the vertical axis
on size, four of the angles of Figure 3B are
identical in size, and four are identical in
orientation. The same is true when evaluated
about the horizontal axis. Thus S_2v
and S_2h both equal (.50 + .50)/2 and

TABLE 2
DEGREE OF CORRESPONDENCE AND DEGREE OF SYMMETRY, S_i, FOR THE LINES AND ANGLES OF
STIMULUS FIGURES WITH PARTIAL SYMMETRY USED IN EXPERIMENT I

                      Lines                                       Angles
No.   Vertical          Horizontal        S_1     Vertical          Horizontal        S_2
      (size, orien.)    (size, orien.)            (size, orien.)    (size, orien.)
2     100, 100 (maps)   67, 0             2.33    100, 100 (maps)   67, 0             2.33
5     100, 100 (maps)   67, 0             2.33    100, 100 (maps)   0, 0              2.00
7     0, 0              67, 0             1.33    0, 0              0, 0              1.00
10    100, 100 (maps)   80, 0             2.40    100, 100 (maps)   80, 0             2.40
12    0, 0              67, 0             1.33    0, 0              0, 0              1.00
14    100, 100 (maps)   86, 0             2.43    100, 100 (maps)   86, 0             2.43
16    0, 50             50, 50            1.75    0, 0              0, 0              1.00
17    100, 100 (maps)   75, 100           2.87    100, 100 (maps)   50, 50            2.50
18    0, 50             50, 50            1.75    0, 0              0, 0              1.00
19    100, 100 (maps)   100, 100 (maps)   3.00    0, 33             100, 100 (maps)   2.16
22    0, 100            0, 100            2.00    50, 50            50, 50            2.00
24    0, 0              0, 0              1.00    0, 0              0, 0              1.00
25    80, 100           50, 80            2.55    20, 60            0, 60             1.70
26    0, 0              0, 0              1.00    0, 0              0, 0              1.00
27    0, 0              0, 0              1.00    0, 0              0, 0              1.00
28    25, 25            25, 50            1.62    0, 0              0, 0              1.00


Fig. 4. The stimulus figures used in Experiment I. (Figures are in
order of increasing judged complexity.)


S_2 = .50 + .50 + 1.00 = 2.00 (see Table 2).


In discussing these computations, the
reader should recall that angle orientation
is the orientation of the line bisecting the
angle.

In the case of two-dimensional figures
with one area, the model does not
evaluate S for the area. In such cases T_3 = 1.00.

The preceding discussion of the redundancy
evaluation for partially symmetric
figures may seem complex and/or difficult.
In fact, with a little practice it becomes 
quite routine. In addition, it is conceptu- 
ally a straightforward extension of the 
notion of perfect mapping of elements to 
figures where some of the elements map 
on only one of the relevant dimensions. 
The formal presentation of the model
may have disguised some of its simple
fundamental characteristics. First, the
complexity of a figure is determined by
three basic properties (everything else
equal): (a) the more elements in a figure,
the greater its complexity; (b) the more
the elements differ in size, the greater its
complexity; and (c) the greater the symmetry,
the less the total complexity. These
three properties can often allow one to
decide which of two figures the model will
predict as more complex prior to actual
computation of the T_i values.
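Properties (a) and (b) can be checked without simulation if one assumes, as a stand-in for the Monte Carlo procedure, that T_i is the expected number of draws needed to sample every element at least once. The sketch below computes that expectation exactly by inclusion-exclusion; the function name and the stopping assumption are illustrative and are not taken from the paper.

```python
from itertools import combinations

def expected_trials(probs):
    """Exact expected number of draws (with replacement) until every
    element has been drawn, computed by inclusion-exclusion over subsets."""
    total = 0.0
    for k in range(1, len(probs) + 1):
        sign = 1 if k % 2 else -1
        for subset in combinations(probs, k):
            total += sign / sum(subset)
    return total

three_equal = expected_trials([1 / 3] * 3)      # three equal elements
four_equal = expected_trials([0.25] * 4)        # four equal elements
four_unequal = expected_trials([0.1547, 0.1547, 0.3453, 0.3453])

if __name__ == "__main__":
    print(three_equal, four_equal, four_unequal)
```

Under this assumption, adding an element raises the expected trials (property a), and making the four probabilities unequal raises it again (property b). Property (c) then follows from dividing by a larger S.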

According to the model, the simplest
closed geometric figure (see Table 4) is
the equilateral triangle. This receives some
support in the work of Hanes (1950),
Bitterman, Krauskopf, and Hochberg
(1954), and Borg (1964).

In terms of the traditional Gestalt and
S-R interpretations of figure perception,
this model is something of a mixture. In
an important sense, the model does claim
that perception is equal to the sum of its
parts. Both the parts and the process of
summing them are central to the model.
On the other hand, the selection of simple,
coded elements and their organization in a
hierarchy has strong similarities to the
Gestalt emphasis and is quite different from
the normal S-R or elementist position.


SUPPORTING DATA: TWO-
DIMENSIONAL FIGURES


In specifying the two-dimensional complexity
of any simple geometric figure, the
model predicts judgments of the complexity
of different figures. Experiment I was
designed to test this property of the model.


TABLE 3
JUDGED COMPLEXITY (MEAN RANK) AND BREAKDOWN OF THE COMPLEXITY EVALUATION
OF THE STIMULUS FIGURES USED IN EXPERIMENT I

Stimulus            Lines                       Angles              Total        Judged
number                                                              complexity   complexity
(Figure 4)   T_1     S_1    T_1/S_1      T_2     S_2    T_2/S_2     Σ(T_i/S_i)   (mean rank)
 1            8.31   3.00    2.77         8.31   3.00    2.77         5.54         2.00
 2            5.48   2.33    2.35         5.48   2.33    2.35         4.70         2.85
 3           10.52   3.00    3.47         8.31   3.00    2.77         6.24         3.50
 4           15.32   3.00    5.11         8.31   3.00    2.77         7.88         4.30
 5            6.28   2.33    2.70         7.04   2.00    3.52         6.22         4.75
 6            8.31   2.00    4.16         9.89   2.00    4.95         9.11         8.35
 7            5.94   1.33    4.46         6.67   1.00    6.67        11.13         8.95
 8           11.98   2.00    5.99        12.50   2.00    6.25        12.24         9.15
 9           14.75   3.00    4.92        14.75   3.00    4.92         9.84        11.40
10           11.36   2.40    4.73        11.36   2.40    4.73         9.46        12.10
11           22.39   3.00    7.46        14.75   3.00    4.92        12.38        12.50
12            6.71   1.33    5.05        10.44   1.00   10.44        15.49        12.85
13           21.90   3.00    7.30        21.90   3.00    7.30        14.60        13.15
14           18.05   2.43    7.43        18.05   2.43    7.43        14.86        13.75
15           30.98   3.00   10.33        21.90   3.00    7.30        17.63        14.50
16            9.99   1.75    5.71         9.38   1.00    9.38        15.09        14.70
17           25.60   2.87    8.92        29.04   2.50   11.62        20.54        15.60
18           11.14   1.75    6.37        14.50   1.00   14.50        20.87        15.95
19           19.41   3.00    6.47        21.49   2.16    9.95        16.42        17.75
20           17.39   3.00    5.80        32.11   3.00   10.70        16.50        18.15
21           21.90   3.00    7.30        60.81   3.00   20.27        27.57        20.00
22           43.54   2.00   21.77        29.47   2.00   14.74        36.51        20.35
23           22.38   2.00   11.19        37.45   2.00   18.73        29.92        22.10
24           17.33   1.00   17.33        17.96   1.00   17.96        35.29        22.90
25           50.42   2.55   19.77        52.81   1.70   31.06        50.83        25.05
26           25.72   1.00   25.72        35.19   1.00   35.19        60.91        25.35
27           38.60   1.00   38.60        34.91   1.00   34.91        73.51        26.50
28           23.29   1.62   14.38        42.43   1.00   42.43        56.81        27.39

Note. The contribution due to the area of the figure has been omitted since it is negligible and constant for each figure.


Experiment I
Method 


Subjects. The subjects were 20 adults, 10 male
and 10 female, with a median age of 22 and an age
range of 15-48; they were undergraduate and graduate
students, secretaries, and technicians.
Stimuli. The 28 geometric figures used as stimuli
are shown in Figure 4. These particular figures
were selected on the general intuitive grounds that
they represented a wide variety of different features.
All figures had the same area and were thus of
approximately equal perceived size. Each figure
was photographed on glossy paper, cut out inside of
a circle, and covered with clear plastic to
protect the photograph and to aid in handling.
The figures were oriented as shown in Figure 4.
Orientation was designated by a small dot at the
bottom of the card.

Procedure. After looking through the deck of 28
stimulus disks, the subject was asked to divide the
deck into two piles: the 14 stimuli he judged to
be simpler and the 14 he judged to be more
complex. The subject was allowed to use any definition
of simple or complex he wished. The subject
was then asked to rank the 14 simpler from simplest
(Rank 1) to most complex (Rank 14). The first 10
of these ranks were recorded, and the corresponding
stimuli permanently set aside. The remaining 4
stimuli were mixed in with the original 14 more
complex stimuli, and from these 18 stimuli, the
subject was asked to temporarily put aside the
four most complex stimuli. The remaining 14
stimuli were then ranked from simplest (Rank 11)
to most complex (Rank 24). Ranks 11-20 were
recorded and the corresponding stimuli permanently
set aside. The remaining four stimuli plus the four
set aside as most complex (Ranks 21-28) were then
ranked. This modified ranking procedure, taking
about 15 to 20 minutes, was not difficult, and by
breaking down the task into smaller sets, aided the
subject in his rankings.


Results 


The major results are shown in Table 3
and in Figure 5. Table 3 presents a summary
of the complexity of the lines and
angles of each of the 28 figures shown in
Figure 4. Table 3 also shows the total
complexity of each figure based on a stop
rule criterion of 75 and the obtained judged
complexity (mean rank). Figure 5 is a
graph of the relation between the total
(predicted) and the judged (obtained) complexity
values. To get a quantitative estimate
of the relation shown in Figure 5, we
computed a product-moment correlation
between log10 of the predicted complexity
value and the judged complexity. This
correlation was .97 and obviously represents
considerable support for the model.

Fig. 5. Results of Experiment I. (Judged complexity [mean rank, N = 20]
as a function of predicted figure complexity.)
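The product-moment correlation between log10 predicted complexity and judged complexity can be computed as in the sketch below. For brevity it uses only five rows of Table 3 (No. 1, 9, 16, 22, and 26), so its result is illustrative and is not the .97 reported for all 28 figures; the function name is also an illustrative choice.

```python
import math

def pearson_r(xs, ys):
    """Product-moment correlation between two equal-length sequences."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)

# Total predicted complexity and judged mean rank for five rows of Table 3.
predicted = [5.54, 9.84, 15.09, 36.51, 60.91]
judged = [2.00, 11.40, 14.70, 20.35, 25.35]

r_subset = pearson_r([math.log10(c) for c in predicted], judged)

if __name__ == "__main__":
    print(r_subset)
```

Taking the logarithm of the predicted values compresses the upper end of the scale, which is the correction for the ceiling on mean ranks that the text describes for Experiment II.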
A possible correction in the model may
be implied by a small discrepancy in the
results of Experiment I. In Figure 4,
No. 22, which consists of entirely vertical
and horizontal lines, seems to have a predicted
value that is slightly too large (see
Table 3). There is considerable evidence
that humans perceive vertical and horizontal
lines with greater efficiency than
lines at other orientations (Campbell,
Kulikowski, & Levinson, 1966; Higgins &
Stultz, 1948). The model assumes all
orientations are sampled with equal probability
and this may need correction.

Experiment II

A large number of the figures used in
Experiment I were highly symmetric. In
Experiment II, 18 primarily asymmetric
figures were used to test again the model's
capacity to predict judgments of figure
complexity.

Fig. 6. The stimulus figures used in Experiment II. (Figures are in
order of increasing judged complexity.)


Method

Subjects. The 20 subjects, 10 males and 10
females, ranged in age from 12 to 70; median age 25.
They were selected to get a variety of ages and
professional backgrounds.

Stimuli. The stimulus figures are shown in
Figure 6. Eight of these figures, No. 1, 3, 5, 7, 8,
11, 12, and 14, are No. 12, 18, 24, 22, 26, 25, 27,
and 28 of Figure 4 used in Experiment I. The remaining
10 figures, No. 2, 4, 6, 9, 10, 13, 15, 16, 17, and
18, are Vanderplas shapes; their Vanderplas numbers
are as follows: Figure 2, No. 9 and 25; Figure 3,
No. 1 and 28; Figure 4, No. 5 and 11; Figure 5,
No. [?]; Figure 7, No. 2 and 12 (see Vanderplas
& Garvin, 1959, pp. 149-151). The figures were
drawn with equal areas, photographed, cut
out in a circle approximately 7 inches in diameter,
and covered with clear plastic. The orientation,
as shown in Figure 6, was designated by a dot on
the plastic cover.
Procedure. A modified ranking procedure similar
to that of Experiment I was used. The subjects were
given the set of 18 figures, in a scrambled order,
and told to divide them into two sets: the
nine most simple and the nine most complex. They
could use any definitions of simple or complex they
wished. Next they ranked the nine simplest shapes
in order from simplest to most complex. The first
six figures (ranks) were recorded; the other three
were returned to the set of the nine most complex
figures. From this set of 12 figures, the subject
removed the three he judged as most complex. The
remaining nine figures were ranked from simplest
to most complex. The first six of these figures
(ranks) were recorded. The remaining three figures
plus the three judged as most complex formed the
final set of six which were then ranked. The
procedure lasted about 15-20 minutes.


Results


The major results are shown in Table 4
and Figure 7. Figure 7 shows that in
general the model predicts the judged complexity
values quite well. As in Experiment I,
log10 C values were used to correct
for the ceiling effect in the judgments. The
product-moment correlation between log10 C
and the judged complexity rank is .96.


TABLE 4
T_i FOR THE VANDERPLAS FIGURES USED IN EXPERIMENT II; TOTAL FIGURE COMPLEXITY,
LOG COMPLEXITY, AND JUDGED COMPLEXITY FOR ALL FIGURES USED IN EXPERIMENT II

Stimulus number                           Total figure                Judged complexity
(number from       Lines     Angles      complexity                  mean rank
Experiment I)      T_1       T_2         C = Σ T_i/S_i    Log10 C    (N = 20)
 1 (12)            a         a            15.49           1.190       [?]
 2                   9.25     11.30       20.55           1.313       2.70
 3 (18)            a         a            20.87           1.319       2.90
 4                   9.29     21.18       30.47           1.484       5.00
 5 (24)            a         a            35.29           1.548       5.40
 6                  16.57     17.54       34.11           1.533       6.50
 7 (22)            a         a            36.51           1.562       6.85
 8 (26)            a         a            60.91           1.785       9.25
 9                  27.79     26.15       53.94           1.732       9.40
10                  18.96     28.13       47.09           1.673       9.85
11 (25)            a         a            50.83           1.706      11.35
12 (27)            a         a            73.51           1.866      12.25
13                  49.90     61.51      111.41           2.047      12.65
14 (28)            a         a            56.81           1.754      12.70
15                  26.89     52.51       79.40           1.900      13.35
16                  95.07     98.17      193.24           2.286      16.00
17                  27.55     74.37      101.92           2.008      16.20
18                 119.76    141.25      261.01           2.417      17.95

Note. S = 1.00 for all Vanderplas figures.
a See Table 3 for these numbers and complexity values.


Brown's Random Polygons

One difficulty with the previous experiments
is that the representativeness or
generality of the two stimulus sets is hard
to evaluate. This is not the case, however,
for randomly generated shapes or polygons.
Of special relevance is the work of Brown
and his colleagues. Specifically, Brown
and Brumaghim (1968) have developed
two sets of random polygons scaled for
complexity. These two sets, A and B, are
shown with their scale values in Figure 8.5
Each set of 10 was selected from the results
of scaling studies using a large population
of randomly generated polygons. The
figures had 8, 12, 16, or 20 sides and
were selected to be approximately equidistant
on a unidimensional visual complexity
continuum. The present model
should be capable of predicting these scaled
complexity values. Table 5 presents the
$\sum T_i / S_i$ values predicted by
the model and each polygon's associated
scale value. Figure 9 shows the relation
between the model's predictions and the
scale values. In the case of Set A, the
correlation between predicted total complexity
and scale complexity value is .96;
for Set B the correlation is .99. (Log10 of
$\sum T/S$ was not used since the linear scale
did not have the ceiling problem of the
ranks found in Experiments I and II.)

5 Dr. Brown graciously provided examples of
his original figures.

Fig. 7. Results of Experiment II. (Judged complexity [mean rank, N = 20]
as a function of predicted figure complexity.)

Fig. 8. Set A and B stimulus figures of Brown and Brumaghim with scaled
complexity values. (Adapted from Brown & Brumaghim, 1968.)

TABLE 5
T_i, TOTAL FIGURE COMPLEXITY, AND SCALED
COMPLEXITY VALUE FOR BROWN'S SET A AND
SET B RANDOM POLYGONS

Stimulus    Lines     Angles    Total         Scaled
                                complexity    complexity value
A1           12.7      10.0       22.7         1.75
A2           [?]       [?]        [?]          2.60
A3           [?]       [?]        [?]          [?]
A4           26.6      45.9       72.5         3.75
A5           66.4      73.3      139.7         4.25
A6           [?]       69.8      116.5         4.65
A7           [?]       [?]        [?]          5.12
A8           [?]       [?]        [?]          5.50
A9           93.4     106.5      199.9         6.18
A10          96.4     120.8      217.2         6.68
B1           [?]       23.2       35.1         1.75
B2            9.9      35.0       44.9         2.67
B3           35.1      32.6       67.7         3.18
B4           29.4      61.8       91.2         3.70
B5           65.2      [?]       125.0         4.28
B6           [?]       74.3      123.7         4.72
B7           [?]       [?]       159.8         4.98
B8           96.9      [?]        [?]          5.55
B9           [?]       [?]        [?]          6.15
B10          90.6     132.3      222.9         6.68

Discussion

Although these correlations are quite
high, the discrepancies do point to a weakness
in the model. In Set A for No. 3 and
4, 5, and 6, and Set B for No. 5 and 6, the
predictions are slightly off. In each of
these cases, the two figures have the same
number of sides and the model seems to
miss because it underestimates the complexity
of the figure with the more jagged
or irregular outline. In retrospect we
observed what appears to be the same
problem with No. 19, 20, and 28 of Figure 4
in Experiment I and with No. 10, 15, and
17 of Figure 6 in Experiment II; that is,
these figures also have very jagged outlines
and have predicted complexity values that
are somewhat too low. Perhaps elements
based on the differences between adjacent
lines and angles could be identified and
sampled by the model. Something of this
kind would have to be added in order to
correct this problem.

The present model has several important
advantages in comparison with other approaches
to the prediction of figure complexity.
It can be applied to any two-dimensional
figure whether symmetric,
partially symmetric, or asymmetric, and it
predicts, without having run an earlier
analysis of responses to a similar set of
figures. Furthermore, as a model of the
perceptual process, it is an integrated statement,
however oversimplified, about how
figure perception takes place, and therefore
it makes predictions about other perceptual
processes. For example, as described below,
the model can be applied to various
issues involving the perception of three-dimensional
figures.

THE MODEL AND SUPPORTING DATA:
THREE-DIMENSIONAL FIGURES

The importance of the third dimension
for a model of figure or pattern perception
is obvious from the three-dimensional
character of most of everyday experience
and from the problem posed by two-dimensional
drawings which are perceived
or recognized as three-dimensional objects.

Fig. 9. Scaled complexity of Brown and Brumaghim's Set A and Set B
figures as a function of predicted figure complexity. (Adapted from Brown &
Brumaghim, 1968.)

Probably, the best known theoretical
approach to the prediction of the three-dimensionality
of a figure is that of Hochberg
and McAlister (1953), Hochberg and
Brooks (1960), and Attneave (1954). The
basic argument is

the more information required to specify each alternative
shape which may be reported in response to
a given stimulus-figure, the less likely it is that
that shape will be seen or reported relative to its
alternates [Hochberg & Brooks, 1960, p. 339].

Thus the simplest figure out of a hypothetical
array of possible figures will be
the one perceived.

Hochberg and Brooks (1960) do not
propose a rationale for identifying this
hypothetical array of perceivable alternative
figures. However, they do propose
and present much supporting evidence for
the following more specific hypothesis:

the apparent tridimensionality of each member
of a family of reversible perspective representations
of a three-dimensional object will be a
simple function of the geometrical complexity of
the two-dimensional figure [p. 340].

This interpretation follows from the basic
assumption, since as the two-dimensional
complexity of a figure increases, it can be
assumed that the alternative three-dimensional
figures are relatively less
complex. Examples of reversible perspective
drawings of families of three-dimensional
figures are shown in Figure 10. To
test the hypothesis, Hochberg and Brooks 
developed three measures to predict per- 
ceived tridimensionality. The one measure 
of complexity which they found as the 
most suitable they denoted as Equation 3 
and it equals: [Z [Ts + Ti + 2 (T) } / N. 
In this equation Ts equals the total number 
of interior angles; Tı» equals the total 
number of different angles; T4 equals the 
total number of continuous line segments. 
N equals the number of the preceding tests 
which actually distinguish between the 
figures of the family being considered. 
(Note: each score of the above tests was 
also standardized on a 0-10 scale.) These 
properties of the equation and their partic- 
ular weightings were based on a factor 
analysis of figure properties related to 
judged three-dimensionality. This mea- 
sure has the advantage of easy computa- 
TABLE 6


TWO-DIMENSIONAL COMPLEXITY Σ(T/S), HOCHBERG AND BROOKS MEASURE OF PERCEIVED
TRIDIMENSIONALITY, AND JUDGED TRIDIMENSIONALITY FOR THE STIMULUS FIGURES
SHOWN IN FIGURE 10


[09] Q Judge im a) Q) (3) 
" Equation No. 3 s ric ir ad i 
Famil: E C lexit; d ial F: l. " ity T 
EL JM — ML OE mm 
pd 
A F 
1 0.0 38.0 0.0 1 14 
2 47 49.9 5.3 2 7.5 vd e 
3 9.0 70.2 7.5 3 4.3 2817 3.9 
4 10.0 101.1 10.0 4 2.5 97, 
0. 01 2 0 6.9 
na =.98 | rn = 92 5 2.5 1812 10.0 
" E Ti; — — 3 to, = — .31 
1 0.0 28.1 0.0 1 0.0 17.9 0.0 
2 48 74.2 3.7 2 10.0 49.3 13 
3 44 108.3 10.0 3 10.0 51.8 8.9 
4 10.0 168.3 10.0 4 í 
na = .79 Toy = .91 B TA Ed 10.0 
- n ni-.97 | rn -.98 
1 0.0 17.3 0.0 1 3 d 25.8 0.0 
2 4.7 29.3 3.9 2 31 50. 43 
3 6.0 23.9 8.0 3 i $4 6.6 
4 10.0 60.4 10.0 4 Be i 84 
nua = 98 ra = .76 5 s 137.7 10.0 
^ ' fz.93 | NE os 
1 0.0 25.1 0.0 
i: i 1 
2 5.0 72.0 7.6 2 ys ins Ht 
7.5 88.5 3 i i 2. 
4 8.2 87.4 100 i E ae T 
ris = .98 To = .99 : 5 1 S 10.0 
i na = .79 gis eq 
1 0.0 274 0.0 
2 1.5 31.0 34 
3 32 48.3 6.9 
4 5.0 121.3 10.0 
rig = .99 Tn = .87 


tion; however, it is not a generally appli- 
cable measure of two-dimensional com- 
plexity since it requires a figure to be a 
member of a family; nor is it developed 
with any special connection to an interpre- 
tation or theory of the process of perceptual 
organization. 
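
The scoring rule described above for the Hochberg and Brooks measure can be sketched in a few lines. This is only an illustration under stated assumptions: the function and argument names are hypothetical, the "0-10 scale" is assumed to be a linear min-max rescaling within a family, and a test that does not distinguish between the figures is assumed to contribute zero. The three tests are taken, as in the text, to be the number of interior angles, the number of different angles, and the number of continuous line segments, with the last weighted by 2.

```python
def standardize_0_10(values):
    """Rescale raw test scores linearly to a 0-10 range within one family.

    Assumption: min-max rescaling; a test with identical scores for every
    figure does not distinguish the figures and is scored 0 throughout.
    """
    lo, hi = min(values), max(values)
    if hi == lo:
        return [0.0] * len(values)
    return [10.0 * (v - lo) / (hi - lo) for v in values]


def equation3_scores(angles, different_angles, line_segments):
    """Return one score per figure: (T_a + T_b + 2 * T_c) / N.

    N is the number of tests that actually distinguish between the figures
    of the family (i.e., tests whose raw scores are not all equal).
    """
    tests = [angles, different_angles, line_segments]
    n = sum(1 for t in tests if max(t) != min(t))
    if n == 0:
        raise ValueError("no test distinguishes the figures in this family")
    t_a, t_b, t_c = (standardize_0_10(t) for t in tests)
    return [(a + b + 2.0 * c) / n for a, b, c in zip(t_a, t_b, t_c)]


# Hypothetical three-figure family (made-up counts, for illustration only).
scores = equation3_scores(
    angles=[6, 12, 18],
    different_angles=[1, 1, 1],   # constant, so this test is not counted in N
    line_segments=[3, 6, 9],
)
print(scores)
```

Because the measure depends on N and on within-family standardization, it can only be computed for a figure that belongs to a family, which is the limitation noted in the text.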

The present model's measure of two- 
dimensional complexity can be used to 
predict perceived three-dimensionality. We 
will use the data from Hochberg and 
Brooks for this purpose and to explore the 
general problem of predicting perceived 


three-dimensionality, The stimuli in Figur. 
8 were shown by Hochberg and Br ook 
college students who rated each mem f 
of the family of related figures on its deg? 
of threc-dimensionality, 


seemed most strongly “solid” or “three-dimensional”
at 10 on the scale, the “least solid” or “most flat”
at 0, and then indicate where the other figures
belonged between these two extremes. The ratings
were standardized to a 0-10 scale within each family
[Hochberg & Brooks, 1960, p. 341].


The obtained three-dimensionality values


PERCEPTION OF SIMPLE GEOMETRIC FIGURES 


shown here in Table 6 were taken from
Table 1, Test 1 of Hochberg and Brooks.
The results in Table 6 show that both
complexity measures generally do well at
predicting perceived three-dimensionality.
Nevertheless, there are obvious failures,
especially Family F and I, and these
suggest extending the model to incorporate
the third dimension.
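
As a check on how the within-family correlations in Table 6 are obtained, the following sketch computes ordinary Pearson correlations for the four Family A figures. The three lists are read from the first block of Table 6; the entry printed as "47" is assumed to be 4.7, and the function name is illustrative rather than anything used by the authors.

```python
import math


def pearson_r(xs, ys):
    """Plain product-moment correlation between two equal-length lists."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)


# Family A, Nos. 1-4, as read from Table 6 (4.7 is an assumed reading of "47").
equation_3 = [0.0, 4.7, 9.0, 10.0]         # Hochberg and Brooks measure
complexity_2d = [38.0, 49.9, 70.2, 101.1]  # model's two-dimensional complexity
judged = [0.0, 5.3, 7.5, 10.0]             # judged tridimensionality

print(round(pearson_r(equation_3, judged), 2))     # 0.98
print(round(pearson_r(complexity_2d, judged), 2))  # 0.92
```

Both values agree with the correlations printed for Family A in Table 6. With only four or five figures per family, such coefficients are of course very coarse summaries.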
Before discussing the addition of the
third dimension to the model, a few words
about the stimuli in Figure 10. Family A
contains the well-known Necker or Kopfermann
cubes. These and most of the
other figures are reversible figures,
that is, they have two structurally identical but
perceptually different locations in space.
Also, the figures within a family are
ordered in terms of increasing judged
tridimensionality. (Figure 10 was taken from
Hochberg and Brooks. There
were figures which were incorrectly
drawn in the original plate; these have
been changed to coincide with the actual
stimuli.) A model of figure perception
should be able to easily predict reversibility.
In addition, the various alternative
perceptions of a pattern should have some
special significance within the model. For
example, No. 1 of Family A (Figure 10)
can be seen as a three-dimensional but
almost flat figure with the alternating sets
of triangles appearing above and
below. It is also possible to
interpret it as the top view of a six-sided
pyramidal solid. No. 2 of Family C also
has at least two interpretations in
three-dimensionality. It can be seen as a side
view of a figure consistent with the other
members of the family, or the center
trapezoidal shape can be seen as a front
plane and the two triangles as part of a
back plane also of a trapezoidal
shape.


ADDING THE THIRD DIMENSION:
THE EASY WAY

The arguments behind the previous
definition of elements and their properties are
easily extended to define volumes as
nonoverlapping and completely enclosed spaces
formed by the intersection of planes, which
are sampled in proportion to their size.
With the extra dimension, there are three
possible axes of symmetry and S takes on
a value of 4.00 for most of the stimuli in
Figure 10.

We are indebted to Julian Hochberg and
Virginia Brooks for providing copies of their original
stimulus figures.

Furthermore, the orientation of lines in 
three dimensions, necessary for the specifi- 
cation of correspondence in evaluating 
sampling redundancy, is specified by the 
orientation of the line relative to the plane 
defined by the horizontal and vertical axes 
and by the orientation of the line relative 
to the plane defined by the vertical axis 
and the axis of the third dimension. Thus, 
to specify orientation of a line in three 
dimensions, two angles are necessary. One 
can be thought of as the orientation of the 
projection of the line onto the x-y plane, 
defined in the two-dimensional sense, and 
the other as the orientation relative to the 
y-z plane. Correspondence in three dimen- 
sions requires identity on both the x-y and 
y-z orientation.
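
The two-angle specification of orientation can be illustrated with a short sketch. It assumes, as one reading of the passage, that both angles are orientations of the line's projections (onto the x-y plane and onto the y-z plane), measured modulo 180° so that direction of travel along the line does not matter. The function names and the tolerance are hypothetical.

```python
import math


def orientation_3d(p, q):
    """Return (x-y orientation, y-z orientation) in degrees, each in [0, 180).

    p and q are (x, y, z) endpoints. A projection that collapses to a point
    is given orientation 0.0 here (an arbitrary convention of this sketch).
    """
    dx, dy, dz = (q[i] - p[i] for i in range(3))
    xy = math.degrees(math.atan2(dy, dx)) % 180.0
    yz = math.degrees(math.atan2(dy, dz)) % 180.0
    return xy, yz


def _axial_difference(a, b):
    """Smallest difference between two orientations taken modulo 180 degrees."""
    d = abs(a - b) % 180.0
    return min(d, 180.0 - d)


def correspond(line_a, line_b, tol=1e-6):
    """Two lines correspond only if they agree on BOTH orientations."""
    a = orientation_3d(*line_a)
    b = orientation_3d(*line_b)
    return all(_axial_difference(u, v) < tol for u, v in zip(a, b))


diagonal = ((0, 0, 0), (1, 1, 0))
shifted_reversed = ((3, 3, 0), (2, 2, 0))   # parallel, drawn the other way
tilted = ((0, 0, 0), (1, 1, 1))             # same x-y projection, different y-z

print(correspond(diagonal, shifted_reversed))  # True
print(correspond(diagonal, tilted))            # False
```

The second comparison shows why a single angle is not enough: the two lines look identical in the x-y projection and differ only in their y-z orientation.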

Given the figures' element properties, the 
model can be used easily to evaluate a 
three-dimensional complexity. How to get 
these properties is a major problem, how- 
ever. For the present, we will use the weak 
approach of using the information from 
all figures in a family, and we will assume 
the figures are all of the same three-dimen- 
sional object. For example, No. 1 of 
Family I is interpreted as two rectangular 
planes approximately 2 × 1 in size and
meeting at a 90° angle. Ambiguity about
angle sizes is resolved by the additional 
simplifying assumption that as many angles 
as possible are assumed to be equal, for 
example, for Family I, angles are only 90°. 
The original hypothesis can be used to 
argue that the difference between a figure's 
two- and three-dimensional complexity 
should be a better predictor of perceived 
dimensionality than two-dimensional com- 
plexity by itself. It is the relative sim- 
plicity of the two-dimensional and three- 
dimensional representation that counts. 
For most of the stimulus families in Figure 
10, this interpretation will not change 
matters, since each figure in the family
has the same three-dimensional complexity,
and subtracting a constant will have no
effect. This is not, however, the case for
Family I, since No. 1 in Family I is rever- 
sible unlike the rest of its family; also in 
Family F, only No. 1, 2, and 3 are reversible. 

An important problem is to estimate how
much more complex a reversible
three-dimensional figure is compared to the
"ordinary" nonreversible variety. The
model is based on the assumption that a
reversible three-dimensional figure's complexity is
weighted, that is, multiplied, by the number
of three-dimensional angles in the figure
which have two three-dimensional
interpretations. By a three-dimensional angle
is meant the angles created at the
intersection of two or more planes. For example,
a Necker cube has eight corners, resulting
from the intersection of three planes, and
each corner has three three-dimensional
angles having two three-dimensional
interpretations; hence its reversibility weighting
factor is 8 × 3 or 24. The reversibility
weighting factors for Families F and I are
shown in Table 7, column 7. For No. 1 of
Family I (Figure 10), the intersection of
two planes has two corners with three
angles at each; and the number of
reversible three-dimensional angles
(reversibility weighting factor) is 12; for No. 3 of
Family I, the reversibility weighting factor
is 18, etc.

Using this rationale for reversible figures,
the model's new predictions of perceived
three-dimensionality, shown in Table 7,
are appreciably better. The correlation
between predicted and judged
three-dimensionality for Family F changes from −.31
to .53; for Family I, from .62 to .95.


ADDING THE THIRD DIMENSION: THE
HARD (AND UNSOLVED) WAY


The central issue is how to represent the
three-dimensional figure given the
two-dimensional drawing. In the preceding
treatment we used information from the
family of related figures but, of course,
much of the time, this information is not
present. Also, to assume all angles are
equal if they can be equal is a questionable
assumption. How does one know in a rigorous
fashion if it is possible for the angles of a
figure to be equal and consistent with the
two-dimensional information? Furthermore,
many figures which could have equal


PAUL C. VITZ AND THOMAS C. TODD


TABLE 7
THE TWO-DIMENSIONAL (2-D) COMPLEXITY MINUS WEIGHTED THREE-DIMENSIONAL (3-D)
COMPLEXITY FOR TH


iS OF | 


ee 3-D elements oes] 
plexity —— ne] Oy 3-D | weighted | Judged | mgr 
Family, PTUS veigh v - | &-D (from ev 
Ed eoe | m Ial o 9 “factor | XRFWa a | Tae | 9 
a) Q) (3) (6) (7) (8) (9) (10) 
F 7 84 |400| 312 
1 174.9 Ep 9 za 18 561.6 | — 386.7 0.0 
2| 2730 p | B c 30 116350 | —13620 3.5 ER: 
4| 95 84|400| 312 * |95|-mes| 35 "9-10 
4 P 2 4 pis 31.2 65.8 6.9 
5 | 1812 15.0 | 4.00 | 54.5 1 54.5 126.7 | 10.0 
I 
3.0 | 4.00 | 135 6 
2| do» mlam] d | 1] Ba ot 2] q8 
i| gus Ei RS 1 13.5 40.0) 28 |r =i 
4 | 808 cad ae 1 23.5 66.3| 10.0 
5 887 | 5.6 | 400] 235 ME: 23.5 65.2| 10.0 
| L. 


ᵃ RWF = Reversibility weighting factor.




angles obviously do not look that way. For
example, slightly “squashed” Necker cubes
are quite possible. The reader may have
already noticed the slightly distorted
properties of the Necker cubes, and of other
stimuli in Figure 10. In short, there are an
infinite number of rational or logically
possible three-dimensional interpretations
of a given two-dimensional drawing. The
problem is to find those underlying
assumptions of the perceptual process which
reduce the possible interpretations to one or
two at most. In an intuitive sense,
the three-dimensional interpretations
which are seen seem to be especially
simple. The idea of reducing the possible
three-dimensional figures to the simplest requires
that simplicity be thoroughly described.
As a first approximation, the
simplest figure appears to be that figure
with the fewest number of angles and with
the least variability in angle size. A
move toward conceptualizing
such a principle has recently been
published by Attneave and Frost (1969).
Another clue to solving the three-dimensional
representation problem is contained
in the work of Hochberg (1968),
which very clearly identifies the importance
of the angles created by line intersection as
the fundamental units in the perceptual
construction of the three-dimensional figure.
In summary, the model's two-dimensional
complexity values do moderately well at
predicting rated three-dimensionality; the
difference between a figure's two- and
three-dimensional complexity predicts
somewhat better. However, the model
suffers seriously from the absence of a
general procedure for interpreting the
three-dimensional representation of a figure from
its two-dimensional drawings.
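
The reversibility weighting and the 2-D minus weighted 3-D prediction used in Table 7 amount to a small piece of arithmetic, sketched below. The function names are hypothetical. The numerical rows are the entries of Table 7 that can be read unambiguously for Family F (Nos. 1, 2, and 5), and the Necker cube factor follows the 8 × 3 example given in the text.

```python
def reversibility_weighting_factor(corners, reversible_angles_per_corner):
    """Number of three-dimensional angles having two 3-D interpretations."""
    return corners * reversible_angles_per_corner


def predicted_tridimensionality(complexity_2d, complexity_3d, rwf=1):
    """2-D complexity minus the weighted 3-D complexity (3-D x RWF).

    A nonreversible figure is given rwf = 1, so its 3-D complexity is
    subtracted unweighted.
    """
    return complexity_2d - complexity_3d * rwf


# Necker cube: eight corners, three reversible 3-D angles at each.
print(reversibility_weighting_factor(8, 3))  # 24

# Family F rows as read from Table 7: (2-D complexity, 3-D complexity, RWF).
family_f = {1: (174.9, 31.2, 18), 2: (273.0, 54.5, 30), 5: (181.2, 54.5, 1)}
for number, (c2d, c3d, rwf) in family_f.items():
    print(number, round(predicted_tridimensionality(c2d, c3d, rwf), 1))
```

The rounded differences (−386.7, −1362.0, and 126.7) match the corresponding column of Table 7. The sketch takes the 3-D complexity and the weighting factor as given; obtaining them from the drawing alone is exactly the unsolved problem discussed above.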
THE DYNAMICS OF STRUCTURED PERSONALITY TESTS: 1971


DOUGLAS N. JACKSON


University of Western Ontario 


Issues raised by Paul Meehl's classic manifesto on the dynamics of structured
personality inventories are reviewed in the light of a quarter of a century of
experience. A series of principles relating to personality scale development
is proposed, which highlights the role of psychological theory and item content
in formulating a multifaceted item pool, and the importance of suppressing
response biases. Because of the competing and often interacting relation of
trait and method variance, multivariate procedures for scale construction
and evaluation of discriminant validity are considered essential. It is
concluded that considerably more is now known about personality and scale
construction than 25 years ago. Personality assessment specialists need not,
and indeed as scientists should not, abandon their unique human capacity to
judge and evaluate item content.


It is now a quarter of a century since Paul
Meehl (1945) published his now classic
manifesto on the dynamics of structured
personality inventories. Meehl's article
Derson s a very important new era in 
a bit ui assessment, with perhaps just 
recae justifiable confidence and even 
i e ed Those doing research in 

the field of personality assessment owe a
tremendous debt of gratitude to Paul Meehl
and the many people who followed in this
tradition. Enthusiasm was
generated for the Minnesota Multiphasic
Personality Inventory (MMPI), stimulating
interest in personality assessment
and research. Equally important were the
many research questions this procedure
raised. Questions relevant to the determinants
of responses to personality items,
actuarial versus clinical prediction, the
degree to which the structure of personality
is reflected in responses to personality
items, and the sources of bias that
may enter into both the individual scores
and the interpretation of scores are just a
few of the many fertile areas of research
stemming at least in part from Meehl's
original article.

Portions of this paper were presented as an

The purpose of this paper is to survey 
certain theoretical developments in per- 
sonality assessment since the publication 
of Meehl's article. Controversial issues 
will be raised regarding much of what 
Meehl and some of his followers have 
proposed. But it is hoped that out of 
discussion of these issues there may ensue 
a richer and deeper understanding, partic- 
ularly if the search for knowledge is 
conducted in the spirit that Meehl so 
strongly advocated, that is, a willingness to 
be guided by empirical data and to correct 
one's conceptions and misconceptions on 
the basis of empirical findings. 

In the past quarter of a century, there 
has been reason to be justly proud of some 
very real achievements in personality 
measurement. Some of these contributions,
like the work of Meehl and Hathaway
(1946), represent important milestones. In
a manner similar to Sir John Suckling's
"Constant Lover," personality assessment
has been like an adolescent seeking romantic
infatuations with a variety of novel
developments. But many contributions,
like early loves, have a lasting impact.
Notable among these contributions turned
milestones have been the introduction into
assessment of questions of substance by
Cronbach and Meehl (1955) and by


3 It is noteworthy that Meehl's recognition of the
importance of theory to personality assessment as
contained in the important construct validity article
Loevinger (1957); the distinction between
convergent and discriminant validity and 
method and trait variance, explicated by 
Campbell and Fiske (1959); and the identi- 
fication by Cronbach (1946, 1950) and 
others of response sets and styles followed 
by a vigorous and controversial research 
literature regarding the identification, role, 
and correlates of response style. Another 
milestone is not indicated by a single article 
or single contribution, but by the cumula- 
tive weight of a great many investigations. 
Reference is made here to the impact of 
test and measurement theory, of factor 
analysis, and of the whole class of multi- 
variate techniques upon present thinking 
in personality. The field of psychometrics 
has been very much a growing, cumulative 
field of science. It has provided powerful 
techniques, which, when coupled with the 
modern computer, afford a degree of power 
and precision that simply did not exist in 
1945. The opportunities for applying these 
techniques for further developments in the 
psychometric or information-processing tradition
should not be overlooked. In fact,
they should be capitalized on whenever 
possible. 
Despite their already acknowledged con- 
tributions, none of these milestones has 
provided any sort of touchstone to the 
measurement of personality, primarily be- 
cause each has not been pushed to its 
logical extension and because investigators 
have been too little aware of the full impli- 
cations and the full integration of these 
various approaches. Stated positively, 
much more knowledge about the nature of 
personality assessment is available than 
the fruits of some fragmented develop- 
ments might imply, and an integration of 
developments will reveal possibilities for 
some truly significant forward thrusts.
This paper focuses on a set of principles 
derived in part from these milestones upon 
which personality assessment theory and 
method may proceed. 


has substantially influenced the present formulation. 
Controversial issues raised in the present paper 
should therefore be construed as referring to certain 
interpretations of the 1945 Meehl article and not to 
Meehl's present thinking on these issues. 




THE CASE FOR EMPIRICAL ITEM SELECTION


Since Paul Meehl's manifesto regarding 
the construction of personality tests is 
taken as a point of departure, it might be 
appropriate to review Meehl’s position. 
Meehl defined structured personality tests 
as those in which the test material consisted 
of conventional culturally crystallized questions
to which the subject must respond in
one of a few fixed ways. Meehl also
suggested that there were essentially two
approaches to assessment, one which he
called the traditional approach and one
which he characterized at that time as a
modern approach because it provided, in
his words, more sophisticated views of the
assessment situation. The first approach
assumed that a person's responses to a
structured personality test involved a
kind of sample of behavior, a “second best
source of information when direct observation
was unavailable.” The essence of this
approach was that the investigator on
intuitive grounds defined the content and
sought to identify verbal behavior in a way
which would permit him to make some
kind of inference about the traits. Meehl
rejected this approach as inopportune for a
number of reasons. It is possible to agree
heartily with Meehl's assessment of the
quality of many attempts at personality
scale construction prior to 1945, while
disagreeing with his attribution of the
cause. In this paper an attempt is made
to show that a sampling conception of an
attribute, when properly applied, is the
preferred approach in the great majority
of instances.

The second approach, according to
Meehl, was that a person's response to a
questionnaire item was in itself a basically
interesting datum of behavior whose
correlates must be uncovered empirically.
It was not for the investigator to say what
this behavior meant, but rather this was
something that should and must be deduced
from the empirical correlates of the
responses to the items. He used as his
example the experience with the Strong
Vocational Interest Blank (SVIB) and the
MMPI. Both of these devices, at that
time, were relatively new.⁴ Both showed
great promise. They were themselves
milestones in that they were among the
first multiscore structured personality tests.
Both used the method of empirical item
selection in their construction, and both
purported to represent comprehensive, if
not universal, characterizations of the
domains which they sought to approach;
namely, vocational interests and
psychopathology. Both were developed in an
intellectual climate of the radical empiricist
reaction against earlier philosophical
traditions in psychology. Although this
reaction was, I believe, both necessary and
healthy, it did involve an unverbalized
belief that rigorous science is equated with
the rejection of theory.

⁴ Because the first SVIB manual was published in
1927, it may not be entirely accurate to characterize
it as “relatively new” in 1945. However, in the
early years, recognition for Strong's achievement
was slow in developing, but had taken firm hold
by the 1940s.

Perhaps as a consequence of such beliefs,
it was almost 30 years since the development
of these two devices before substantial
interest developed in systematically studying
item content as it might be linked to
scale scores, correlates, or criterion groups.
While there were some important exceptions,
to many adherents it was almost
considered unscientific or mystical to
investigate categories of content. Certainly,
those who followed in this tradition
received support from Meehl's (1945)
article, if not from his more recent contributions
(Cronbach & Meehl, 1955). Meehl
noted that the use of empirical item
selection did not require every subject to
interpret the item content in the same way.
A hypochondriac, for example, might
well interpret the word frequently in the
item, “I have headaches frequently,” as
something that is more regular than would the
nonhypochondriac. Thus, according to
Meehl, the nonhypochondriac might have
headaches just as frequently, but the critical
issue was that the hypochondriac interpreted
the item differently. This illustrates
what is a most general point in the empiricist
tradition. That is, the scoring of an
empirically derived inventory such as the
MMPI does not require accurate self-rating
to be useful. In Meehl's own words,


What a man says about himself may be a highly 
significant factor, even though we do not entertain 
with any confidence the hypothesis that what he 
says would agree with what complete knowledge of 
him would lead others to say of him [p. 299]. 


A corollary of this approach is that items 
completely lacking in any rationale, or items 
which might even appear to be opposite 
from what might be expected from theory, 
are permitted with this technique. Thus, 
Meehl suggested that there is no good ra- 
tionale why an individual who might be 
characterized clinically as a psychopathic 
personality would respond true to the item, 
"My parents and family find more fault 
with me than they should”; surely as a fact 
such a characteristic would be difficult to 
validate. But because this item has been 
shown to discriminate empirically between 
such a criterion group and normals, it is 
considered useful in diagnosis. A number 
of other even more psychologically remote 
examples of items keyed on MMPI scales, 
apparently bearing no substantive relation- 
ship to the dimension of psychopathology 
putatively represented by the scale, were 
introduced. These items were considered 
to have had positive value, in that they 
provided a foundation for overcoming 
faking and distortion by providing content 
which even an expert in psychopathology 
had no good rationale for linking to the 
nature of the disturbance. 

Since the approach advocated here de- 
parts significantly from the empirical tradi- 
tion, there will be occasion to return to 
many of these points, as well as to many of 
the contributions alluded to by other re- 
search workers during this era. An attempt 
is made to demonstrate that what might 
appear to be sources of controversy and 
potential bases for argument may actually 
be integrated into a formulation of the 
dynamics of structured personality inventories.
This formulation, while incomplete
and partial, may serve to offer a
comprehensive basis for moving forward with the
valid, substantively meaningful assessment
of personality.
IMPORTANCE OF THEORY IN
SCALE CONSTRUCTION


The first general principle is that personality
measures will have broad import and
substantial construct validity to the extent, 
and only to the extent, that they are derived 
from an explicitly formulated, theoretically 
based definition of a trait. This principle is 
based on the broad assumption that every 
act and every criterion performance will 
usually reflect a wide range of underlying 
traits. But it will not reflect each trait 
with equal power or precision. Thus, be- 
cause every behavior, verbal or otherwise, 
is a manifestation of some underlying set of 
characteristics, these characteristics may be 
assumed to be measurable by a sampling of 
behavior. But it will not do to sample just 
any behavior. Assessing that set of be- 
haviors most relevant to the trait in ques- 
tion is the most efficient way of measuring 
the trait. Ordinarily, issues, questions, or 
situations directly bearing on the character- 
istics of interest will show the highest 
probability of reflecting them. Thus, for 
example, if we wish to measure a trait 
such as dominance, items bearing directly 
on characteristic modes of leading or direct- 
ing other people and derivatives therefrom 
would be more likely to be reliable and 
valid indicators of such a trait than would 
more remote, substantively oblique mani- 
festations of dominance. Certainly, one 
of the least efficient ways of proceeding to 
measure such a trait would be to compile 
randomly a heterogeneous, unselected group 
of items and then to derive empirically a 
handful of items differentiating criterion 
groups. 

Of course, a substantive approach places 
a certain responsibility on the scale de- 
veloper. His responsibility is to have some 
understanding of what it is he wishes to
measure. He must not be content with the 
mere prediction of a criterion without 
knowledge of the psychological makeup of 
the characteristics involved. But it is 
doubtful whether very many psychologists 
are solely concerned with the prediction of 
a criterion. Probably few of us use
psychological tests in the classical manner of
the applied psychologist, who might have
been concerned with such questions as
predicting which salesman will sell the
most insurance. Few are even interested
in predicting how a psychiatrist will
diagnose a patient. Rather, most of us are
primarily interested in imputing latent
characteristics to respondents on the basis
of their test scores. Even the MMPI and
the SVIB, both originally empirically
derived, are used typically to form
some characterizations of underlying traits.
Indeed, the recent revision of the
SVIB (Campbell, 1969), in a radical
departure from previous practice, now
recommends that a new set of rationally
constructed scales form the major basis for
interpretation.

Cronbach and Meehl (1955) have
suggested that empirically derived scales
might serve to enrich understanding by a
bootstrapping technique, much as in the
manner of Alfred Binet, who, when he
started, purportedly knew little more about
intelligence than was contained in teachers'
criterion ratings of bright and dull pupils.
But such a procedure is justified only under
circumstances of complete or almost totally
complete ignorance. Ordinarily,
psychological science will be served better if an
investigator is willing to take a stand on
the nature of a construct, to define it,
to measure it, and to test its implications
in a manner that allows him to confirm
or disconfirm his hypotheses. There was a
point when we were psychologically naive
about the areas of psychopathology and of
vocational interest. It is perhaps in one
sense fortunate that Edward K. Strong and
Hathaway and McKinley proceeded in the
way they did at a time when the issue of
whether or not vocational interests and
psychopathology could be measured at all
was a matter of some dispute. But the
ignorance about personality to which Meehl
alluded a quarter of a century ago hardly
seems a suitable defense at the present time.
Research studies in psychology account for
one of the largest sources of published
scientific literature, and a substantial
portion of this research concerns areas related
to personality. There is even considerable
evidence that a nonpsychologically trained
individual can make quite refined
distinctions regarding personality, if the questions
are properly put to him.

To support the assertion that there is an
overwhelming substantive basis for
personality item responding, data from two
general sources will be drawn upon, the
first an analysis of content dimensions in
the Personality Research Form (PRF) and
in the SVIB, and the second derived from
results of studies of judgmental processes
in the perception of personality and
personality scale content.
In the course of the construction of
the PRF (Jackson, 1967c), one prerequisite
to developing item pools was a very careful
study of the definitions of variables of
personality given by Murray (1938), which
in many cases also involved an elaboration
and refinement of these definitions.
These definitions were then translated into
a large sample of items, designed to serve
as a comprehensive representation of the
characteristic, removing sources of bias
insofar as was possible, and writing items
with critical regard to the conceptual overlap
between scales. On several occasions,
we appealed to both formal psychological
theory and to more informal substantive
conceptions of the trait. This is precisely
the kind of rational process about which
empiricists have been so skeptical, and
about which Meehl cautioned a generation
of test constructors. We did not, of course,
stop at merely a substantive appraisal of 
the dimension, but sought to evaluate this 
procedure in a number of ways. The first 
important evaluation stemmed directly 
from the item-analysis procedures. Listed 
on Table 1 are 7 of the 20 substantive PRF 
scales, together with the initial number of 
items written for each scale. These items 
had already been subjected to thorough 
editorial review and revision, so that a 
good many items which were initially writ- 
ten for the scale were not even included in 
the item analysis for failing to pass the 
first substantive hurdle. These items were 
administered to 309 male and female uni- 
versity students in two sessions separated 
by one week, a procedure necessitated by 
a concern for the endurance it would have 
required to answer almost a thousand per- 
sonality items in one sitting. 

First, an examination of the Kuder- 
Richardson Formula 20 reliabilities on 
Table 1 reveals that these are as high as 
those for some of the better achievement 
tests of comparable length. If anyone
should suggest that personality assessment 
has not yet achieved the level of fidelity 
attained in achievement testing, it might 
be appropriate to point out that comparable 
levels of reliability can be attained, and 
that, in addition, more is usually demanded 
of a personality test. Few achievement 
test specialists, for example, demand that 
their procedures demonstrate convergent 
and discriminant validity. The level of 
reliability attained encouraged further
evaluation of item properties. Biserial
correlations were computed between each item
and its own total scale, together with
correlations with each irrelevant scale and a
specially devised desirability scale. For
each scale, the best 40 items were chosen
on the basis of a formula which seeks to
maximize the content saturation of the
item in relation to its saturation with a
desirability scale, the Differential
Reliability Index, developed in collaboration
with Samuel Messick (cf. Neill & Jackson,
1970). Listed in Table 1 are the mean
biserial correlations (7;,) of the best 40 
items for each scale with its own total scale. 
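
The two item-analysis quantities mentioned in this paragraph can be sketched as follows. The Kuder-Richardson Formula 20 is the standard one for dichotomously scored items. The article does not state the formula for the Differential Reliability Index, so the form used here, the square root of the squared item-scale correlation minus the squared item-desirability correlation, is an assumption based on how the index is usually described; the function names and the toy data are likewise illustrative.

```python
import math


def kr20(responses):
    """Kuder-Richardson Formula 20 for a persons x items matrix of 0/1 scores.

    Uses the population variance of total scores.
    """
    n_persons = len(responses)
    k = len(responses[0])
    totals = [sum(row) for row in responses]
    mean_total = sum(totals) / n_persons
    var_total = sum((t - mean_total) ** 2 for t in totals) / n_persons
    sum_pq = 0.0
    for j in range(k):
        p = sum(row[j] for row in responses) / n_persons  # endorsement rate
        sum_pq += p * (1.0 - p)
    return (k / (k - 1)) * (1.0 - sum_pq / var_total)


def differential_reliability_index(r_item_scale, r_item_desirability):
    """Assumed form: sqrt(r_is^2 - r_id^2), floored at zero.

    Large values indicate content saturation well in excess of the item's
    saturation with desirability.
    """
    diff = r_item_scale ** 2 - r_item_desirability ** 2
    return math.sqrt(diff) if diff > 0 else 0.0


# Toy data: four respondents, three perfectly consistent items.
toy = [[1, 1, 1], [0, 0, 0], [1, 1, 1], [0, 0, 0]]
print(round(kr20(toy), 6))                                   # 1.0
print(round(differential_reliability_index(0.5, 0.3), 6))    # 0.4
```

Under this reading, ranking the items of a scale by the index and keeping the top 40 would correspond to the selection step described in the text.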
It is of interest to inquire to what extent
irrelevant items, that is, items written for 
a different scale, correlated more highly 
with a particular scale than did the mean 
of the 40 items finally chosen for inclusion 
in Forms A and B of the PRF. Hypotheses 
about traits implicit in each item can thus 
be evaluated, providing an index of the 
degree of confusion between personality 
constructs. The number of times that this 
did not occur in relation to the total number 
of opportunities for the intrusion of an 
irrelevant item can be expressed as a pro- 
portion as indicated in the final column in 
Table 1. For the Dominance scale, there 
were no occasions when 1 of the 819 irrele- 
vant items exceeded the mean content 
saturation of the best 40 Dominance items. 
This perfect batting average occurred for 
the Understanding and Impulsivity scales 
as well. For the other scales, the percentage 
of hits was almost as high; in no case was 
it lower than 99.8%. For the entire pool 
of items, there were approximately 5,700 
opportunities for irrelevant intrusions of 
the type described, but these occurred 
only a total of five times, and only as often
as twice in any one scale. This is strong
evidence, I believe, for the proposition that
a group of item writers can have more than
trivial knowledge about the nature of 
personality constructs, and can proceed 
from a position other than from one of 
sheer ignorance. It should be emphasized 
in this comparison that these items were 
not simple restatements of the same theme, 
but covered the gamut of situations and 
modes of response appropriate for sampling 
broadly a personality construct. 
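
The tally of intrusions reported in the final column of Table 1 amounts to a simple count, which might be sketched as follows. The function name and the toy correlations are hypothetical. Only the closing arithmetic uses figures from the text: five intrusions in approximately 5,700 opportunities.

```python
def intrusion_summary(own_item_correlations, irrelevant_item_correlations):
    """Count irrelevant items that exceed the mean of a scale's chosen items.

    Returns (intrusions, opportunities, hit_rate), where hit_rate is the
    proportion of opportunities on which no intrusion occurred.
    """
    mean_own = sum(own_item_correlations) / len(own_item_correlations)
    intrusions = sum(1 for r in irrelevant_item_correlations if r > mean_own)
    opportunities = len(irrelevant_item_correlations)
    hit_rate = 1 - intrusions / opportunities
    return intrusions, opportunities, hit_rate


# Toy data: four chosen items (mean 0.575) and five irrelevant items,
# one of which (0.58) exceeds that mean.
own = [0.55, 0.60, 0.50, 0.65]
irrelevant = [0.10, -0.05, 0.58, 0.20, 0.02]
toy_result = intrusion_summary(own, irrelevant)

# Figures from the text: five intrusions in about 5,700 opportunities.
overall_hit_rate = 1 - 5 / 5700
```

With the figures from the text, the overall proportion comes to roughly .999, consistent with no single scale falling below 99.8%.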

It would not be accurate, however, to
imply that we were never wrong in our
miniature hypotheses contained in each
item about a particular personality
construct. On a number of occasions, items
did not show the high levels of content
saturation that were demanded, and
therefore sometimes manifested an inappropriate
pattern of correlations with irrelevant
scales. On other occasions, an item
correlated too highly with the desirability
scale, or showed disproportionate
endorsement frequencies. But in the aggregate,
hypotheses were much more frequently
confirmed than disconfirmed. The
subjecting of a hypothesis to empirical test was
a chastening as well as a rewarding
experience, in that a great deal was learned about
how complexities and ambiguities of
wording could exact a toll. But it is infinitely
more efficient to proceed by postulating
explicitly the nature of that which was to
be measured than by starting with an
unselected item pool garnered from any
convenient source. Furthermore, a
substantive basis for item selection tends to
lessen the chances of initially selected items
failing when administered to a new sample
of respondents. It is my hunch that the 
great majority of subtle items uncovered 
with the MMPI (Wiener, 1948), and con- 
sidered a positive virtue by the proponents 
of empiricism, are present in MMPI scales 
due to errors in sampling items and sub- 
jects in the initial item-selection procedures 
of the MMPI. Most subtle items have 
been shown to correlate negatively with 
the rest of the items contained in a parti- 
cular MMPI scale, raising the suspicion 
that they did not belong there in the first 
place. 

The above assertions imply that a sample
of item content most directly relevant to a
particular trait will be the most efficient
way of going about measuring it.
Ordinarily, this assertion has a great deal of
merit, but it must be amended in certain
particulars. If an investigator should
want to measure sadistic impulses, for
example, it would be quite improbable that
he would observe a person torturing
animals, nor would any but a very small
minority of people endorse item content
regarding the satisfaction they derive from
torturing animals. Of course, if such
behavior were to be observed, an inference
about sadistic impulses would be quite
defensible. But ordinarily it would be
necessary to incorporate subtlety and relatively
more remote characterizations of such
impulses. For example, items inquiring
into a preference for the occupation of
butcher or into the person's attitudes about
the punishment of criminals might be
appropriate. For most traits of personality
and psychopathology, it is possible to
identify behavior capable of being
incorporated into item content in such a way as
to avoid arousing such a degree of
defensiveness as to render valid assessment
unattainable.

The SVIB is considered by many as one
of the most successful psychological tests
and the finest example of the use of
empirical item selection with criterion
groups. Perhaps because of the rationale
of the original test author (cf. Campbell,
[...]), there was a general reluctance
to look at item content, and for many
years item content in the SVIB was
not considered a fit matter for investigation.
However, David Campbell and
his colleagues at the University of
Minnesota (Campbell, Borgen, Eastes,
Johansson, & Peterson, 1968) proceeded to
construct by a combination of rational and
correlational means a set of content keys
for the SVIB, covering such areas as music,
art, [...], and some two dozen
others. It was possible for these
investigators to use the very extensive data
bank of item responses of a diversity of
occupational groups accumulated on the
SVIB. A perusal of the results of a variety
of occupational groups reveals that in every
single case a criterion group substantively
related to the particular content
scale received the highest score on that
scale, and that the other groups were
ranked according to their relation to the
activity. Thus, mathematicians expressed
a preference for such activities as calculus and
[...] more than did any other group.
Musicians expressed a preference for
musical activities, salesmen for selling, and
artists [...]. This should not be
surprising, although perhaps it might be to
those committed to the empirical
tradition. Rather, it is perhaps surprising
that so little attention was focused on this
aspect of vocational interest preference.
Returning to personality assessment
proper, a critic might wish to challenge the
above item analysis findings on the grounds
that the item writers at Stanford
University where the work was done were a
particularly talented group who possessed
very much more than an average degree
of psychological training and insight. This 
may have been true, but, at the same time, 
there is evidence to indicate that this in no 
way diminishes the important role played 
by substantive considerations in responding 
to a personality item. There are data 
indicating that subjects of no more than 
average sophistication in the psychology 
of personality have shown remarkable 
acumen. Much of this work derives from 
investigations of trait inference theories 
utilized by judges in studies of person 
perception. 

In one such study (Jackson, 1970), a 
number of hypothetical people were char- 
acterized in terms of a narrative descrip- 
tion. One hypothetical person was pre- 
pared to represent each pole of three 
personality dimensions, Autonomy, Im- 
pulsivity, and Dominance. For example, 
the description below represented a person 
at the positive pole of Autonomy. 


Edward Archer is a free-lance writer. At one time 
he worked for a newspaper, but quit because he 
felt restricted by the company's regulations. He 
now enjoys his work because he is his own boss. 
Edward spent the last year travelling alone through 
several European countries. He has currently taken 
up temporary residence in a large city's Bohemian 
district. Edward strongly advocated the view that 
today's young people are mere imitations of each 
other. He frequently tells people that the world 
needs more "individuals." His friends describe 
him as independent, headstrong, and freedom-loving. 


Items from each of the three PRF scales 
were also selected to represent each pole 
of these dimensions. Judges, 59 officer 
candidates from the Canadian Forces, were 
given the task of judging the probability
[...]
of each of the three hypothesized
dimensions, and all of the appropriate PRF items
were projected on the appropriate
dimensions with no exceptions and no
nonpredicted items appearing where they should
not. On the separate analyses, the
randomly divided groups showed convergence
for the three dimensions as indicated by
product-moment correlations for the three
respective dimensions of .99, .99, and .98.
Clearly, when the questions are properly 
put, judges can reliably evaluate the under- 
lying content dimensions contained in a 
set of personality items. 

Results from the above study are
consistent with findings recently reported by
Lay and Jackson (1969) of a very high
degree of agreement in multidimensional
scaling analyses of trait inferential
judgments of male and female judges.
Furthermore, it was found that the implicit
"mapping" of traits, as, for example, in a
finding of a negative inferential relation
between an Order statement and an
Impulsivity statement, were also represented
in the factor space of PRF scales. Figure 1
illustrates the extent of agreement obtained
between the inferential and empirical PRF
dimension. This implies that there is a
veridicality to implicit personality theory
in the sense that inferred relationships are
mirrored in the real world. A particularly
striking further application of the use of
implicit personality theories is in a study
by Stricker, Jacobs, and Kogan (1970).
These authors discovered that female high
school students, when asked to group
personality items from the MMPI Pd scale
on the basis of what they thought was
being measured and then to name the trait
or characteristics being measured, showed
a remarkable congruence with
factor-analytic investigations of the same content.


SAMPLING THE CONTENT DOMAIN AND THE
PROBLEM OF SUBTLETY

What implications can be derived from
results such as those described in the above
section? It is now possible to state two
important further principles which might
be considered as corollaries of the first
general principle regarding the theoretical
foundations of personality assessment.
An item pool may be viewed as a sample of
[...]; it should be evaluated in terms of
how [...] and how representatively it
samples the content domain or universe of
content [...] by the definition. The
question of sampling content hardly needs
elaboration except perhaps to suggest
that judgment plays a very important
role. In fact it has no equal.
Techniques which formalize human judgment,
such as those employed in the
multidimensional scaling studies described previously,
provide a substantive basis for evaluating
the possibility of discriminating between
constructs. If judges cannot
discriminate between related constructs using
such techniques, it is quite
unlikely that respondents can either. But
at the initial stages of personality scale
development one is not likely to have items
in sufficient quantity and quality with
which to work. It is necessary, therefore,
to define what one is seeking to measure
and to develop an item pool.

Consider the dimension of risk taking.
One might wish to survey the
situations in which a person might
undertake risks of various sorts. One would then
establish a matrix of various situations
involving risk, including those related to
personal relations, financial activities,
[...] bodily injury, and some of the
[...] manifestations of risk taking in
[...]. In addition, one would proceed to
define the [...] of behavior entailing
risks of these various sorts and of typical
activities, such as driving an automobile,
working with machinery, or participating
in sports. If it is assumed that risk taking
is a more or less unitary dimension, one 
would expect manifestations of risk taking 
transcending each of these situations and 
showing a pattern of consistencies. Psy- 
chologists, prone to emphasize situational 
aspects of behavior, have pointed out that 
situations often determine the responses of 
groups of individuals. However, if a pre- 
dominant response across all of the situa- 
tions can be specified, and if these can be 
shown to be substantially associated with a 
single continuum, then one can speak of 
risk taking as a unitary dimension. The 
question of the evaluation of the viability 
of a trait or construct conception of per- 
sonality characteristics thus remains empiri- 
cal, employing the criteria of response 
homogeneity and stability, and
heteromethod convergent and discriminant
validity.

But, while recognizing the importance of 
these evaluative criteria, the prior question 
of the adequacy of item sampling remains. 
Short of actually sampling the behavior of 
very large numbers of individuals in their 
natural state in the manner of Barker's 
(1955) One Boy's Day magnified by a factor 
of 10,000, there is probably no way to 
judge the adequacy of an item pool and of 
a definition of a trait except by the use of 
rational processes, either of an individual 
psychologist or of a group of judges. 

But if anyone wishes to dispute the 
utility of such rational processes in item 
selection, and to question whether such 
approaches involve ecological as well as 
construct validity, the author issues a 
challenge. For any trait for which sub- 
stantive definition is possible, let the most 
elaborate empirical item-selection proce- 
dures using criterion groups be pitted 
against two hours of work by a couple of 
good item writers. In this case, it is sug- 
gested that the selection of item writers be 
substituted for the selection of items. If
anyone wishes to accept the challenge, these 
two alternatives might well be appraised. 
The rules of the game would be that the
empirical procedure would employ a
heterogeneous item pool like that contained in
the California Personality Inventory or the
MMPI, either published or unpublished,
whereas the substantive approach would 
involve two item writers working for two 
hours each. It would be necessary to 
exclude item pools which already contain 
substantive item selection of traits similar 
to the one at issue. One might extend this 
challenge even further. It might even be 
possible to use unselected item writers. It 
might be interesting, for example, to have 
an introductory class of psychology
students write one item each with regard to a
defined dimension, with perhaps just a bit
of screening for substantive cogency and
clarity of style, and conduct the comparison
on that basis. The comparison proposed
would be, of course, that of the empirical
validity against a criterion relevant to the
construct in question. The author would
fully expect under cross-validation that
even an inexperienced item writer would
be superior to empirical item selection with 
a typical heterogeneous item pool. 

This, of course, is attacking the problem
on the home ground of the empirical
approach to item selection. This is the raison
d'être of this approach: all other
considerations are secondary to empirical validity.
The primary reason for believing in the
superiority of substantive over empirical
procedures is based on experience with the
PRF and with a number of studies
conducted with my associates at the University
of Western Ontario, and by others (e.g.,
Goldberg & Slovic, 1967). In the case of
the PRF, for example, none of the
item-selection procedures had reference to the
empirical validity of an item. However,
when the correlation between each scale
and a predicted peer behavior rating was
appraised, the median validity proved to
be .52 for 20 scales. A search of the
literature has failed to uncover any instances of
higher validities from any approaches
focusing on empirical validity in item selection.

The reasons for the modest cross-valida- 
tional records for empirical techniques are 
apparent. Rarely do item pools in use by 
those who seek to maximize empirical 
validity in item selection constitute an 
adequate sample of the behaviors relevant
to a particular trait. Rarely are they
sufficiently free from forms of response bias.
Rarely are the criterion groups that well 
defined that they have not incorporated 
bias in their selection, and rarely do they 
constitute a comprehensive characteriza- 
tion of the trait. Rarely are scales so 
constituted mutually exclusive with dis- 
criminant properties. But if a rational 
item pool is developed carefully and well, 
these problems can be overcome; so too 
can the problem of item transparency in 
personality assessment, an observation 
which leads to a second corollary of the
first principle. 

Since good subtle items require a sensitive
appreciation for the remote, discriminating
manifestations of a trait, a necessary
precondition for developing a pool of subtle
items of adequate quality and quantity is
substantive specification of item content. Of
course, subtlety is a positive attribute in
a personality questionnaire, particularly
when one is measuring undesirable traits,
such as in the previous example involving
sadistic impulses. It might appear as a
contradiction that both the direct
representation of content as a sample of an
appropriate universe and the positive values of
subtlety are considered positive virtues.
An example will help clarify this apparent
paradox. Consider the item, "I think
newborn babies look very much like little
monkeys." A respondent might be hard
pressed to tell what is being measured
by such an item. But if he knew that this
item was written to represent an attitudinal
exemplar of the negative pole of the
Nurturance dimension, and in fact did so very
well, the interpretation of the item becomes
clearer. Given a definition of a trait, it is
possible to identify meaningful, valid, and
relatively remote exemplars. They become
remote by virtue of the fact that the
individual may not know precisely what traits
are being measured. But if the average
judge knows about the nature of the trait
being assessed, he can, as the social
perception studies indicate, quite accurately infer
the nature of the content, and we might be
quite suspicious of it if he could not. This
ability is not uniformly present in the same
degree in all respondents (Stricker, 1969)
but, like knowledge of connotative
attributes of content, is differentially present
(Jackson & Messick, 1969). Thus, there
is a fundamental distinction between subtle
items as described here and as identified
in the tradition of empirical item selection.
Whereas subtle items on the MMPI were
identified post hoc solely on the basis of
their sample estimate of a correlation with
a criterion, we require, in addition to an
empirical relationship, a substantive a
priori link to an underlying construct.

Whereas Meehl argued for a projective
approach to an item in which validity hinged
on the misinterpretation of some aspect of
the item, one might well argue for the
opposite. Those items are the best which
[...] and which contain referents
about which people agree. This is not to
say that respondents have to see the
connection between the underlying trait and
the item. But in the example of the item
about newborn babies looking like monkeys,
it is reasonable to assume that individuals
have had sufficient experience, either
directly or indirectly, with both monkeys
and babies to answer meaningfully. In
the [...] 15,000 items that the author
has had occasion to review in the past
[...], more than a few ambiguous items
[...] his editorial screening. Such
items almost invariably have proved to be
those with mediocre statistical properties.

The method of choice in personality scale
construction is one which focuses on a linear
relationship between items and a single underlying
continuum. Other approaches,
embodying more complicated models such as
dynamic and class models, have the status of
curiosities. Much has been written in the
last quarter century about the problem of
measuring personality. Many have argued
that personality is complex and that the
linear approach of factor analysis and
classical test theory is unequal to the task.
There has emerged a variety of
alternative models which, implicitly or
explicitly, has guided test construction. Among
these are the class model of psychiatric
diagnosis implicit in the development of
the MMPI, in which an individual is con-
sidered to be a member of one of several 
postulated classes, possibly genetically
determined (cf. Meehl, 1959). Loevinger
(1957) has called attention to dynamic 
models in which opposites appear similar, 
as in the manifestations of the psychoanaly- 
tic defense mechanism of reaction forma- 
tion. There are the familiar arguments 
regarding configural scoring (Meehl, 1950) 
and of interpreting patterns of scores on a 
psychometric instrument. 

Unfortunately, authors who argue for 
configural approaches rarely have taken 
the trouble (Meehl is an exception) to 
specify mathematically the model appro- 
priate for their evaluation. Even more 
rarely has empirical evidence been adduced 
to support such conjecture. While the 
author's credentials, both in terms of a 
fairly extensive reading, for example, of 
psychoanalytic literature, and an early 
exposure to the lore surrounding the inter- 
pretation of the MMPI profiles, might 
encourage a sympathetic appraisal of such 
approaches, convincing shreds of empirical 
evidence for the preferability of a class, 
dynamic, or nonlinear model for combining 
items, or for any significant gain in applying 
configural scoring over the usual linear 
scoring to items, are encountered rarely, 
if at all.⁵ This should remain an open
question, and those willing to expend effort 
and energy in pursuing these areas deserve 
our warm, if skeptical, encouragement. 
But until really convincing evidence is 
forthcoming, the substantial body of knowl- 
edge in classical test theory should not be 
abandoned. Until that time, such models 
have the status of curiosities; their
existence in isolated cases is not denied, but
their low base rates do not encourage the
expenditure of much time in their
identification.

⁵ A distinction should be made here between the
configural scoring of items, to which the above
comments apply, and the configural combination
of scales. The latter approach might well prove to
be a fruitful avenue to the investigation of
psychopathological syndromes.

Jane Loevinger has taken an opposite
point of view. She has been an incisive
critic, not only of ad hoc methods of
personality scale construction, but of classical
test theory and factor analysis as well.
One might hope, then, that her own
investigations would provide more sensitive, subtle
insights into personality than those
provided by some earlier investigators.
Unfortunately, the empirical work is at best
discouraging. In one series of studies,
Loevinger and her associate (Ernhart &
Loevinger, 1969) used a type of
homogeneity clustering to identify subsets of
items empirically sharing overlapping
variance. Five clusters of authoritarian family
ideology were isolated, but on
cross-replication all but one of these failed to reappear,
and the one that did reemerge appeared to
reflect a broad dimension highly correlated
with intelligence, social class, and
education, recalling earlier findings with the
California F Scale. The F Scale has been
shown to elicit a broad response-bias
dimension, uncorrelated with attitudinal content,
but related to style of wording (Clayton &
Jackson, 1961), and substantially related
to intellectual ability, educational level,
and social class.

While Loevinger and her colleagues are
to be commended for seeking
cross-replication in a diversity of samples, the
application of such a homogeneity keying
procedure, without adequate concern for content
sampling within scales or for response
biases elicited by item style, involves
something equivalent to the crudest application
of a linear model. This results in total
scores grossly oversimplifying complex
combinations of variance from a number of
sources. A procedure in which the
dimensions to be measured are specified in terms
of their putative manifestations, and in 
which the expected response biases are 
carefully controlled, would have been 
preferable. While the routine application 
of factor analysis may have certain prob- 
lems, until its critics can devise more sensi- 
tive procedures, one cannot be impressed 
by the substitution of simplistic, atheoreti- 
cal procedures for combining items. 


PROBLEM OF RESPONSE STYLES 


Now, if sufficient concern has been
conveyed for measuring content, consideration
may be directed to the fifth general
principle regarding the dynamics of structured
personality inventories. It concerns
response styles, which may be considered
complementary to the content component
of response consistency. The principle is
that to construct psychological measures in
disregard for sources of method variance is
to court disaster. It is to the credit of
Meehl and Hathaway (1946) that they
recognized that scales constructed on the
basis of internal consistency analysis
typically would emerge with broad dimensions
of test-taking attitudes. Unfortunately,
they did not carry the implications of this
argument far enough. Had they done so,
further attempts at scale construction
might have proceeded quite differently and
might have developed sets of scales far less
saturated with broad general factors lacking
in discriminant validity. One such issue
is the definition of content and of style and
the relative importance of each in particular
assessment contexts. Jackson and Messick
(1962) factor analyzed the MMPI on three
separate samples, identifying two large
dimensions, accounting for about 75% of
the common variance. One factor was
highly correlated with mean desirability
scale values of scales, and the second
separated true- and false-keyed subscales.
Jackson and Messick interpreted these
dimensions as stylistic (Jackson & Messick,
1958) in the sense that they were general
over a wide range of content, were elicited
by aspects of item form, that is, the
desirability and direction of keying of items, and
possibly reflected individual differences
relevant to some criteria, such as the
separation of normal from deviant populations.
Block (1965) (cf. Block, 1967; Jackson,
1967a, 1967b) preferred to focus on a
substantive interpretation of the two large
MMPI factors, arguing that their criterion
correlations implied interpretations in terms
of "ego resilience" and "ego control."

Rorer and Goldberg (1965) reported
correlations between original and reversed
MMPI scales which appeared to reflect
response consistency to item content.
However, Jackson and Messick (1965) identified
the same two large dimensions in the
Rorer-Goldberg data, one interpreted as
desirability and the second associated with
the keying of scales. Most of the variance
attributed to content in correlations
between individual MMPI scales by Rorer
and Goldberg proved to be a function of
these two large factors, which Jackson and
Messick interpreted as stylistic.

Obviously, the point of view reflected in
the work of Jackson and Messick involves
a different definition of content and of
style than does that of proponents of the
tradition of empirical item selection. The
interpretation of the role of content and of
style must inevitably reflect the definitions
given each. One problem with the
tradition of empirical keying with respect to
external criteria is that its usual application
has no firm basis for imputing latent
[...] to respondents as a function of
test scores, no firm basis for relating
item content to scale content, no substantive
[...] for the relevance of items to
theoretical dimensions, no structural
[...] for the relation of the pattern
of item responses to the pattern of scale
scores, and, finally, no necessary
requirement for convergent and discriminant
properties of scale scores (Campbell & Fiske,
1959) with respect either to the internal
structure of the test or to external criteria.
If one's definition explicitly incorporates
such requirements (cf. Bentler, Jackson, &
Messick, 1971; Jackson & Messick, 1965;
Loevinger, 1957), then one would look with
[...] on imputing discriminant content
characteristics to sets of highly redundant
scales [...] randomly with respect to specific
[...] but systematically with respect to
item form (Edwards, 1966; Jackson &
Messick, 1962). On the other hand, if
external validity is the sole criterion for
including or excluding an item from a scale,
the basis for consistent responses to items
becomes secondary to its external validity.
Thus, within this framework it really makes
very little difference theoretically or
practically whether one prefers the interpretation
of extremely general "content" dimensions
which fail to show substantial discriminant
validity, or the interpretation of pervasive 
stylistic dimensions which show certain 
empirical correlates. If, in the tradition 
of radical empiricism, one eschews theoreti- 
cal or substantive definition and prefers to 
rest one’s case solely with external cor- 
relates, there is, indeed, no basis for 
distinguishing content from stylistic vari- 
ance, and the issue becomes equivocal and 
a source of much needless contention. 
Carried to its extreme, this point of view 
places the investigator in the nihilistic posi- 
tion of denying the relevance of evidence 
relating to the response process, and hence 
being unable, apart from external validity,
to specify the nature of responses to
content and to item form. While not 
denying the relevance of external validity, 
particularly convergent and discriminant 
validity, the author takes the position that 
a great deal is to be gained by learning 
something about why a person answers a 
particular item the way he does. To 
capture elusive content variance, it is 
essential to attend to all sources of variance,
including stylistic or method variance
(Wiggins, 1968).

This viewpoint is reflected in a series of 
studies undertaken by the author with a 
number of collaborators at the University 
of Western Ontario. Trott and Jackson 
(1967) sought systematically to manipulate 
the parameters affecting acquiescence by 
varying item length, content dimensions, 
exposure time, content saturation, and 
subjects’ verbal ability in two factorial 
designs each conducted separately for 
males and females. By selecting items so 
that content was balanced as to direction 
of keying, it was possible to obtain a score 
reflecting content, based on equal numbers 
of true- and false-keyed responses, and a 
second reflecting acquiescence, based on an 
all true key for each content scale. Of 
particular interest was the powerful effect 
of speed of presentation on acquiescence. 
Figure 2 shows that acquiescence decreased 
monotonically as a function of increasing 
presentation intervals. On a separate 
sample, when content scores were factor 
analyzed, the largest factor for the fast 
speed condition was one separating true- 
and false-keyed subscales for the same 
content dimension ; for the longer exposure 
condition, a similar factor was only the 
fifth largest. Item length singly, and in 
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(Reprinted with permission from an article by D. 
M. Trott and D. N. Jackson published in the 
Journal of Experimental Research in Personality, 
1967, Vol. 2. Copyrighted by the Academic 
Press, 1967.) 
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interaction with speed of presentation, was 
significant in affecting acquiescence scores 
for males but not for college females 
(Figure 3). Content saturation, as re- 
flected in separately determined biserial 
correlations with the total scale, interacted 
with both scale content and item length 
in affecting acquiescence. Trott and Jack- 
son employed an information-processing 
rationale for these and similar findings. 
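
The two scores described above for balanced scales can be illustrated with a minimal sketch. The function name, the response pattern, and the six-item scale are hypothetical and are not taken from Trott and Jackson (1967); the sketch assumes only that the content score counts responses in the keyed direction and that the acquiescence score applies an all-true key.

```python
from typing import Sequence, Tuple


def content_and_acquiescence(
    responses: Sequence[bool], keyed_true: Sequence[bool]
) -> Tuple[int, int]:
    """Score one balanced scale for one respondent.

    responses[i]  is True when item i was answered "true".
    keyed_true[i] is True when item i is true-keyed for the content trait.
    """
    if len(responses) != len(keyed_true):
        raise ValueError("responses and keys must have the same length")
    # Content score: number of responses given in the keyed direction.
    content = sum(r == k for r, k in zip(responses, keyed_true))
    # Acquiescence score: an all-true key, i.e., the number of "true" answers.
    acquiescence = sum(responses)
    return content, acquiescence


# Hypothetical six-item scale: three true-keyed and three false-keyed items.
keyed_true = [True, True, True, False, False, False]
responses = [True, True, False, True, True, False]
print(content_and_acquiescence(responses, keyed_true))
```

Because the key is balanced, a respondent who answers "true" to every item obtains the maximum acquiescence score but only a midrange content score, which is the separation the design is intended to achieve.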
The critical role of content saturation was 
also highlighted in a pair of studies (Jack- 
son & Lay, 1968; Morf & Jackson, in press): 
In the first study, separate factors were 
defined by marker variables for desirability, 
acquiescence, and adjective endorsement, 
but no substantial loadings were obtained 
on response-style factors for highly satur- 
ated PRF items, which defined only specific 
content dimensions. However, when Morf 
and Jackson selected PRF items which, 
although showing highest correlations with 
their own scale, did not correlate highly 
enough to be included in the PRF, every 
PRF subscale had a loading on at least one 
response style factor and on one content 
factor. In addition to stylistic factors 
marked by adjective checklist scales and 
PRF Desirability scales, Morf and Jackson 
identified two factors representing distinct 
acquiescence processes. The first, identi- 
fied as true responding, represented, for 
example, double endorsement of both 
original and of reversed items from the 
same scale; the second, identified as item 
endorsement, involved the tendency to en- 
dorse items as self-descriptive regardless 
of direction of wording, as in responding 
true to a statement affirming the presence 
of a particular trait and false to a statement 
negating the presence of the trait. When 
these two factors were represented on 
orthogonal axes, 43 of the 44 scales fell in 
the predicted quadrant, a finding hardly 
to be expected by chance.’ Indeed, the 


Block (1971) has recently conjectured that the 
results from the Morf and Jackson study are due to 
capitalization on chance in the rotation of axes. 
This was tested by Jackson and Morf by correlating 
component scores generated by two separate factor 
analyses of two distinct parallel sets of the variables 
based on the original data. The same was done with 
random data. Whereas for the real data component 
[illegible] of these and similar findings may 
[illegible] mathematical formulation of 
a theoretical model (cf. Jackson, 1968; 
[illegible], 1970). It is important to note, 
however, that four very clear content 
factors emerged in the Morf and Jackson 
study, representing the four content scale 
[illegible] in the analysis. Thus, 
there were eight identifiable factors, four 
having to do with content and four with 
response style. The point of this analysis 
is not to prove that either content or 
response style was predominant but rather 
to demonstrate that by a properly designed study, 
one can identify each. 
There are, of course, other response styles 
in addition to the four identified by Morf 
and Jackson. There is, for example, the 
[illegible] defensiveness dimension 
[illegible] in terms of MMPI Lie-scale- 
[illegible]. Here, two item properties, 
desirability and frequency of endorsement 
([illegible] Messick, 1969), operate in such 
a way that when there is a discrepancy 
between these two, it is inferred that the 
person is "lying." But evidence developed 
by Crowne and Marlowe (1964), Kogan and 
[illegible], and others would place a 
more sophisticated construct interpretation 
on this response style. 
A number of years ago, Sechrest and 
Jackson (1963) spoke of various forms of 
deviation, and one form that struck them 
as potentially interesting for investigation 
was [illegible] randomness of responding. In 
constructing the PRF, a variety of items 
were written which, while not written to 
[illegible], involved quite atypical behavior, 
[illegible] "I learned to repair watches in 
Switzerland," or "I visited the Republic of 
[illegible] during the past year." Subjecting 
these to item analyses, it was found that 
although their endorsement frequency was 
very low, they tended to show the conver- 
gent and discriminant properties expected 
of a content scale. Later, Bentler (1964) 
investigated this response style in predict- 
ing response stability when people were 
tested on different occasions, and he found 
that it accounted for approximately 50% 
of response variability for content scales. 
Bentler and Jackson (Jackson, 1970) also 
found that the test-retest reliability of a 
content scale proved to be an inverse 
function of the level of Infrequency scale 
score (Figure 4). This suggests that not 
only should one specify the reliability of a 
test, but also the reliability of samples of 
people, as relative reliability appears to 
be a stable property of people, as well as 
of scales (cf. Hendel & Weiss, 1970). 
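
The notion that reliability may be specified for samples of people, as well as for scales, can be sketched by computing test-retest correlations separately within strata of Infrequency scores. The data, the cutting score, and the function names below are hypothetical and serve only to illustrate the computation; they do not reproduce the Bentler and Jackson results.

```python
from math import sqrt


def pearson(x, y):
    """Product-moment correlation between two equal-length score lists."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


def retest_reliability(records):
    """Test-retest correlation for (infrequency, test, retest) records."""
    test = [t for _, t, _ in records]
    retest = [r for _, _, r in records]
    return pearson(test, retest)


# Hypothetical records: (Infrequency score, content score, retest score).
people = [
    (0, 1, 2), (0, 2, 3), (1, 3, 4), (1, 4, 5),  # low Infrequency scores
    (4, 1, 2), (5, 2, 1), (4, 3, 4), (6, 4, 3),  # high Infrequency scores
]

CUTOFF = 1  # assumed cutting score separating the two strata
low = [p for p in people if p[0] <= CUTOFF]
high = [p for p in people if p[0] > CUTOFF]
print(round(retest_reliability(low), 2), round(retest_reliability(high), 2))
```

In these constructed data the content scale is perfectly stable in the low-Infrequency stratum and considerably less stable in the high-Infrequency stratum, the direction of relationship described in the text.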
Rather than provide a complete catalog 
of response styles, it is hoped that by 
describing a few, one should avoid the 
tempting alternative of proceeding by 
ignoring, ostrichlike, the possible impact of 
stylistic variance. Although the response- 
style literature has become rather technical 
in the psychometric sense, the personality 
research worker need not be discouraged. 
Fortunately, the evidence pointing to the 
strong effect of content saturation in reduc- 
ing the impact of response style suggests 
that substantially defined scales which are 
purified by item-selection strategies seeking 
to maximize saturation can effectively sup- 
press the role of response style. The 
omnipresence of evaluative bias virtually 
requires that one of several statistical 
procedures (Neill & Jackson, 1970) for 
identifying items implicated in this form of 
bias be incorporated into any serious at- 
tempt at scale construction. Item and 
scale variance has finite limits; that ac- 
counted for by content saturation can 
powerfully reduce that traceable to stylistic 
factors. On the other hand, procedures 
such as empirical item selection, which are 
indifferent to content saturation, usually 
result in heterogeneous aggregates. When 
such heterogeneous items are summed, 
they yield scales reflecting largely what 
they have in common, namely, stylistic 
variance. Such scales fail to show really 
worthwhile levels of validity and are 
indistinguishable from many other similarly 
developed scales bearing quite different 
names and conceived for quite different 
purposes. 


MULTIVARIATE ANALYSIS OF STRUCTURE 


Perhaps it is appropriate after the discus- 
sion of response style to introduce a further 
principle. A componential, multivariate 
approach to assessment is essential to the 
study of the structure of scales in order to 
develop the properties of convergent and dis- 
criminant validity. One inevitably must 
consider the relationship between the mani- 
fest responses and the latent attribute 
that is being measured. In other words, if 
one wishes to infer something about an 
underlying trait, it is important to relate 
how a person responds to an item to the 
presence of the trait. As indicated above, 
there are a number of possible alternative 
models, including dynamic models and the 
classical univariate test theory approach. 
There are sophisticated approaches to 
homogeneity such as those outlined by 
Guttman (1950) and most recently by 
Bentler (1969). Univariate models gen- 
erally will not stand by themselves in the 
analysis of the heterogeneous item pool. 
If one important aspect of test scores in- 
volves the multiplicity of response styles 
that have already been described, quite 
apart from content, a mindless search for 
consistency among sets of heterogeneous 
items will typically yield evidence of the 
tendency to respond in a particular stylistic 
way, contributing spurious forms of homo- 
geneity and fulfilling the Meehl and Hatha- 
way prophecy. This is not to say that 
homogeneity is not important, but it re- 
quires a more sophisticated and more 
delicate balance of content and the balanc- 
ing of response styles in such a way that 
they do not importantly determine the 
structure of the total score. What is 
required is a careful specification of what 
is important to measure and what is not. 
If one is interested in measuring a single 
trait, both what the trait is and what it 
is not must be specified. A trait involving, 
for example, need for Social Recognition 
is different from one involving Exhibition 
in certain specific ways. If the conceptions 
underlying these definitions are accurate, 
these distinctions should be borne out in 
the structure of the item relationships and 
in the discriminant validity which will 
appear later. If the definitions of the traits 
are fuzzy or incorrect, these will also be 
reflected in item relationships. Similarly, 
if no clear distinction is made between a 
trait and a generalized tendency to respond 
undesirably, then it is likely that a number 
of such traits will show spuriously high 
intercorrelations and be inadequately dif- 
ferentiated. It is easy to make the error 
implicit in the multiple-correlation ap- 
proach. Some investigators, for example, 
have argued that because acquiescence and 
attitude content are correlated with a 
particular criterion, they are, therefore, 
justifiably incorporated within the same 
scale. But if one is interested in measures 
which reflect constructs, the indiscriminate 
attempt to blend in unknown proportions 
of components of content and of style 
inevitably will lead to excessively highly 
correlated scales, replete with method vari- 
ance and showing inadequate levels of 
discriminant validity (Jackson & Messick, 
1962). 
Much can be learned from an analysis 
of structure through the use of factor 
analysis. Factor analysis, however, may 
lead to an inappropriate focusing on char- 
acteristics either too broad or too narrow 
to be useful as explanatory constructs. 
Factor analysis at the item level often 
results in extremely specific, sometimes 
even trivial, exemplars of a trait. These 
trivial exemplars may or may not have 
general import in the description of per- 
sonality, depending on whether or not im- 
portant characteristics are represented in 
the item pool. 
[Figure 5 caption fragment: "... a function of number of items selected on the basis of ..."] 
At the item level, it has 
been shown psychometrically that there 
are [illegible] mathematical relationships be- 
tween the biserial and point-biserial correla- 
tions and the first centroid factor (Henrys- 
son, 1962), leading to a preference for the 
computationally simpler approach of using 
[illegible] item indexes of internal consis- 
tency. It is better, however, to incorporate 
[illegible] for taking the massive desirability 
dimension into account. A number of 
[illegible] approaches are possible. Re- 
cently, Neill and Jackson (1970) completed 
a study in which more than a dozen indexes 
of item selection were contrasted, several 
of which used unique rationales for sup- 
pressing the desirability factor. Figure 5 
presents a comparison of two of the indexes 
with a random strategy for selecting from 
a predefined substantive item pool in terms 
of reliability on a cross-replicated sample. 
The important conclusion was that, as with 
the PRF, it was possible very substantially 
to suppress the desirability dimension at 
the item level, and yet obtain high levels of 
content saturation with relatively short 
scales. Since that time, it has been possible 
to vastly elaborate item-selection proce- 
dures in a number of directions. Rather 
than use, for example, the total score as 
the criterion against which to select items, 
one may identify some component of the 
total score which is entirely independent of 
and uncorrelated with unwanted sources of 
variance, such as the desirability dimen- 
sion. It has also been possible to seek to 
maximize item variance and to seek to 
obtain maximum discrimination among re- 
lated scales. 
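
One way of identifying a component of the total score that is uncorrelated with desirability is to take the residual from the linear regression of the total score on a desirability score and to use that residual as the criterion for item selection. The sketch below assumes this particular residualizing procedure, which is not necessarily the index used by Neill and Jackson (1970); all names and data are hypothetical.

```python
from math import sqrt


def pearson(x, y):
    """Product-moment correlation between two equal-length score lists."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


def residualize(score, nuisance):
    """Return the part of `score` that is linearly independent of `nuisance`."""
    n = len(score)
    ms, mn = sum(score) / n, sum(nuisance) / n
    slope = sum((s - ms) * (d - mn) for s, d in zip(score, nuisance)) / sum(
        (d - mn) ** 2 for d in nuisance
    )
    return [(s - ms) - slope * (d - mn) for s, d in zip(score, nuisance)]


# Hypothetical data for six respondents.
total = [10, 12, 9, 15, 11, 14]      # provisional total scale scores
desirability = [3, 5, 2, 6, 4, 4]    # desirability scale scores
items = {                            # dichotomous item responses
    "item_1": [0, 1, 0, 1, 0, 1],
    "item_2": [0, 1, 0, 1, 1, 0],
    "item_3": [1, 0, 1, 1, 0, 1],
}

# Criterion: total score with the desirability component removed.
criterion = residualize(total, desirability)
index = {name: pearson(resp, criterion) for name, resp in items.items()}
ranked = sorted(index, key=index.get, reverse=True)
print(ranked)
```

Items are then retained in order of their correlation with the residualized criterion, so that an item cannot earn a high index merely by sharing desirability variance with the total score.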
This latter point raises a particularly 
interesting question. Factor analysis, in 
the tradition of Thurstone and Cattell, has 
as one of its major alleged advantages the 
imposition of a more parsimonious inter- 
pretation among a set of statistical vari- 
ables. In the case of personality variables, 
the hope is that one can reduce the rank 
of a correlation matrix in such a way that 
the entire matrix can be approximated by 
a matrix of smaller rank. Factor-analytic 
investigators have traditionally employed 
only a single method of measurement and 
have sought to identify the structure of 
personality among a set of personality 
scales. They might find, for example, that 
scales measuring Achievement and Endur- 
ance define one pole of a dimension and a 
scale reflecting Play defines the opposite 
pole. The personality theorist might object 
that these variables have a good deal of 
specificity which might be useful in mea- 
surement. But it is an element of strong 
faith on the part of the factor-analytic 
investigators that the truth is to be identi- 
fied in what is common among variables, 
rather than what is unique. Inevitably, 
however, content and method variance 
within a single method of measurement 
usually have a similar structure (Campbell 
& O'Connell, 1967; Jackson, 1969b). This 
causes variables to correlate much more 
highly within a given method of measure- 
ment than they should, resulting typically 
in distorted factors often misrepresenting 
seriously the nature of the structure of 
psychological variables. This led to a need 
for developing procedures which would 
yield many, rather than few, factors and 
would focus on heteromethod validities. 
There are both sound psychological and 
psychometric reasons for preferring factor- 
analytic solutions which allow appropriate 
distinctions between method and trait fac- 
tors and between theoretically distinct 
traits, even if this requires the sacrifice of 
maximum parsimony. For example, apply- 
ing multimethod factor analysis (Jackson, 
1969b) to PRF parallel forms, precisely 22 
factors emerged, one for each trait, with 
only the appropriate pair of scales salient 
on each factor. In a similar manner, it was 
possible to show that 18 factors accounted 
for 20 traits in a space defined by PRF 
scales, peer behavior ratings, and self- 
ratings (Jackson & Guthrie, 1968). 

It is possible to carry this line of reason- 
ing considerably further and to suggest that 
factor-analytic investigators who have 
sought to identify reality within the struc- 
ture of correlations between tests might 
better be advised to seek it in the reliable 
but specific variance within scales, keeping 
in mind the problems of method variance. 
Clearly, if one wishes to maximize the 
predictability of a battery, entirely uncor- 
related tests would be appropriate (cf. 
Guttman, 1958). We urgently need 
methods of analysis which will aid us in 
approaching this ideal. This, as well as 
theoretical distinctiveness, will foster the 
kinds of convergent and discriminant prop- 
erties of tests which Campbell and Fiske 
(1959) have considered so important. With 
these requirements in mind, it is clear that 
efforts should be directed not only at the 
evaluation of tests, but just as properly at 
a program of test construction focusing at 
every stage on convergent and discriminant 
validity. 


OVERVIEW 


What can be said then about the picture 
that has been drawn of the dynamics of 
personality assessment with the benefit of a 
quarter of a century of further experience? 
In one sense, the picture is more compli- 
cated than the simplistic and perhaps 
mechanical model proposed in an earlier 
era. Personality items may be conceived 
of as eliciting a combination of a number 
of sources of variance including the fol- 
lowing: content variance appropriate to the 
trait or disposition being measured, content 
variance associated with irrelevant dimen- 
sions, method variance, specific variance 
unique to the item, stylistic variance, and 
various interactions between stylistic and 
content variance in which, for example, 
content is highlighted by a particular for- 
mat, or vice versa. Items can thus be 
viewed as differing in content saturation 
and in their stylistic-eliciting potential, 
while persons can be conceived of as differ- 
ing in sensitivity to item content and to 
stylistic features, as well as showing varying 
positions on latent dimensions. 
On the other hand, however, the picture 
may be considered simplified. One need 
not abandon as unscientific his unique 
capacity to judge and weigh content. One 
need not go out and seek a new properly 
sampled criterion group or groups every 
time there is a very particular criterion to 
predict, nor does one need to change per- 
sonality scales, like style in clothing, every 
year or two, as might be the case with 
empirically derived scales. While in one 
sense a response to a personality item bears 
the unique stamp of the person, it does not 
do so with equal precision for all of his 
traits. The Pandora's Box of blind empiri- 
cism in which a single item is keyed on a 
number of scales merely because it has 
been correlated with different criteria can be 
rejected as both inefficient and troublesome. 
While responses to personality items are 
multifaceted, as are the items themselves, 
there is no reason to abandon completely 
conceptions of trait and test theory as they 
have evolved since the beginning of scien- 
tific psychology. 
There is, indeed, reason to be satisfied 
with the enormous progress that has been 
made in the past quarter of a century in 
our understanding of the dynamics of per- 
sonality inventories, progress which in no 
small part is due to the efforts of individ- 
uals like Paul Meehl, who proceeded opti- 
mistically forward when many confidently 
advised that it could not be done. 
The conceptual framework of Coombs' data theory is used to highlight similari- 
ties and differences between two representative but different discrepancy 
hypotheses: McClelland's hypothesis and Dember and Earl's complexity- 
discrepancy theory. It is concluded that mapping of data into the models 
cannot be done unambiguously unless assumptions are made concerning as 
yet unspecified parameter values and properties of data. Two different re- 
search strategies are discussed. One two-stage strategy with simplifying 
assumptions is suggested as a possible approach toward quantitative specifica- 
tion of model parameter values. 


Considerable interest has recently been 
shown in discrepancy hypotheses as theo- 
retical explanations for motivational pro- 
cesses in diverse content areas. These areas 
include intellectual development (e.g., 
Hunt, 1965), verbal processes (Kammann, 
1966), exploratory behavior (e.g., Walker, 
1964), and environmental psychology 
(Wohlwill, 1966). Discrepancy hypotheses 
share conceptual similarities with various 
inverted-U hypothesis formulations (e.g., 
Schroder, Driver, & Streufert, 1967; Vitz, 
1966) which state that behavioral measures, 
often affective in nature, are nonmonotonic 
[illegible] functions of stimulation. Con- 
ceptually related to inverted-U hypothesis 
formulations is the notion that there is an 
optimal level of stimulation which is most 
pleasant or most effective in mediating be- 
havior (e.g., Fiske & Maddi, 1961). Both 
of these general notions are very old and 
go back at least to Wundt in 1874 (Berlyne, 
1960, p. 201). Discrepancy hypotheses are 
consistent with various inverted-U formu- 
lations, but they go further and tend to 
emphasize the behavioral effects of an 
optimal discrepancy or difference between 
some residual (oftentimes momentary) 
organismic state and some source of stimu- 
lation. Early statements of discrepancy 
hypotheses which have been particularly 
influential include Hebb (1949), Dember 
and Earl (1957), and McClelland and Clark 
(1953). These statements remain as per- 
haps the most popular in the literature. 
This paper attempts to clarify distinc- 
tions between discrepancy hypotheses and 
to specify clearly underlying assumptions 
together with parameters which ultimately 
must be specified if the models are to be 
developed further. To achieve these goals, 
two models have been selected for primary 
consideration, McClelland’s hypothesis 
(McClelland & Clark, 1953) and Dember 
and Earl's (1957) complexity-discrepancy 
model. Emphasis is placed on these models 
primarily because they appear to be the 
most well-stated positions. However, it is 
argued that the models still possess too 
many unspecified free parameters for un- 
ambiguous mapping of data into the 
models, or for unambiguous predictions. 
Thus, whether data are said to support 
(e.g., Kammann, 1966) or fail to support 
(e.g., Verinis, Brandsma, & Cofer, 1968) 
a discrepancy hypothesis often boils down 
to differences in assumptions concerning 
the nature of data and selection of model 
parameter values. It is argued that there 
are equally plausible and perhaps more 
parsimonious alternative interpretations of 


such data. 
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The terminology of data theory (Coombs, 
1964) has been selected as a common con- 
ceptual framework within which to view 
similarities and differences between models. 
This paper begins with a consideration of 
the models as viewed within this framework 
after which difficulties of making unam- 
biguous predictions from the models are 
considered. Most research relating to 
discrepancy hypotheses and other inverted- 
U hypothesis formulations is characterized 
as qualitative, because research strategies 
have been primarily concerned with finding 
some nonmonotonic inverted-U function 
(e.g., Haber, 1958; McCall & Kagan, 1967; 
Vitz, 1966). Problems of this approach 
and limitations of data interpretation are 
considered. Since qualitative procedures 
cannot be used to specify discrepancy 
model parameters in the absence of 
stronger, probably unrealistic assumptions, 
a two-stage procedure for quantitatively 
estimating parameters is described. 


DISCREPANCY HYPOTHESES AS QIa 
DATA MODELS 


As viewed here, the conceptual notion of 
discrepancy implies an operationally defin- 
able metric or distance. Specifically a dis- 
crepancy hypothesis is defined as some 
measurable real distance between an or- 
ganismic state and some hypothetical 
optimally situated stimulus most effective 
in mediating behavior. Stimuli at other 
than an optimal distance from the organism 
have different behavioral effects. Thus, 
discrepancy hypotheses should be viewed 
as another class of models which attempt 
to represent behavior within a spatial 
framework. At present, however, theo- 
retical formulations have been implicitly 
restricted to consideration of behaviors 
which may be represented in a single di- 
mension. Discrepancies may be repre- 
sented by distances along a line. 

A common conceptual (and spatial) 
framework for considering the hypotheses 
is Coombs’ (1964) theory. All forms of the 
discrepancy hypotheses appear to corre- 
spond with what Coombs calls QIa models 
for preference data. In QIa, subjects (Ss) 
and stimuli are mapped into, and may be 
represented as, points within the same 
psychological space called a joint space. 
The basic notion is that an S will order his 
preference for stimuli as some function of 
the S's distances from the stimuli in this 
space. 

Figure 1 illustrates three possible prefer- 
ence functions for the unidimensional case. 
Figures 1A and 1B represent the two differ- 
ent discrepancy hypotheses and illustrate 
that discrepancy hypotheses vary in stating 
how these preferences will be ordered as 
a function of S-stimulus interpoint dis- 
tances. Figure 1C is considered later. In 
each figure, the abscissa is a J or joint scale 
of Ss and stimuli. A, B, and C represent 
three hypothetical stimulus points with 
ordinates located at unequal distances 
along the J scale. Three stimuli are suffici- 
ent to illustrate the problems to be dis- 
cussed. Following Coombs, the location 
of individuals (or groups of Ss) will be 
called I or ideal locations. These locations 
may be thought of as adaptation levels. 
While stimulus points and I locations were 
selected for illustrative purposes, they are 
probably typical of experimental situations. 
In Figure 1, the heights of the stimulus 
ordinates represent preference. Preference 
is interpreted in a specific way. It repre- 
sents a response by an S to stimuli where 
the stimuli are evaluated by being refer- 
enced to S's own internal standard. A 
preference of steak over chicken or Camels 
over Winstons would be examples. Ex- 
amples of preference measures used in 
studies to evaluate discrepancy hypotheses 
include, as measures of high preference, 
"looked at longest," "played with the 
most," "liked," while low preference would 
be, respectively, "looked at least," 
"avoided," "disliked" (e.g., Earl, 1957; 
Haber, 1958; McCall & Kagan, 1967). 
Preference represents the dependent vari- 
able which typically will vary from one 
content area to another. However, not 
all measures called preference are QIa 
data. For example, preferences for lots of 
money or high grades are probably evalu- 
ated with respect to an absolute external 
standard and are not preference data as 
viewed within data theory. 


[Figure 1 appeared here: three panels (A, B, C), 
each plotting preference (Lo to Hi) against 
the J scale.] 

FIG. 1. J-scale preference functions for three 
different [illegible] (A = McClelland's 
discrepancy hypothesis; B = Dember and Earl's 
complexity-discrepancy theory; C = Coombs' 
preference function). 

Figure 1A represents McClelland and 
Clark's curve relating preference for stimuli. 
The curve shown in Figure 1A is arbitrary 
but satisfies their conditions. It is sym- 
metrical, nonmonotonic, an inverted double 
curve (McClelland & Clark, 1953, p. 43). 
Relative preference is specified by the 
height of the ordinates. For [illegible] 
the preference order relation is ACB: A is 
preferred most, and B least. The 
preference order relation is indicated 
directly below I’. While preserving the 
shape of the curve it may be moved to 
the right or left such that the "trough" 
coincides with [illegible]. A number 
of possible [illegible] points along 
the [illegible]. From among [illegible] 
have been selected [illegible] indicated. 
These I's [illegible] exhaust [illegible] 
order relations. Thus, shifting the curve 
slightly to the right would [illegible] 
CBA, CAB, etc. Slight changes in curve 
location do not necessarily produce 
different preference 
order relations because preference order 
relations are associated with discrete re- 
gions along the J scale. 

Figure 1B represents Dember and Earl's 
(1957) preference function. Their formu- 
lation is of central importance to theories 
developed by Sackett (1965) and Walker 
(1964) and appears very similar to the 
theory suggested by Fowler (Fowler, 1967, 
p. 219). For Dember and Earl, the J 
scale is defined as a complexity scale, 
They predict that highest preference is for 
a slightly more complex "pacer" stimulus, 
with remaining stimuli being preferred in 
terms of their similarity or distance from 
the pacer. This implies a symmetrical 
curve peaking at the optimum hypothetical 
pacer with I some unknown distance below 
this peak. As illustrated in Figure 1B, 
the curve is nonmonotonic inverted U- 
shaped (or V-shaped) for stimuli more 
complex than individuals at I’ and, below 
I’ or to the left, a continuously falling-off 
curve. Figure 1B is interpreted in the 
same way as Figure 1A. For I’ the 
preference order relation is BCA. Dis- 
placing the curve to the right or left but 
preserving the same peak-to-I’ distance 
produces different preference order relations 
as indicated. Figures 1A and 1B may be 
directly compared because five of the I 
locations are identical. 


DISCREPANCY MODELS AND THEIR 
AMBIGUOUS DATA EXPECTATIONS 


Before considering their predictive diffi- 
culties, it needs to be stressed that dis- 
crepancy models as currently developed 
assume that the responses to stimuli 
satisfy the criterion of a unidimensional 
ordered scale. The assumption is crucial. 
If it is not satisfied, comparisons among 
stimuli are difficult to make, since it could 
be maintained that Ss were responding to 
the stimuli as if they were represented on 
different dimensions. In fact, it is argued 
below that this is a reasonable interpreta- 
tion for many peculiarities in data. Thus, 
unidimensionality is a necessary prerequi- 
site for interpreting data within discrepancy 
models. Yet, as far as is known, the as- 
sumption has not been treated as a testable 
hypothesis to be answered in data in any 
published study designed to evaluate any 
discrepancy hypothesis. The problem is 
particularly troublesome given the trend 
toward selection of heterogeneous stimuli 
such as paintings of modern art (Wohlwill, 
1968) and poetry (Kammann, 1966). That 
responses to such multiattribute stimuli 
satisfy unidimensionality is an assumption 
which appears difficult to defend on rational 
grounds. 

That which follows requires that the 
assumption of unidimensionality hold. 

Regardless of methodology, that is, the 
dependent variable measured, or the ex- 
perimental unit of interest (individuals or 
groups), or whether stimuli are presented 
singly or in sets, to compare data with a 
discrepancy model, the data must ulti- 
mately be reduced to preference order 
relations. For the present, it makes no 
difference whether the McClelland hy- 
pothesis is viewed with an eye toward pre- 
dicting preference order relations, or 
whether the data are to be mapped into 
the model in a post hoc manner. The 
outcome is the same. 

Figure 1A illustrates that under con- 
ditions of the McClelland hypothesis, if 
the I locations of the Ss were unknown, 
or were expected to be different for different 
Ss, any preference order relation is possible. 
Thus, all responses would fit the model 
unless there is some a priori way of knowing 
which preference order relations were more 
probable. Furthermore, Ss could have 
different I locations and produce the same 
preference order relations. Thus CBA 
could be obtained from Ss in two different 
noncontiguous regions of the J scale. 

Suppose, however, as in some experimental 
procedure, the Ss’ I location could be 
specified. Let that location be I’. Then 
preference order relation ACB would be 
expected. But note this preference order 
relation is not invariant for this particular 
I location. If changes were made in the 
shape of the curve, or there were changes 
in the interpoint stimulus distances from 
those which were arbitrarily specified in 
Figure 1A (both changes can easily be 
visualized), then again it is not clear which 
preference order relations to expect. Not 
always, of course, would all possible 
preference order relations fit the model 
when many stimuli were used. The point 
to be emphasized, however, is that knowing 
which preference order relations to expect 
cannot be determined in the absence of 
other stimulus distance relations and model 
parameter values which have never been 
specified. 
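
The argument that a double-peaked curve admits every preference order relation when I locations are unknown can be checked with a minimal sketch. The stimulus locations, the optimal discrepancy of 1.2 units, and the assumption that preference falls off with the absolute departure of the S-stimulus distance from that optimum are all hypothetical; they are not the values underlying Figure 1A.

```python
def preference_order(ideal, stimuli, optimal_discrepancy):
    """Rank stimuli from most to least preferred for an S located at `ideal`.

    Preference is assumed to be highest when the S-stimulus distance equals
    `optimal_discrepancy` and to fall off symmetrically on either side of it,
    which yields a symmetrical double-peaked curve with a trough at `ideal`.
    """
    def shortfall(name):
        return abs(abs(stimuli[name] - ideal) - optimal_discrepancy)

    return "".join(sorted(stimuli, key=shortfall))


# Three hypothetical stimulus points at unequal distances along the J scale.
stimuli = {"A": 0.0, "B": 1.0, "C": 3.0}

# Sweep the I location along the J scale in steps of 0.1.
orders = {preference_order(k / 10, stimuli, 1.2) for k in range(-30, 61)}
print(sorted(orders))
```

Under these assumed values all six orders of three stimuli occur somewhere along the J scale, and CBA is produced both at I = 1.9 and at I = 4.0, with BCA obtained at the intervening location I = 2.5.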

Consider next the Dember and Earl 
hypothesis, illustrated in Figure 1B. When 
I locations cannot be specified, the prefer- 
ence order relations compatible with the 
model are restricted, in the case of three 
stimuli, to four of the six possible preference 
order relations. Both CAB and ACB are 
incompatible with this model. This fact is 
considered below when the Coombs model 
is discussed. Here it may be noted that
data expectations for single-peaked curves 
are easier to predict than those from 
double-peaked curves. 

For preferences to the right of I’ in
Figure 1A and 1B, no conceptual distinction
can be made between the two models, and
they share common problems. Comparing
data expectations for the stimuli to the
right of the I’ locations in Figure 1A and
1B illustrates difficulties when shapes are
changed. In Figure 1A, at I’, C is preferred
over B, but the reverse is true in Figure
1B. Furthermore, if the curve shape is
preserved in Figure 1B but the I’-to-curve-peak
(pacer) distance is changed, the
preference order relation changes. This can
be visualized by shifting the curve peak
toward stimulus C, but keeping I’ fixed,
until for individuals at I’, preference order
relation CBA would be expected.

In summary, it can be stated that unambiguous
mapping of data into discrepancy
hypotheses or achieving unambiguous
predictions from discrepancy
hypotheses requires specification of the
following parameters: (a) curve shape,
(b) I location (or locations), and (c) distances
between stimuli and between stimuli
and I locations. Note that these parameters
must be specified jointly. In the
absence of parameter specification, almost
any data may be mapped into or found at
variance with the models by making different
assumptions about parameter values.
In addition to the above considerations,
the unidimensionality assumption must
hold.


QUALITATIVE TESTS OF 
DISCREPANCY HYPOTHESES 


Lacking estimates of parameter values 
concerning distances along the J scale and 
the shape of the preference curve, studies 
testing discrepancy hypotheses have been 
restricted to predictions regarded here as 
qualitative, because the predictions con- 
cern only the general form of the preference 
function and not its specific shape. Actu- 
ally, however, such predictions are not 
of central importance to discrepancy 
hypotheses. 

For example, in the case of McClelland’s
hypothesis, if an I can be specified it would
be expected that preference above or below
I should reveal progressive increases in
preference until a maximum is reached,
after which preference should continuously
fall off for stimuli located beyond this
maximum. Such a procedure requires
that I locations imbedded within a stimulus
series be specified to within an order-preserving
transformation, and that the
stimuli to one or both sides of I be assumed
to span a sufficiently broad range of the J
scale such that if a "peak" preference
exists, it will be revealed in the preference
order relations. It would have to be
assumed that the responses to the stimuli
satisfied unidimensionality. Since metrics
along the J scale and the curve shape
parameters are unknown, no specific preference
order relation concerning which stimulus
should be maximally preferred can
reasonably be predicted, although such
predictions have been made (e.g., McCall
& Kagan, 1967). Hypotheses regarding
distances from I to maximally motivating
optimal discrepancies, hypotheses which
are truly central to the models, would require
that distance between stimulus points
and between stimulus points and I locations
be known.
Given the prerequisite conditions, the
experimental procedure involves searching
for some kth stimulus to one side of I for
which preference is maximum. Preferences
for stimuli on both sides of k must fall off
monotonically (toward I for stimuli
to one side of k; away from I for stimuli
on the other side of k). This condition
defines a single peaked preference function.
Let N_a equal the number of stimuli above
I, and include the stimulus coincident with
I if any. In the Appendix it is shown
that the probability of finding a single
peaked preference function is

p(A) = \frac{2^{N_a - 1} - 2}{N_a!}    [1]

The numerator is the number of such
single peaked preference functions. The
denominator is the number of possible
preference order relations. The equation
is useful when there are no ties in preference
and for N_a ≥ 3. It assumes error-free
data, that is, no chance fluctuations in the
function relating preference with stimuli.
With moderate numbers of stimuli, the
probability of finding a single peaked
preference function given that the preference
order relations were randomly generated
is remote. For example, if N_a = 6,
p(A) < .05. Equation 1 would hold for
stimuli below I as well as for stimuli above
I for McClelland's hypothesis. Let N_b
equal the number of stimuli below I and
include the stimulus coincident with I, if
any. Assuming preferences above and
below I are independent, if p(A)·p(B) < α,
the McClelland hypothesis would have
qualitative support. With N_a and N_b > 3,
and the total number of stimuli > 7,
α < .05.

For Dember and Earl, Equation 1 would
hold for stimuli above I, but there is only
one preference order relation which falls
off strictly monotonically below I; its
chance probability is 1/N_b!.

Consider now studies which have made
qualitative tests of discrepancy hypotheses.
Haber (1958) adapted the hands of his Ss
to water temperatures, after which they
made preference judgments for water at
temperatures below and above their adaptation
levels (i.e., I locations). Usually
N_a = N_b = 5, and the form of the preference
curves obtained for each S were in
perfect harmony with the McClelland 
hypothesis. Haber's success can probably 
be traced to superior method considera- 
tions. His stimuli were psychologically 
simple; to control for judgment error he 
replicated both within and across Ss, and 
Haber carefully assessed his S's adaptation 
level before obtaining judgments. 
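
As a rough numerical illustration of Equation 1, the chance probabilities can be tabulated directly. The sketch below assumes the form p(A) = (2^(N_a − 1) − 2)/N_a! given in the Appendix and the independence of preferences above and below I stated in the text; the function names are illustrative only and do not come from the original study.

```python
from fractions import Fraction
from math import factorial


def p_single_peaked(n):
    """Chance probability of a single peaked order among n untied stimuli (Equation 1)."""
    if n < 3:
        raise ValueError("Equation 1 applies only for n >= 3")
    return Fraction(2 ** (n - 1) - 2, factorial(n))


def p_both_sides(n_above, n_below):
    """Joint chance probability, assuming the two sides of I are independent."""
    return p_single_peaked(n_above) * p_single_peaked(n_below)


if __name__ == "__main__":
    # Tabulate Equation 1 for small numbers of stimuli on one side of I.
    for n in range(3, 8):
        print(n, float(p_single_peaked(n)))
    # A design with five stimuli on each side of I, as in Haber's usual case.
    print(float(p_both_sides(5, 5)))
```

With five stimuli on each side of I, the joint chance probability is (14/120)^2, or about .014, so single peaked functions on both sides would be unlikely to arise from randomly generated preference orders.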

Verinis et al. (1968) report four studies
intended to provide data relative to Mc- 
Clelland's hypothesis. They conclude that 
each study fails to support it. The studies 
illustrate a number of problems regarding 
valid tests of discrepancy hypotheses in 
general, and interpretation of negative 
data in particular. For example, in their 
first experiment, college students read brief 
paragraphs concerning blind dates, college 
grades, Christmas job earnings, etc. These 
paragraphs were designed to establish an 
expectancy set (I). Afterward, Ss were 
provided with information at different
degrees of discrepancy from this set. The
Ss then evaluated on rating scales the pleasure
and motivation associated with the discrepant
information. In general, monotone
functions were found. The greater the 
positive discrepancy, the greater the 
pleasure. The greater the negative dis- 
crepancy, the greater the displeasure. 

Given the nature of the stimuli, this
finding would be expected. As viewed 
here, their stimuli called for stimulus com- 
parisons (i.e., Coombs QIII data), not QI 
or preference data. It is hardly surprising 
that college students were unhappy about 
the prospects of making any lower quiz 
grade, or were most happy making a lot 
more, rather than just a little bit more,
money on their Christmas jobs. Such 
judgments of money, grades, gas mileage, 
etc., are not preference data as viewed 
within data theory. 

This difficulty aside, are the data incompatible
with the McClelland hypothesis
as claimed? Actually, by so interpreting
the data, they are probably making unwarranted
assumptions about unknown
parameters. As stated, they found a monotone
function falling off below I. Consider
this finding in relation to Figure 1A. Assume
that their Ss were located at I’.
Suppose that the stimuli Verinis et al.
selected happened to have been located
below I’ and to the left of the peak of
maximum preference. Under these conditions,
preference would be expected to be
a decreasing monotone. By similar reasoning
it can be seen that preferences might be
an increasing monotone if the stimuli
selected happened to be, for example,
below I’ but to the right of the peak of
maximum preference.

Thus, failing to find a single peaked 
preference function cannot be taken as 
evidence against inverted-U (or V) hy- 
potheses unless strong, probably unrealistic 
assumptions are made concerning properties 
of data and model parameters. Studies
which report negative data have made such
assumptions (Block, 1964; McCall & 
Kagan, 1967; Verinis et al., 1968). 

Except for Haber's (1958) study, data
reported to be in correspondence with
discrepancy hypotheses or other inverted-U
hypotheses have typically shown more
than one and often several peaks in the
preference function (i.e., multiple peaks in
the curve to one side of the individual's I
location). Thus, the functions are often
complex with many fluctuations (e.g.,
Conners, 1964; Kammann, 1966; Verinis
et al., 1968). Two possible interpretations
can be given for these curve fluctuations
besides, of course, rejection of the discrepancy
hypothesis under test. The interpretation
which is implicitly accepted
is that the multiple inflections in the curve
are chance deviations from the “true”
function. This interpretation is difficult
to evaluate since there is no known error
distribution, a troublesome problem, but
one hardly unique to this situation. A
second interpretation is that the responses
were multidimensionally determined.

There are two reasons why this second 
interpretation is just as reasonable. First, 
the stimuli used in many studies are very 
complex multiattribute stimuli. As an
illustrative example, consider Kammann's
(1966) test of the Dember and Earl model.
Among the findings were preference functions
which were bimodal, a finding clearly
incompatible with the model under test.
But Kammann's stimuli were poems, which 
were probably responded to in no simple 
unidimensional manner. Other studies 
with similar difficulties include Conners 
(1964), McCall and Kagan (1967), Walker 
and Walker (1964), and Wohlwill (1968). 
A second reason for this interpretation is 
that Tversky and Russo (1969) have shown 
that when Ss are told the stimulus dimen- 
sion along which to respond, their judg- 
ments are still influenced by other stimulus 
properties. Thus, unless Ss are constrained 
in their preferences, or data are tested to 
evaluate the degree to which a unidimen- 
sional criterion is satisfied, the interpreta- 
tion of results when complex preference 
functions are obtained is unclear. 


SUGGESTED PROCEDURE FOR QUANTITA- 
TIVELY ESTIMATING DISCREPANCY 
HYPOTHESIS PARAMETERS 


An important and interesting problem 
concerning discrepancy hypotheses is the 
measurement of the discrepancy itself, 
that is, the distance between I and those
optimally situated stimuli presumably most 
effective in mediating behavior. 

Since there are too many free parameters 
to be initially estimated from data, it is 
suggested that distances among stimuli 
along the J scale be estimated by creating 
experimental conditions under which a 
Coombs function can be obtained. Once 
these parameters have been estimated, 
experimental conditions can be created 
under which it can be expected that dis- 
crepancy hypotheses will hold, and then 
the remaining parameters can be estimated. 
An empirical example is described below, 
but consider first the rationale for this 
suggested approach. What follows is 
based on Coombs' discussion of the unfold- 
ing model (Coombs, 1964, Ch. 5 and 9). 

A Coombs function is illustrated in 
Figure 1C. In this preference model, 
stimuli which are coincident with I, if any, 
are most preferred; the remaining stimulus
preferences correspond "to the rank order
of absolute distances of the stimulus
points from the ideal point [Coombs, 1964,
p. 80]." Although this change from the
discrepancy models seems slight, there are
two important implications of this strategy.
First, while the particular preference order
relations expected under discrepancy
models can vary with changes in the shape
of the preference curve, in this model
preferences are insensitive to changes in
shape. Provided the curve remains symmetrical
and continuously falls off about
I, the same preference order relations will
result. Second, while for both discrepancy
models some preference order relations can
imply different J-scale locations, here a
preference order relation has a unique
discrete region along the J scale. Under
this model, preference order relations
change only when a midpoint between two
stimuli has been crossed (see Figure 1C;
the dotted ordinates represent midpoints
between stimulus pairs).
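
The midpoint property can be made concrete with a small sketch of the unfolding rule quoted above, in which stimuli are ranked by absolute distance from the ideal point. The stimulus positions used here (A = 0, B = 1, C = 3) are hypothetical values chosen only for illustration, not the positions in Figure 1C, and the function names are likewise assumptions. The sketch also assumes that all pairwise midpoints are distinct.

```python
from itertools import combinations

# Hypothetical J-scale positions, chosen only for illustration.
stimuli = {"A": 0.0, "B": 1.0, "C": 3.0}


def preference_order(positions, ideal):
    """Rank stimuli by absolute distance from the ideal point (closest first)."""
    return "".join(sorted(positions, key=lambda s: abs(positions[s] - ideal)))


def midpoints(positions):
    """Midpoints between all stimulus pairs, in ascending order."""
    return sorted((positions[a] + positions[b]) / 2
                  for a, b in combinations(positions, 2))


def orders_along_scale(positions):
    """Preference order inside each region bounded by adjacent midpoints."""
    cuts = midpoints(positions)
    edges = ([min(positions.values()) - 1.0] + cuts
             + [max(positions.values()) + 1.0])
    # Probe the center of each region; the order is constant within a region.
    return [preference_order(positions, (lo + hi) / 2)
            for lo, hi in zip(edges, edges[1:])]


if __name__ == "__main__":
    print(midpoints(stimuli))
    print(orders_along_scale(stimuli))
```

For these hypothetical positions the sweep passes through ABC, BAC, BCA, and CBA in turn, so only four of the six possible orders occur, and ABC and CBA form the single mirror-image pair.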

Viewed somewhat differently, the 
Coombs model is both a scaling method 
and a scaling criterion. Its use as a scaling
criterion is seen as particularly important. 
As a criterion, the model permits an em- 
pirical test of the unidimensionality as- 
sumption. The primary criterion for 
unidimensionality (i.e., one-dimensional or- 
dinal representation) is transitivity of the 
(preference) order relation (Torgerson,
1958, p. 247). As a scaling criterion,
Coombs' model is an application of this 
test and requires transitive preferences. 
Given that preferences are transitive, a 
sufficient condition for rejection of a com- 
mon unidimensional J scale is finding 
more than one mirror-image preference 
order relation (Coombs, 1964, p. 86). In 
Figure 1C, this preference order relation 
is ABC. Its mirror image is CBA. Note 
that data collected under conditions of the 
McClelland hypothesis as illustrated in 
Figure 1A reveal three mirror-image 
preference order relations. However, the
Dember and Earl hypothesis, illustrated
in Figure 1B, reveals only one mirror-image
preference order relation, the same as under
Coombs' model. Viewed in this way, data
collected to test hypotheses of the Dember
and Earl model may be useful for answering
the unidimensionality question.


As a scaling method, under some condi- 
tions, Coombs' model can extract metric 
information from the data. These pro- 
cedures have been described by Coombs 
(1964, Ch. 5). 

Once these J-scale stimulus distances 
have been determined, experimental con- 
ditions can be created under which a dis- 
crepancy model can be expected to hold, 
and ordinate heights to be used to deter- 
mine the shape of the curve may be esti- 
mated from measures of strength of 
preference. 

Consider the following example. In 
studies of exploration (Thomas, 1969b), 
cats wandered for several minutes in a 
triangular box with three compartments. 
A different visual pattern stimulus was in 
each compartment. It was hypothesized 
that responses to the set of stimuli would 
satisfy a unidimensional criterion. Time 
spent in each compartment was used to 
specify, for the cat, a preference order 
relation; the more time spent in a com- 
partment, the more the pattern in that 
compartment was said to be preferred. 
One use of the preference order relations 
was to determine if the criterion for a uni- 
dimensional scale could be achieved. The 
hypothesis was supported.

No useful metric information can be 
extracted when three stimuli are used, so 
Coombs' model was used as a scaling 
criterion only. However, it was reasoned 
that by slightly adapting cats to selected 
stimuli, an I and stimulus point could be 
caused to coincide. When Ss are presented with
two choice stimuli, one on each side of I,
and Coombs’ hypothesis holds, Ss should
prefer the closer stimulus. Using this
procedure, distance order relations were
obtained and a behaviorally defined metric
was derived using Frank Goode's Δ solution
(Coombs, 1964). To test the validity of
the metric, additional cats were similarly
tested, but predictions were made as to
which of two stimuli should be preferred.
The predictions were supported (Thomas,
1969a).

In additional research, cats have been
adapted for longer periods of time with the
same stimulus in all compartments. Then
two choice stimuli were presented; the
third compartment contained the same 
adaptation stimulus. Amount of time 
spent with the choice stimuli has been used 
to estimate the height of the stimulus or- 
dinates. Knowing distances between the 
stimuli, discrepancies and parameters of 
the preference curve have been estimated. 
Strength of preference for different choice 
stimuli has been found to relate to length 
of adaptation—hardly a surprising finding. 
This suggests that discrepancy hypothesis 
preference curves are parameters, not 
constants. 


DISCUSSION 


The scaling approach suggested does 
have limitations, difficulties, and unstated 
assumptions, and it is important to consider 
some of them. An obvious practical diffi- 
culty is that the procedure is laborious. 
Yet there seems no obvious way to eschew 
the problem of parameter estimation since 
all parameters cannot be estimated jointly. 
Some conceptual simplification must take 
place which unfortunately increases em- 
pirical labor. 

One working assumption is that the J- 
scale distances obtained under one experi- 
mental procedure will not, from the S's 
viewpoint, "shrink" or “expand” when the 
experimental procedures are changed. This 
assumption may be false, but it can be 
tested experimentally. Another difficulty 
concerns Goode's Δ solution for numerically
representing the ordered metric data. The 
procedure requires arbitrary decisions, and
it is not clear if it is the best approach
for producing metrics sturdy enough to
withstand experimental verification. There
are other numerical procedures available.
The iterative-based procedures stemming
from Shepard's (1962) pioneering efforts
are obvious possibilities. These
procedures were originally developed for
similarities (QIV) data, not QI preference
data, but Klemmer and Shrimpton (1963)
have offered a modification of Shepard's
procedure for preference data.

One reason for suggesting that stimulus
distances be estimated under conditions
favoring a Coombs function is that the
Coombs model has no strong distributional
assumptions. However, for some situations,
the model may be inappropriate.
It is an individual differences model which 
in the usual application involves assem- 
bling different preference order relations 
from different individual Ss. The model 
would be useless if, for example, all Ss 
gave the same preference order relation. 
A more likely situation, however, is that 
with many stimuli the preference order 
relations for different Ss will be so varied 
that rejection of a common unidimensional 
J scale may be quite likely. Thus, the 
model is very vulnerable, that is, much 
data will probably not satisfy its condi- 
tions. Furthermore, it is not clear if the 
model is useful for grouped data. 

Although there are conceptual differences 
between discrepancy models and the 
Coombs model, it is likely that under 
some conditions data could not distinguish 
the Coombs model from discrepancy 
models. This would depend on the dis- 
tances separating stimuli, the I location, 
the depth of the "trough" as in Figure 1A, 
or the I’-to-pacer distance in Figure 1B,
etc. As an example, stimuli to which Ss
were adapted which presumably fixed the
S's I location were not, generally, most
preferred on choice trials as Coombs’
model would predict (Thomas, 1969a);
but this did not affect the hypotheses 
under test. 

The paper has focused on the conceptually
simple case of discrepancy models
as unidimensional models. It has been
argued that the question of unidimension- 
ality should be a hypothesis to be answered 
in data and not an untested assumption. 
When the data are examined to see if uni- 
dimensionality is satisfied, the question 
being asked is whether the data satisfy 
the axioms of an ordinal scale. While 
transitivity of the order relation is not the 
only property of ordinal scales (cf., e.g.,
Pfanzagl, 1968, p. 29 and Ch. 4), it is
probably the most important empirically 
testable property. However, it should be 
stressed that satisfying an ordinal criterion 
does not automatically reveal the stimulus 
attribute or attributes on which the psychological
dimension is based. It may be that
several different physical attributes are 
combined by the Ss into one psychological 
dimension. Or it might be that no obvious 
physical attributes underlie a single psycho- 
logical dimension. These are questions 
for experimental isolation. What is prob- 
ably more likely is the data will not satisfy 
a unidimensional criterion. (In the writer’s 
opinion, it is the probable failure of existing 
data to satisfy a unidimensional criterion 
that has caused the greatest single difficulty 
in evaluating from data the tenability of 
discrepancy hypotheses.) 

There are at least two possible solutions 
to the problem if a unidimensional criterion 
cannot be achieved. One approach is to 
expand discrepancy models into the multi- 
dimensional case. This, however, leads to 
a number of difficult problems. For ex- 
ample, in the case of spatial models, one 
problem concerns the nature of the space. 
Should it be Euclidean or non-Euclidean? 
Furthermore, the problem quickly becomes 
conceptually complex as the number of
dimensions increases beyond two or three.
As far as is known, only Earl (1961) has 
considered the joint effects of two dimen- 
sions within a spatial-motivational frame- 
work. His formulation is necessarily tenta- 
tive because experimental work on the 
problem is lacking. 

A second alternative would be to refor- 
mulate discrepancy hypotheses within a 
choice theory framework (cf., e.g., Greeno, 
1968). Some of the problems of spatial 
models would be eliminated, and choice 
theories may be able to handle more 
adequately than discrepancy theories psychologically
interesting multiattribute stimuli
such as symphonies, paintings, etc.


APPENDIX


Let the integers 1, 2, 3, ..., n represent the
N_a ordinally represented stimuli, and let k
be the maximally preferred stimulus. If k = 1
or n there could be no single peaked preference
function as defined above, for there could be
no strictly monotonic falling off on both sides
of k as required by definition. Thus to have
a single peaked preference function, k must be
some jth stimulus 2 through n − 1. To the
left of j there are L stimuli; to the right, R
stimuli; and preferences for these stimuli must
fall off strictly monotonically from j to the
left and right, respectively, as required by
definition of single peaked preference function.
This implies that there can be only one preference
order for the L left stimuli and one
preference order for the R right stimuli for
any jth stimulus. It is clear that as j varies,
L and R vary, and both L and R have limits
of 1 and N_a − 2, and L + R = N_a − 1.
These N_a − 1 stimuli may take any order for
any jth stimulus provided the L left and R right
stimuli retain their same order for that jth stimulus.
The number of possible sequences of the
N_a − 1 stimuli with this constraint is \binom{N_a - 1}{L}
(cf. Feller, 1968, p. 36, Example e). Suppose
j = 2; then L = 1, and R = N_a − 2. There
are \binom{N_a - 1}{1} possible orders for the N_a − 1
stimuli. For j = 3 there are \binom{N_a - 1}{2}
possible orders for the N_a − 1 stimuli. Generalizing
and summing yields

\sum_{L=1}^{N_a - 2} \binom{N_a - 1}{L},

which is algebraically equivalent to 2^{N_a - 1} − 2,
the numerator of Equation 1.
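
The count derived above can be checked by brute force for small numbers of stimuli. The sketch below is an illustrative check rather than part of the original derivation: it represents each preference order as a permutation of strengths assigned to the ordinally positioned stimuli, counts those that rise strictly to an interior peak and then fall strictly, and compares the count with 2^(N_a − 1) − 2. The function names are assumptions introduced here.

```python
from itertools import permutations


def is_single_peaked(prefs):
    """True if preference strengths rise strictly to an interior peak, then fall."""
    k = prefs.index(max(prefs))
    if k == 0 or k == len(prefs) - 1:
        return False  # a peak at either end has no falling off on both sides
    rising = all(a < b for a, b in zip(prefs[:k + 1], prefs[1:k + 1]))
    falling = all(a > b for a, b in zip(prefs[k:], prefs[k + 1:]))
    return rising and falling


def count_single_peaked(n):
    """Number of single peaked preference functions over n ordered stimuli."""
    return sum(is_single_peaked(p) for p in permutations(range(n)))


if __name__ == "__main__":
    for n in range(3, 8):
        # Enumerated count beside the closed form from the Appendix.
        print(n, count_single_peaked(n), 2 ** (n - 1) - 2)
```

Enumeration is practical only for small n, since the number of permutations grows as n!, but it is sufficient to confirm the closed form over the range of stimuli used in the studies discussed.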


(Received June 22, 1970; revision 
received December 14, 1970)


THEORETICAL NOTES


COMMENTS ON BIEDERMAN'S
"CONTINUITY THEORY REVISITED:
A FAILURE IN A BASIC ASSUMPTION"


DANIEL B. BERCH1 
California State College, Los Angeles 


Biederman has argued that a basic assumption of Spence's discrimination
learning theory is incorrect. It is suggested that one can account for
Biederman's data on the basis of the stimulus generalization of habit while
maintaining that the inhibition associated with the negative stimulus increases 
monotonically over the course of learning. It is concluded that Biederman's 
evidence is insufficient for rejecting Spence's theory. 


Biederman (1970) has suggested that the 
validity of Spence's (1936) theory is suspect 
because it incorporates an incorrect assumption 
regarding the growth of habit strength in dis- 
crimination learning. Biederman's argument is 
based primarily on evidence from a series of 
experiments in which subjects were given con- 
current but unequal amounts of training on two 
simultaneous discrimination problems, followed 
by a choice between the overtrained and non- 
overtrained negative stimuli. After relatively 
few training trials, the subjects showed a pref- 
erence for the overtrained negative stimulus 
(Biederman, 1967, 1968; Deutsch & Biederman, 
1965). According to Biederman, this finding 
suggests that in discrimination learning, the 
negative stimulus (S—) loses its aversiveness 
as training proceeds, that is, that the inhibition 
associated with the S— is nonmonotonic. 

In the first place, Biederman's use of the term 
"overtraining" is inappropriate. Overtraining 
has conventionally referred to the administra- 
tion of additional trials beyond attainment of 
some specified criterion. By this standard, 
most of the "more trained" problems in Bieder- 
man's experiments would not be regarded as 
overtrained; indeed, some cannot even be con- 
sidered mastery trained. 

Second, Biederman states that "the basic
assumption" of Spence's (1936) theory which
is incorrect is that habit strength increases
monotonically with number of reinforcements.
However, the bulk of Biederman’s evidence,
that which shows a preference for the more
trained S−, has no bearing on this assumption,
since responses to the negative stimulus are
never reinforced in the simultaneous discrimination
problem.


1 Requests for reprints should be sent to Daniel
B. Berch, Department of Psychology, California
State College at Los Angeles, 5151 State College
Drive, Los Angeles, California 90032.

Third, although Biederman’s evidence is rele- 
vant to Spence's assumption that the inhibition 
associated with the S— is a monotonic func- 
tion of the number of nonreinforced trials, 
Biederman incorrectly translates this assump- 
tion to avoidance terms. Part of Biederman’s 
confusion stems from his failure to recognize 
that Spence viewed inhibition as producing a 
decrement in the approach tendency rather than 
an increment in an avoidance tendency. A 
careful reading of Spence’s (1936) article 
makes this distinction readily apparent. 

Fourth, and most important, an alternative
interpretation of Biederman’s data is possible.
Namely, the more trained S− may be preferred
because it possesses a stronger generalized
habit (approach) tendency than the less trained 
S—. This should occur as a result of the 
stronger habit tendency conditioned to the more 
trained positive stimulus [...]
assuming that inhibition [...]
[...]tion, the more trained S− should be preferred
only when its [...]
the degree of similarity of the nonspatial
[...] within each of the discrimination problems.

[...] analysis demonstrates that
one can account for Biederman’s results while [...]

It should be noted that Biederman (1968)
has already rejected generalization of habit as 
an explanation of his results, because “the dis- 
criminative performance might be retarded 
[p. 462]." However, the fact that generaliza- 
tion of habit from S+ to S— would impede 
acquisition of the original discrimination clearly 
does not impugn the validity of this process as 
an explanation of the probe-trial findings.

It must be concluded that Biederman's evi- 
dence does not constitute a sufficient basis for 
rejecting Spence's (1936) theory. Inasmuch 
as Biederman’s probe-trial technique may re- 
flect in part the generalized habit tendency of 
the negative stimulus, this measure may prove 
to be inadequate for evaluating the monotonicity
of inhibition per se.


HYPOTHESIS ANALYSIS OF CONJUNCTIVE
CONCEPT-LEARNING SITUATIONS 


LANCE A. MILLER1


IBM Thomas J. Watson Research Center, Yorktown Heights, New York 


For a taxonomy of conjunctive concept-learning situations, mathematical
expressions are presented giving the number of possible hypotheses concerning
the problem rule as a function of the task informational variables. The number
of initially possible hypotheses is shown to be a function of preproblem
knowledge of the number of relevant dimensions and the type of initial focus
stimulus, as well as the problem parameters. During problem solution, reduction
in the number of possible hypotheses is shown to be a function of the
interstimulus similarity. Optimal stimulus selection procedures and expressions
for the expected trial of solution are derived for various task situations
involving the selection paradigm.

The purpose of this paper is to present the 
analytical means for determining the number
of possible concept rules, or hypotheses, in
concept-learning situations as a function of
the problem parameters, pretask information,
and the stimulus sequences. This work is
essentially a development and augmentation
of the original contribution of Hovland (1952),
who was one of the earliest proponents of an 
information-processing or “hypothesis search” 
point of view. However, in the present treat- 
ment, only conjunctive rules are considered, 
while Hovland was also concerned with a 
kind of disjunction rule (where any of several 
values of a particular dimension could char- 
acterize positives). This work subsumes both 
Hovland's concern for a hypothesis descrip- 
tion of situations in which the subjects know 
in advance the number of relevant dimensions, 
and also that section in Bruner, Goodnow, and 
Austin's (1956) work investigating human 
concept learning, where mathematical con- 
siderations were given to a situation of prior 
ignorance of this information. 

The following notations are employed throughout:

N_h = the number of logically possible (equiprobable)
      hypotheses concerning the correct conjunctive rule.
U(N_h) = the amount of hypothesis uncertainty
      (log_2 N_h, in bits).
n = the number of nonredundant stimulus dimensions.
v = the number of values representing a dimension
      (equal for all dimensions).
r = the number of relevant dimensions (dimensions for
      which positives have the same value) in the
      experimental rule (1 ≤ r ≤ n).
d = in a comparison of two stimuli, the number of
      dimensions having different respective values
      (0 < d < n).


1 The author thanks Robert C. Haygood, Michael
I. Posner, and Robert W. Schvaneveldt for their
helpful comments and criticisms on an early draft
of this paper. Appreciation is also expressed to
Robert C. Calfee and Jane Morantz for their helpful
discussion concerning mathematical derivations.

Requests for reprints should be sent to Lance A.
Miller, Thomas J. Watson Research Center, Yorktown
Heights, New York 10598.


N_h AT THE OUTSET OF A CONCEPT-
LEARNING PROBLEM


Two types of information may be supplied to subjects in the pretask situation which determine the initial value of $N_h$, given that subjects know the rule is conjunctive. One type of information is that relating to problem complexity. Assuming that subjects are always informed of the number of stimulus dimensions (n) and the number of values per dimension (v), this type of information refers to whether subjects know in advance the number of relevant dimensions (r). When subjects know r in advance, there is Relevance Certainty (RC). This condition was assumed in Hovland's analyses. When r is unknown, there is Relevance Uncertainty (RU), and most experimental investigations have involved this condition. The second type of pretask information is the presentation to the subject of an initial classified "focus" stimulus (cf. Bruner et al.). The three alternatives with respect to this variable are (a) positive focus, (b) negative focus, and (c) no focus (performance is measured from presentation of the first stimulus).

The three focus conditions, combined with the two conditions of RC and RU, yield six pretask conditions. Expressions for $N_h$ under these conditions are presented in Table 1, giving $N_h$ as a
function of the problem parameters. [...]

With respect to the RC-RU variable, examination of the expressions makes possible the following predictions for human concept learning, assuming that the concept-learning process involves an active search for the hypothesis, and where the same hypothesis strategy is employed for both conditions: (a) performance should be better for RC than for RU; (b) manipulation of r should affect performance only for RC; (c) [...] trials should be maximum for conditions of RC which maximize $N_h$; and (d) [...] conditions equated with respect to [...] of the problem parameters. While RC-RU comparisons are of theoretical interest in view of the analyses, this variable has been employed in only one known study, where general support for these predictions was found, although this variable apparently exercised some control over the strategies selected by the subjects (Miller, 1969).
Examination of the formulae for the three focus conditions also reveals some interesting characteristics. Not only is the positive focus most effective in reducing the set of possible hypotheses, but also only for this condition is $N_h$ independent of v, and thus independent of the number of stimuli (when manipulated by variation of this parameter). Values of $N_h$ for a negative focus and no focus depend primarily on v and tend to converge with increasing v. Of these two cases, however, the no focus condition is of least practical interest, since it resolves into either the positive focus or the negative focus situation after presentation of the first stimulus. However, the negative focus condition is of theoretical interest for investigations concerned either with the utilization of negative information in problem solution (e.g., Cahill & Hovland, 1960), or the procedures adopted by subjects in the selection paradigm, where subjects select stimuli to learn of their classification prior to the first positive instance (e.g., Miller, 1969).

The two preproblem variables may also be directly compared in terms of the initial value of $N_h$ resulting for various combinations of these variables. In general, a positive focus under RU results in a smaller value of $N_h$ than for RC with no focus, and this difference increases with [...]. The primary exception occurs for r = 1 (and n > 3), where the value of $N_h$ for RC can be much smaller than for RU and a positive focus, and this difference increases with n and decreases with v. The minimum value of $N_h$, however, occurs for the most informative conditions of both of these variables, that is, positive focus and RC.
A final comment concerning the calculation of $N_h$ at the problem outset relates to the possibility of redundant dimensions. While partially correlated dimensions do not influence the present expressions, completely correlated dimensions count as only one independent dimension. Thus, if the values of three dimensions (e.g., number, height, and color) are completely correlated, and one value combination characterizes positives (e.g., three tall red figures), these dimensions count as only one relevant dimension in the Table 1 formulae. The same holds true for completely correlated irrelevant dimensions.
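
The initial set sizes discussed in this section can be tabulated directly. The following is a minimal sketch in Python, assuming that the six pretask expressions take the forms implied by the Appendix derivations (Expressions 1-6); the function name `initial_hypotheses` and its argument names are illustrative and do not come from the original analysis.

```python
from math import comb, log2


def initial_hypotheses(n, v, r=None, focus="none"):
    """Initial N_h for a conjunctive rule.

    n: number of dimensions, v: values per dimension.
    r: number of relevant dimensions if known (RC); None means RU.
    focus: "none", "positive", or "negative".
    """
    if r is None:                      # Relevance Uncertainty
        no_focus = (v + 1) ** n - 1    # Expression 1 (per Appendix item 1)
        positive = 2 ** n - 1          # Expression 3
    else:                              # Relevance Certainty
        no_focus = comb(n, r) * v ** r # Expression 2
        positive = comb(n, r)          # Expression 4
    if focus == "none":
        return no_focus
    if focus == "positive":
        return positive
    if focus == "negative":
        # A negative focus removes exactly the hypotheses a positive
        # focus would have left (Expressions 5 and 6).
        return no_focus - positive
    raise ValueError("focus must be 'none', 'positive', or 'negative'")


def uncertainty_bits(n_h):
    """U(N_h) = log2 N_h, in bits."""
    return log2(n_h)
```

For example, with n = 4 and v = 2 the positive focus leaves 15 hypotheses under RU and 6 under RC with r = 2, illustrating the point that RC with a positive focus is the most informative combination.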


$N_h$ FOR DIFFERENT KINDS OF STIMULUS SEQUENCES

The concern of this section is the determination of $N_h$ as stimulus information permits the elimination of hypotheses from the initial set, as given by the expressions in the preceding section. At the outset, it should be pointed out that there is no known exact analytical solution for determining the reduction of $N_h$ for a mixed sequence of positives and negatives, the kind of sequence most often used in experimental research. Nevertheless, expressions having useful applications are presented here for certain restricted kinds of sequences which are of interest in their own right.


Simultaneous Scanning Strategy 


In the previous section, utilization of pre- 
task information was assumed to be inde- 
pendent of the choice of a particular problem 
strategy. Here, however, it is desirable to 
determine the maximum possible reduction in 
$N_h$ as a function of stimulus information, and 
the optimally efficient simultaneous scanning 
strategy identified by Bruner et al. is the 
appropriate reference choice. 

The general operating procedure of this ideal 
strategy, as illustrated by an example in 
Table 2, is first to generate all the possible 
hypotheses, given the pretask information, and 
then to eliminate hypotheses from this set on 
the basis of information contained in sub- 
sequent stimuli. Positive stimuli are compared 
to each other, and hypotheses are eliminated 
which specify as relevant the noncommon 
dimensions among positives (cf. Trials 1 and 
3 of the example). Negative stimuli are in- 
formative to the extent that they have values 
in common with positives. All combinations 
of such common values (for RU) or all 
r-tuples (for RC) are eliminated as hypotheses 

(cf. Trials 2, 5, and 6). This information 
processing is continued until there is only 
one remaining hypothesis. 
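
The procedure just described can be sketched in a few lines of Python. This is an illustrative simulation, not the author's program: it assumes a positive focus and RC, represents each hypothesis as a string with "x" for an irrelevant dimension, and simply keeps every hypothesis that is consistent with each classified stimulus. The trial list is the stimulus sequence of the Table 2 example; the function names are hypothetical.

```python
from itertools import combinations


def rc_hypotheses_from_focus(focus, r):
    """All r-dimensional hypotheses whose values are those of the positive focus."""
    n = len(focus)
    return [
        "".join(focus[i] if i in dims else "x" for i in range(n))
        for dims in combinations(range(n), r)
    ]


def fits(hypothesis, stimulus):
    """True if the stimulus has every value the hypothesis specifies."""
    return all(h == "x" or h == s for h, s in zip(hypothesis, stimulus))


def simultaneous_scan(trials, r):
    """Return the list of surviving hypotheses after each trial."""
    focus = trials[0][0]               # first stimulus is the positive focus
    remaining = rc_hypotheses_from_focus(focus, r)
    history = []
    for stimulus, label in trials:
        should_fit = label == "P"      # positives must fit, negatives must not
        remaining = [h for h in remaining if fits(h, stimulus) == should_fit]
        history.append(list(remaining))
    return history


TABLE_2_TRIALS = [
    ("0000", "P"), ("0100", "N"), ("0001", "P"),
    ("1101", "N"), ("0101", "N"), ("1001", "N"),
]
```

Running `simultaneous_scan(TABLE_2_TRIALS, r=2)` reduces the six initial hypotheses to a single survivor on the sixth trial, and shows that the negatives of Trials 4 and 5 eliminate nothing new.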

TABLE 2

RESULTS OF THE OPERATION OF SIMULTANEOUS SCANNING FOR A SAMPLE PROBLEM INVOLVING FOUR DIMENSIONS, RC (r = 2), AND POSITIVE FOCUS

Trial no. | Stimulus | Classification(a) | Eliminated hypotheses | Remaining possible hypotheses(b)
1         | 0000     | P                 |                       |
2         | 0100     | N                 |                       |
3         | 0001     | P                 |                       |
4         | 1101     | N                 |                       |
5         | 0101     | N                 |                       |
6         | 1001     | N                 |                       |

(a) P = positive; N = negative.
(b) The symbol "x" indicates irrelevant dimension in the hypothesis; the problem involves two relevant dimensions.

This strategy may be contrasted to a "focusing" strategy (cf. Bruner et al.) in which hypotheses are never directly considered but information is gained by logical deductions concerning the relevance or irrelevance of dimensions. In the presently formulated strategy of optimal focusing, information from positives is utilized by logically intersecting these stimuli to determine the irrelevant dimensions. Negatives are compared to a working hypothesis which initially presumes the relevance of the values present in the first positive (e.g., "0000" for the Table 2 example), where these entries are appropriately modified as dimensions are deduced to be irrelevant or relevant (e.g., "000x" for Trial 3 of the Table 2 example, where the underlining indicates a deduced relevant dimension). When a negative differs in only one value from the nonirrelevant entries in this working hypothesis, the relevance of the noncommon dimension may be deduced. The efficiency of this strategy (estimated in unpublished computer-simulation work to be within 6% of simultaneous scanning performance) derives both from the fact that negatives can be informative even when they differ in more than one value from positives (unlike other focusing strategies; cf. Bruner et al.) and from the capability of utilizing previously uninformative negatives when new irrelevant dimensions are determined, that is, previous negatives are reprocessed.

A compound of both these strategies is 
possible in which the simpler optimal focusing 
strategy is employed whenever possible, and 
simultaneous scanning is used only when 
relevance deductions cannot be made. The 
only assumption made in the following treat- 
ment is that the strategy employed utilizes all 
the available information from all stimuli to 
maximally reduce the uncertainty concerning 
the correct hypothesis. 


$N_h$ for an All Positive Sequence


The most important parameter influencing the values of $N_h$ for this and other sequence types is d, the number of dimensions for which two stimuli have different values. For positive stimuli, the range of d is from 1 to (n - r), the number of irrelevant dimensions. Expressions 7 and 8 in Table 3 give the formulae for determining $N_h$ in a two-stimulus sequence involving a positive focus and a subsequent positive of difference d from the focus. As indicated by Expression 8, when the second positive has d = (n - r) differences from the focus in the RC condition, all but the correct hypothesis may be eliminated. When d < (n - r), however, more than one positive following the focus will be required to permit deduction of the correct rule. In particular, when d = 1, and a new dimension is varied by each subsequent stimulus, a sequence of (n - r) positives following the positive focus is required for deduction of the correct hypothesis.


TABLE 3

EXPRESSIONS GIVING $N_h$ AS A FUNCTION OF PROBLEM PARAMETERS FOLLOWING A POSITIVE FOCUS, FOR THE KNOWLEDGE OF r AND CLASSIFICATION OF THE SECOND STIMULUS CONDITIONS

Classification of second stimulus | Knowledge of r: RU        | Knowledge of r: RC
Positive                          | $N_h = 2^{n-d} - 1$ [7]   | $N_h = \binom{n-d}{r}$ [8]
Negative                          | $N_h = 2^n - 2^{n-d}$ [9] | $N_h = \binom{n}{r} - \binom{n-d}{r}$ [10]


In contrast to the RC condition, with RU, a sequence of positives in which all irrelevant dimensions are varied results not in elimination of all but the correct hypothesis, but leads to reduction to some minimum set which cannot be reduced further by positive stimuli alone. This is shown by the following expression:

Minimum $N_h$ (RU, positives) = $2^r - 1$ [11]
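
The Table 3 expressions can be checked against a direct enumeration of hypotheses. The sketch below is an illustration under stated assumptions: hypotheses are treated as sets of dimensions whose values are those of the positive focus, and the second stimulus is taken to differ from the focus on exactly d dimensions. The function names `table3` and `brute_force` are hypothetical.

```python
from itertools import combinations
from math import comb


def table3(n, r, d, second, known_r):
    """N_h after a positive focus and a second stimulus (Expressions 7-10)."""
    if second == "P":
        return comb(n - d, r) if known_r else 2 ** (n - d) - 1
    if known_r:
        return comb(n, r) - comb(n - d, r)
    return 2 ** n - 2 ** (n - d)


def brute_force(n, r, d, second, known_r):
    """Count surviving hypotheses by enumeration instead of by formula."""
    sizes = [r] if known_r else range(1, n + 1)
    changed = set(range(d))            # dimensions on which the second stimulus differs
    survivors = 0
    for size in sizes:
        for dims in combinations(range(n), size):
            # A hypothesis fits the second stimulus only if none of its
            # dimensions was changed relative to the focus.
            fits_second = changed.isdisjoint(dims)
            if fits_second == (second == "P"):
                survivors += 1
    return survivors
```

Agreement between the two functions over a range of n, r, and d supports the reconstruction of Expressions 7-10 given in Table 3.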


$N_h$ for a Positive Focus and Subsequent Negatives


In a negative stimulus following a positive focus, at least one of the relevant dimensions must be different, and all dimensions may be different, such that $1 \le d \le n$. It can be seen from Expressions 9 and 10 in Table 3 that the largest number of hypotheses is eliminated when d = 1, while no hypotheses are eliminated when d = n for RU, or when (n - d) is less than r for RC. Unlike a positive sequence, the most informative negative following a positive focus does not lead to elimination of all but the correct hypothesis in RC (except when r = 1). A negative with d = 1 permits reduction of $N_h$ to those hypotheses which specify as relevant the noncommon dimension between the negative and the focus (with the relevant dimensional value being that occurring in the focus). Complete reduction of $N_h$ by means of negative stimuli following a positive focus, therefore, requires at least r negatives with d = 1, where a different noncommon dimension is varied by each.

In the RU condition, just as for a sequence of positive stimuli, a sequence of negatives following a positive focus reduces $N_h$ only to some minimum value, which cannot be reduced by additional negative information:

Minimum $N_h$ (RU, negatives following positive focus) = $2^{n-r}$ [12]


$N_h$ for an All Negative Sequence


For a sequence involving no positives, the informational value of a subsequent negative following a negative focus increases with d, rather than decreases as in the case of a positive focus and subsequent negatives. The initial negative focus eliminates $\binom{n}{r}$ or $2^n - 1$ hypotheses from the initial set given by Expression 2 (RC) or Expression 1 (RU), respectively. However, only in the RC condition for r = 1 or r = n can subsequent negatives continue to eliminate this maximum number of hypotheses (a total of v such stimuli can be maximally informative for r = 1, while every negative is informative for r = n). For RU, or, generally, for 1 < r < n, some smaller number of hypotheses is eliminated due to the fact that at some point in the sequence (depending primarily on v), subsequent negatives cannot be chosen without overlapping dimensional values of previous negatives, resulting in the redundant elimination of hypotheses (e.g., compare the negatives of Trials 2 and 5 in Table 2).

At present, there is no known general solution for determining the change in $N_h$ as a function of n, r, and v for random negative sequences. Exact determination of the changing $N_h$ values for a particular negative sequence (or a mixed sequence, for that matter) presently can be accomplished only by simulation of an optimal strategy for the sequence of interest and computing $N_h$ on each trial.


Estimation of $N_h$ in a Mixed Sequence


Despite the necessity of simulation for determining the values of $N_h$ in a sequence, some analytical assistance can be provided for estimating $N_h$ following a positive focus with subsequent negative or mixed sequences. These analyses require that the relatively simple optimal focusing strategy be simulated up to and including the kth stimulus for which $N_h$ is to be estimated so that the number of deduced relevant and irrelevant dimensions can be determined. The author has found that this can usually be accomplished quickly and easily by "hand" simulations (rather than computer simulation) by beginning with an intersection of the positives on or preceding the trial for which $N_h$ is to be estimated to compute the irrelevant entries for the working hypothesis. Previous negatives are then compared to the resulting working hypothesis to determine how many relevance deductions can be made (i.e., how many previous negatives differ uniquely from the working hypothesis in only one dimensional value).

Given that the simulation results in the knowledge that i dimensions are irrelevant and j dimensions are relevant, an estimate of the number of hypotheses remaining after k stimuli, $N_h^k$, for RC and RU is given by:

$N_h^k$ (RC, optimal focusing) = $\binom{n-i-j}{r-j}$ [13]

$N_h^k$ (RU, optimal focusing) = $2^{n-i-j} - f(j)$ [14]

where f(j) in Expression 14 is defined to be 1 if j = 0, and 0 if j > 0.
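
The "hand" simulation and the two estimates can be sketched as follows. This is a simplified illustration, assuming the first trial is a positive focus, that irrelevant dimensions are found by intersecting all positives seen so far, and that a dimension is deduced relevant when some negative differs from the working hypothesis in exactly one nonirrelevant value. The function names are hypothetical.

```python
from math import comb


def focusing_deductions(trials):
    """Return (i, j): numbers of dimensions deduced irrelevant and relevant."""
    positives = [s for s, label in trials if label == "P"]
    negatives = [s for s, label in trials if label == "N"]
    focus = positives[0]
    n = len(focus)
    # Intersection of positives: any dimension on which positives disagree
    # cannot be relevant.
    irrelevant = {k for k in range(n) if any(p[k] != focus[k] for p in positives)}
    relevant = set()
    for neg in negatives:              # all earlier negatives are reprocessed
        diff = [k for k in range(n) if k not in irrelevant and neg[k] != focus[k]]
        if len(diff) == 1:             # single difference: that dimension is relevant
            relevant.add(diff[0])
    return len(irrelevant), len(relevant)


def estimate_rc(n, r, i, j):
    """Expression 13."""
    return comb(n - i - j, r - j)


def estimate_ru(n, i, j):
    """Expression 14, with f(j) = 1 if j == 0 else 0."""
    return 2 ** (n - i - j) - (1 if j == 0 else 0)
```

Applied to the Table 2 stimulus sequence (four dimensions, r = 2), the deductions after Trial 3 are i = 1 and j = 1, giving an RC estimate of two remaining hypotheses; after Trial 6 they are i = 1 and j = 2, giving one.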

These formulae may occasionally underestimate $N_h^k$ (particularly for small k), since account is not taken of potential information in negatives which differ in more than one value from the working hypothesis (whereas the simultaneous scanning strategy would utilize all information). The formulae given in Table 3 may also be modified to permit estimation of $N_h$ on Trial (k + 1) as a function of d by substituting (n - i - j) for n, and (r - j) for r. It should be noted, however, that reduction of $N_h$ by the next stimulus can occur only if the stimulus contains all j dimensional values deduced to be relevant, and if the stimulus is different in at least one of the values of the (n - i - j) dimensions whose status is unknown.


Distribution of Stimulus Information as a 
Function of d 


Expressions 7-10 indicate the general relationship that the informational value of successive stimuli is determined by their similarity 
(dimensional overlap) with previous stimuli. 
However, there may be several different 
stimuli at any point in a sequence which have 
the same potential informational value, and 
it is of interest to examine how the total 
amount of problem information is distributed 
over the stimulus population. 

The following Expressions (15 and 16) give the frequency of second stimuli which have d differences from a positive focus, where the second stimulus is positive and negative, $P_d^2$ and $N_d^2$, respectively (the superscript indicates the stimulus number):

$P_d^2 = (v-1)^d \binom{n-r}{d}$ [15]

$N_d^2 = (v-1)^d \left[\binom{n}{d} - \binom{n-r}{d}\right]$ [16]

Expression 15 indicates, for example, that for n = 6, r = 3, and v = 2, there are three possible second positives which have d = 2, any one of which could reduce the initial $U(N_h)$ by 2.3 bits of information in the RC condition. Similarly, solution of Expression 16 gives three alternative negatives having d = 1, each of which would result in a one-bit reduction in uncertainty.
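
The worked figures in this example can be reproduced with a short Python sketch. It assumes the forms of Expressions 15 and 16 as reconstructed above and uses Expressions 4, 8, and 10 for the RC uncertainty reduction; the function names are illustrative only.

```python
from math import comb, log2


def positives_at_distance(n, r, v, d):
    """Expression 15: second positives differing from the focus on d dimensions."""
    return (v - 1) ** d * comb(n - r, d)


def negatives_at_distance(n, r, v, d):
    """Expression 16: second negatives differing from the focus on d dimensions."""
    return (v - 1) ** d * (comb(n, d) - comb(n - r, d))


def rc_bits_gained(n, r, d, second):
    """Reduction in U(N_h) under RC from a second stimulus at distance d."""
    before = comb(n, r)                                    # Expression 4
    if second == "P":
        after = comb(n - d, r)                             # Expression 8
    else:
        after = comb(n, r) - comb(n - d, r)                # Expression 10
    return log2(before / after)


print(positives_at_distance(6, 3, 2, 2))        # 3
print(round(rc_bits_gained(6, 3, 2, "P"), 1))   # 2.3
print(negatives_at_distance(6, 3, 2, 1))        # 3
print(rc_bits_gained(6, 3, 1, "N"))             # 1.0
```

As a consistency check, summing the two frequencies over all d from 1 to n accounts for every stimulus other than the focus itself, that is, $v^n - 1$ stimuli.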

Examination of the dependence of the frequency values on the value of v indicates that for second positives, intermediate informational values, (n - r) > d > 1, are most probable for small v, while for large v (say, v > 8), the most informative positives are also the most probable, that is, positives having d = n - r are most frequent. For negatives, however, the least informative instances become the most probable as v increases.

Similar expressions may be given for the frequency of stimuli of a given class as a function of d on the kth trial, where i irrelevant and j relevant dimensions have been previously deduced (by the optimal focusing or similar strategy). The formulae given below permit determination of the number of stimuli on the kth trial which possess all of the j known relevant dimensional values and differ in one or more of the (n - i - j) remaining dimensions whose status has yet to be determined. Stimuli not meeting these conditions are uninformative.

$P_d^k = v^i (v-1)^d \binom{n-r-i}{d}$ [17]

$N_d^k = v^i (v-1)^d \left[\binom{n-i-j}{d} - \binom{n-r-i}{d}\right]$ [18]

To illustrate how the distribution of information in the remaining stimulus set changes as the status of dimensions becomes known, consider an RC example where n = 8, r = 4, and v = 2. Following the positive focus there are only four positive or negative stimuli with d = 1 which convey one bit of information. However, at the point where the status of i = j = two dimensions is known, there are eight d = 1 stimuli of both classes, each still conveying about one bit of information. In general, it appears that as the amount of uncertainty decreases in a problem, there is an increase in the probability that succeeding stimuli will be more informative, where this probability depends primarily on i for fixed v.


Optimal Selection Strategies 


Consideration of the role of d in the above analyses leads to the identification of three procedures for selecting stimuli which are optimal for different task situations. Associated with two of these are formulae giving the expected trial of solution, assuming that all of the information available in the stimuli is utilized.

The first situation of concern is when there is only one relevant dimension, and this is known by the subjects. Solution of Expressions 8 and 10 for r = 1 indicates that maximum information results when d = (n - 1), if the second stimulus is positive, and when d = 1, if the second stimulus is negative. Since the probabilities of both classes are equal (and assuming that the prior probabilities of dimensions being relevant are equal), the choice of d which maximizes reduction of $N_h$ for either outcome is an average of the two solutions, resulting in d = n/2. Thus, in this situation, the best selection procedure, called simple optimal selection (since the concept is a simple unidimensional one), is to pick a second stimulus in which half of the dimensional values in the first stimulus are changed (either positive or negative focus), and successive stimuli vary half of the remaining untested dimensional values on each trial. In this way $N_h$ is reduced by half by each stimulus, and solution will occur with $\log_2 n$ trials (this is the classic information-theory situation; cf. Garner, 1962). The extent to which subjects employ this selection strategy is not known, however.
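
The halving procedure can be illustrated with a small Python sketch. It assumes a positive focus, a single relevant dimension known to the solver only through stimulus classifications, and an arbitrary (here, left-to-right) choice of which half of the untested dimensions to change; the function name is hypothetical.

```python
from math import ceil, log2


def simple_optimal_selection(n, relevant):
    """Trials needed to isolate the one relevant dimension among n by halving.

    `relevant` is the index of the relevant dimension; it is used only to
    classify each selected stimulus, as the experimenter would.
    """
    candidates = list(range(n))        # dimensions whose status is untested
    trials = 0
    while len(candidates) > 1:
        changed = candidates[: len(candidates) // 2]
        trials += 1
        # Following a positive focus, the stimulus is negative exactly when
        # the relevant dimension is among those changed.
        if relevant in changed:
            candidates = changed
        else:
            candidates = candidates[len(changed):]
    return trials


# With n = 8 every possible relevant dimension is found in log2(8) = 3 trials.
print([simple_optimal_selection(8, k) for k in range(8)])
```

When n is not a power of two the halves are unequal, and the number of trials is at most the next whole number above $\log_2 n$.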

The second selection strategy is appropriate for the conditions of RU and RC for r > 1. Assuming that the probability distribution for choosing a value of r is uniform over the range $1 \le r \le n$, negatives will, on the average, be much more probable than positives. One approach to the selection problem for this situation, then, is to determine the procedure which maximizes information gained, given the expectation of a negative stimulus. Examination of the Table 3 expressions indicates that a value of d = 1 maximizes the information in negatives (i.e., a relevant dimension may be deduced for a negative outcome) and leads to an equivalent amount of information if the outcome is positive (i.e., an irrelevant dimension may be deduced). This procedure, called optimal selection here, in connection with focusing utilization of information, corresponds to the conservative focusing strategy identified by Bruner et al. and has been observed in several other studies (e.g., Laughlin, 1965; Miller, 1969).

Assuming optimal selection (and complete information utilization), formulae can be derived giving the expected trial of solution, T, for the RU and RC conditions (both with positive focus).

T (RU, optimal selection) = $U(N_h) = \log_2(2^n - 1) \approx n$ [19]

T (RC, optimal selection) = $r(n-r)(n+2)/[(r+1)(n-r+1)]$ [20]
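
Expression 20 can be compared with an exact enumeration. The sketch below rests on an interpretation that is an assumption, not something stated in the text: dimensions are tested one per trial (d = 1) in an order unrelated to their relevance, and the RC problem is solved as soon as either all r relevant dimensions or all (n - r) irrelevant dimensions have been identified. The function names are hypothetical.

```python
from fractions import Fraction
from itertools import combinations


def expected_trials_rc(n, r):
    """Exact mean solution trial, averaged over all placements of the
    r relevant dimensions in a fixed testing order."""
    placements = list(combinations(range(1, n + 1), r))
    total = 0
    for rel_positions in placements:
        irr_positions = [p for p in range(1, n + 1) if p not in rel_positions]
        if not irr_positions:
            continue                   # r = n: the focus alone gives the rule
        last_relevant = rel_positions[-1]
        last_irrelevant = irr_positions[-1]
        # Solved when either kind of dimension has been fully identified.
        total += min(last_relevant, last_irrelevant)
    return Fraction(total, len(placements))


def expression_20(n, r):
    """T (RC, optimal selection) as given in Expression 20."""
    return Fraction(r * (n - r) * (n + 2), (r + 1) * (n - r + 1))


print(expected_trials_rc(4, 2), expression_20(4, 2))   # 8/3 8/3
```

Under this stopping rule the enumeration and the closed form agree for small n, which supports the reading of Expression 20 given above.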


The last situation of interest is that of a 
negative focus and RU. Here the problem has 
two possible formulations, depending on the 
stimulus sequence: (a) if the sequence can 
consist of only negative stimuli, how can stimuli 
be chosen experimentally to minimize $U(N_h)$ 
at each point in the sequence; and (b) if there 
are no constraints on the occurrence of posi- 
tives, how can the subject select stimuli so 
as to minimize the number of trials before 
encountering a positive stimulus (at which 
point optimal selection is appropriate). The 
procedure discussed below is optimal for both 
formulations, but is considered with respect 
to the former. 

The correct selection of the next negative stimulus, based on the previous consideration of negative sequences, is one which minimizes the overall similarity of the stimulus to each of the previous negatives. In general, successive sets of stimuli are to be chosen so that the number of common dimensions among members of the set and between stimuli of the previous set are minimized. The difficulty of applying this principle depends on n and v and is simplest (for v > 2) when v = n (or at least when n = 0 modulo v). An example of an optimal sequence for n = v = 3 follows, where the dimensionality of the hypotheses eliminated is given in parentheses: "000, 111, 222" (all one-dimensional, and some two- and three-dimensional hypotheses are eliminated); "012, 120, 201, 021, 210, 102" (all one-dimensional hypotheses are redundantly eliminated; the remaining two-dimensional hypotheses are eliminated, and additional three-dimensional hypotheses are eliminated); "001, 112, 220, 010, 121, 202, 100, 211, 022" (the one- and two-dimensional hypotheses are redundantly eliminated; new three-dimensional hypotheses are eliminated, leaving only nine such remaining). It is evident that the expected length of a stimulus sequence produced by this principle increases with r. However, the author was unable to derive an analytical expression giving the expected minimum number of trials (as a function of the problem parameters) for eliminating all r-dimensional hypotheses or for encountering a positive stimulus.
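
The n = v = 3 example can be followed by enumeration. The sketch below is illustrative: it assumes RU with no prior focus, writes each hypothesis as a string with "x" for an irrelevant dimension, and removes every hypothesis that a negative stimulus satisfies. The function names are hypothetical.

```python
from itertools import product


def all_hypotheses(n, v):
    """Every conjunctive hypothesis over n dimensions with v values each."""
    symbols = "x" + "".join(str(k) for k in range(v))
    return ["".join(h) for h in product(symbols, repeat=n) if set(h) != {"x"}]


def fits(hypothesis, stimulus):
    return all(h == "x" or h == s for h, s in zip(hypothesis, stimulus))


def remaining_by_dimensionality(blocks, n=3, v=3):
    """After each block of negatives, count surviving hypotheses that
    specify 1, 2, ..., n relevant dimensions."""
    remaining = all_hypotheses(n, v)
    summary = []
    for block in blocks:
        for stimulus in block:
            # A negative stimulus rules out every hypothesis it satisfies.
            remaining = [h for h in remaining if not fits(h, stimulus)]
        counts = tuple(
            sum(1 for h in remaining if n - h.count("x") == size)
            for size in range(1, n + 1)
        )
        summary.append(counts)
    return summary


BLOCKS = [
    ["000", "111", "222"],
    ["012", "120", "201", "021", "210", "102"],
    ["001", "112", "220", "010", "121", "202", "100", "211", "022"],
]
print(remaining_by_dimensionality(BLOCKS))
```

The counts confirm the description in the text: the first block removes all one-dimensional hypotheses, the second removes the remaining two-dimensional ones, and after the third block only nine three-dimensional hypotheses remain.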

The above procedure, called optimal negative selection, is much easier to verbalize for problems in which v = 2: for any first stimulus, the second has the opposite values for all dimensions; the third differs from the second in terms of n/2 (randomly chosen) dimensional values; the fourth is the complement of the third; etc. This variant of the strategy has been previously suggested by Kochen (1960) and was followed only infrequently by subjects in unconstrained negative focus situations (Miller, 1969).


DISCUSSION


The importance of d in determining the 
informational value of stimuli, as shown by 
the present analyses, has several implications 
for the often encountered practice in the ex- 
perimental literature of distinguishing among 
stimuli simply in terms of their assignment to 
the two classes of positive and negative 
instances (cf. references below). First, the 
present analyses do in fact demonstrate that 
the expected informational value of randomly 
chosen positives is greater than that for nega- 
tives. Nevertheless, depending on d (and v), 
a particular set of positives may be either 
highly redundant or else permit maximum 
reduction in uncertainty (only two are neces- 
sary). Similarly, a set of negatives may either 
convey no information at all, or else lead to 
maximum uncertainty reduction with only r 
negatives following a positive focus. 

Second, description of experimental conditions in terms of the proportion of positives in a sequence (e.g., Haygood & Stevenson, 1967; Schvaneveldt, 1966; Whitman & Garner, 1963) provides only partial definition of their informational characteristics, since sequences having a fixed proportion of positives may differ greatly in the manner in which the uncertainty changes as a function of trials. Furthermore, even sequences having similar such functions may differ in the opportunities available for employing different information-utilization strategies. When negatives always differ from positives (or the working hypothesis) in more than one dimensional value, only a hypothesis-testing strategy such as simultaneous scanning is able to utilize any information they convey. Similarly, sequences may determine the opportunity for utilization of different focusing strategies. Thus, when negatives are always different from positives in more than one dimension, only optimal focusing can be employed when the negatives differ in a single value from the working hypothesis. A less efficient focusing strategy which requires the single difference between negatives and positives cannot be used. It may be that the common finding that subjects do not utilize negative information efficiently (cf. Bourne, 1966) can be attributed to the relative ease of a focusing procedure for using positive information as contrasted to a hypothesis-testing procedure possibly necessary to utilize the negative information, rather than to any marked inefficiency in utilizing negative information.


Third, a similar difficulty is found with descriptions of task conditions in terms of the "availability" of previous instances (e.g., Bourne, Goldstein, & Link, 1964; Pishkin & Wolfgang, 1965). The availability of a single positive stimulus may always be expected to be of some use in that the set of possible common values among positives is thereby displayed. While the additional availability of other positives with d > 0 always guarantees the possibility that irrelevant dimensions may be deduced, mere availability of negative stimuli does not necessarily offer the possibility of corresponding relevance deductions, nor even the possibility that any information is thereby made available at all (e.g., given a positive focus "0000," and v = 4, the negatives "1123, 1231, 2312," etc., 81 in all, convey no information). In comparing the effects of availability of previous positives and negatives on performance efficiency, some account must be taken of the information made available by these stimuli and of the opportunity for utilizing different strategies given this information.


A topic related to the above question of 
positives versus negatives concerns the inter- 
stimulus similarity relationships of a stimulus 
sequence. The present analysis indicates 
that the optimal manipulation of the similarity 
between stimulus pairs is to make positive- 
positive (or negative-negative) pairs minimally 
similar, and positive-negative pairs maximally 
similar. Empirical support for this prediction 
is given by studies in which transfer performance was facilitated by the optimal type of variation as compared to less optimal conditions (Anderson & Guthrie, 1966; Detambel & Stolurow, 1956). A similar question, yet to 
be empirically tested, is whether the maximally 
informative sequences generated by the three 
optimal selection procedures facilitate per- 
formance as compared to less optimal (e.g., 
random) presentations. 

In summary, the present analyses afford the 
opportunity for further characterizing con- 
junctive concept-learning tasks in terms of 
the information available to and the perform- 
ance expected from an optimal problem solver. 
Analyses of this kind provide “absolute” 
intraexperimental and interexperimental stan- 
dards for evaluating the efficiency of human 
performance. In the present case, the analyti- 
cal expressions permit a uniform description 
of most concept-learning task situations in 
terms of the number of initially possible 
hypotheses. The expressions themselves detail 
the information potential of positive and 
negative stimuli as a function of both the 
initial problem conditions and the similarity 
among stimuli within a stimulus sequence. The 
present discussion of the experimental liter- 
ature with respect to the analyses illustrates 
the necessity of providing highly specific 
informational descriptions of tasks as a basis 
for the further detailed understanding of the 
information-processing aspects of human per- 
formance. Finally, the expressions provide a 
means for generating a variety of informa- 
tionally configured problem conditions for the 
testing of hypotheses concerning human con- 


cept learning. 
APPENDIX
MATHEMATICAL DERIVATIONS


The derivations of the expressions, identified by the text numbering, are indicated very briefly below without reproduction of the final form. Comments given in parentheses refer to related previous work. The following mathematical identity is employed and referred to by letter.

$\sum_{i=1}^{n} \binom{n}{i} = 2^n - 1$ [a]


1. For each of the n dimensions which may be relevant, there are v possible relevant values. Since a dimension may be irrelevant also, the total number of possible states for each dimension is v + 1, and the nonpermissible case where all dimensions are irrelevant is subtracted.

2. There are $\binom{n}{r}$ ways of selecting r dimensions as relevant out of n variable dimensions; for each combination the r dimensions may be assigned any of v values. (This corresponds to Hovland's Expression 4.)

3. The correct concept is one of the combinations of i relevant values ($1 \le i \le n$) out of n possible values, or $\binom{n}{i}$, which is summed using Identity a. It is also derived by letting v = 1 in Expression 1. (Bruner et al. presented this expression in the nonsimplified summation form.)

4. The correct concept is one of the combinations of r values out of the n values present in the positive focus. It is also derived by letting v = 1 in Expression 2.

5. The negative eliminates $2^n - 1$ hypotheses from the set of $(v + 1)^n - 1$ initially possible hypotheses.

6. The negative eliminates $\binom{n}{r}$ hypotheses from the set of $\binom{n}{r} v^r$ initially possible hypotheses.

7. The remaining set of hypotheses fol- 
lowing a second positive is derived from all 
combinations of the (n − d) values common
to the two stimuli; summed using Identity a.
(Bruner et al., p. 114, give an erroneous expres- 
sion for the number of hypotheses eliminated.) 

8. A second positive with d differences
restricts the possible hypotheses to some com- 
bination of r out of the (n — d) remaining 
common dimensions. 

9. The negative permits elimination of 
hypotheses given by Expression 7 from the set 
given by Expression 3. (Bruner et al., p. 114, 
are slightly in error, as $2^{n-d} - 1$ hypotheses
are eliminated rather than $2^{n-d}$.)

10. The negative permits elimination of 
hypotheses given by Expression 8 from the set 
given by Expression 4.

11. The remaining hypotheses are all combinations
of i out of the remaining r common
dimensions, 1 ≤ i ≤ r; summed using Identity a.

12. The minimum set is composed of those
combinations of the r relevant dimensions with
the i = 0, 1, ..., (n − r) remaining irrelevant
dimensions; summed using Identity a, but
from i = 0.

13. Each new deduction reduces the total 
number of dimensions out of which some 
combination may be selected. When only i 
irrelevant dimensions are deduced, there are
$\binom{n-i}{r}$ remaining hypotheses; when only j
relevant dimensions are deduced, there are
$\binom{n-j}{r-j}$ remaining hypotheses. Both conditions
are represented in the single expression.

14. The set of possible hypotheses is the 
combination of the remaining (n − i − j)
untested values taken k = 0, 1, ...,
(n − i − j) at a time; summed using Identity
a but from k = 0. When no relevant dimen- 
sions are known, fC) is 1 since the hypothesis 
of all irrelevant dimensions is disallowed. 

15 and 16. The total number of stimuli, $v^{n}$,
may be expressed as a similarity function giving
the number of stimuli of different d from any
[...] ways. Thus,

$$v^{n} = \sum_{d=0}^{n} \binom{n}{d} (v - 1)^{d}.$$

The combinatorial expression $\binom{n}{d}$ may be
divided into two subexpressions, one for each of
the stimulus classes. For positive stimuli there
are $\binom{n-r}{d}$ ways of picking a second positive
with d differences out of (n − r) irrelevant
dimensions; that is, positives must have r
dimensions in common. For negative stimuli,
out of the total of $\binom{n}{d}$ possibilities, the d
differences may not be selected from just the
(n − r) irrelevant dimensions, or else the
stimulus would be positive. These $\binom{n-r}{d}$
possibilities must, therefore, be subtracted
from the total, and the negative component
of $\binom{n}{d}$ is thus $\binom{n}{d} - \binom{n-r}{d}$.

17 and 18. For positives, there are only a
total of (n − r − i) untested irrelevant dimensions
from which the d differences may be
selected. However, for each of the $\binom{n-r-i}{d}$
possibilities, the values of the irrelevant
dimensions may take on any value, for a total
of [...] ways. For negative stimuli, the d differences
may be selected from any of the
(n − i − j) untested dimensions, but selection
from just the (n − r − i) untested irrelevant
dimensions is not allowable, else the stimulus
would be positive.

19. In the RU condition the testing of each
successive dimension halves Nj on each trial
such that T = n, for the range 0 ≤ r ≤ n.
When, as in most problems, at least one dimension
must be relevant, this expression is slightly
inflated due to the possibility that, when r = 1,
the problem can be solved in (n − 1) trials if
all of these lead to irrelevance deductions.
Similarly, there is additional inflation when a
maximum value of r (rmax) is established and
made known to the subjects. They can cease
testing if they happen to discover all rmax
relevant dimensions before the nth trial. For
situations involving a value of rmax which is
small compared to n (say rmax < n/2), the
exact waiting-time expression should be derived.
This would consist of three components,
one for each of the following situations: (a)
the case where r = 1 (i.e., (n − 1/2)/Pr(r = 1));
(b) the case where 1 < r < rmax, where n is
the proper estimate of T (weighted by
Pr(1 < r < rmax)); and (c) the case where
r = rmax, where a component must be included
to account for the possibility of solving in
from rmax to (n − 1) trials (cf. the left-hand
component in the T expression given below).

20. In the RC condition, optimal selection
may cease when either the r relevant dimensions
or the (n − r) irrelevant dimensions have
been deduced. The waiting-time expression
for T thus has two components. Solution may
occur on the jth trial only if no more (and no
less) than (r − 1) relevant or (n − r − 1)
irrelevant dimensions were deduced on Trial
(j − 1) (where the remaining dimension is
found on Trial j). This can occur in $\binom{j-1}{r-1}$
ways for the first condition and in $\binom{j-1}{n-r-1}$
ways for the second. The probability is obtained
by dividing these combinatorial expressions
by $\binom{n}{r}$, which gives the total number
of ways r relevant dimensions may be
chosen from n dimensions. The probability
of solution on Trial j, times the value of j, then
gives the trial weight for that situation, where
summation is taken from r and (n − r) for
the two conditions to (n − 1):


$$T = \sum_{j=r}^{n-1} \frac{j \binom{j-1}{r-1}}{\binom{n}{r}} + \sum_{j=n-r}^{n-1} \frac{j \binom{j-1}{n-r-1}}{\binom{n}{r}}$$
The summation expressions may be eliminated 
by using the identity: 


$$\sum_{j=r}^{n-1} j \binom{j-1}{r-1} = r \binom{n}{r+1}$$
and the final expression is obtained by expand- 
ing the combinational expressions and simplify- 
ing algebraically. 
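
The counting arguments in this appendix can be checked by brute force for small cases. The Python sketch below is illustrative only and is not part of the original derivations: the function names and the small values of n, v, and r are assumptions chosen for the example, and the RC check assumes 1 ≤ r ≤ n − 1. It enumerates hypotheses directly for items 1 and 2, checks the decomposition of the v^n stimuli by number of differences used in items 15 and 16, and compares the two-component waiting-time sum of item 20 with a direct enumeration of every placement of the r relevant dimensions in the testing order.

```python
from fractions import Fraction
from itertools import combinations, product
from math import comb


def count_hypotheses(n, v, r=None):
    """Enumerate hypotheses; each dimension is irrelevant (0) or has one of v values."""
    total = 0
    for states in product(range(v + 1), repeat=n):
        relevant = sum(1 for s in states if s != 0)
        if relevant == 0:
            continue  # the all-irrelevant case is not permitted
        if r is None or relevant == r:
            total += 1
    return total


def stimuli_by_difference(n, v):
    """Stimuli differing from a fixed stimulus on exactly d dimensions, d = 0..n."""
    return [comb(n, d) * (v - 1) ** d for d in range(n + 1)]


def rc_expected_trials_formula(n, r):
    """Two-component waiting-time sum described in item 20 (1 <= r <= n - 1)."""
    total = Fraction(0)
    for j in range(r, n):  # last relevant dimension found on Trial j
        total += Fraction(j * comb(j - 1, r - 1), comb(n, r))
    for j in range(n - r, n):  # last irrelevant dimension found on Trial j
        total += Fraction(j * comb(j - 1, n - r - 1), comb(n, r))
    return total


def rc_expected_trials_enumerated(n, r):
    """Average stopping trial over all placements of the r relevant dimensions."""
    total = Fraction(0)
    for relevant in combinations(range(1, n + 1), r):
        irrelevant = [t for t in range(1, n + 1) if t not in relevant]
        # Testing stops once all relevant or all irrelevant dimensions are known.
        total += min(max(relevant), max(irrelevant))
    return total / comb(n, r)


if __name__ == "__main__":
    print(count_hypotheses(3, 2), (2 + 1) ** 3 - 1)
    print(count_hypotheses(3, 2, r=2), comb(3, 2) * 2 ** 2)
    print(sum(stimuli_by_difference(4, 3)), 3 ** 4)
    print(rc_expected_trials_formula(5, 2), rc_expected_trials_enumerated(5, 2))
```

Each printed pair should agree; exact fractions are used so that the RC comparison is not affected by rounding.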


COMMENT ON "GENERAL SHAPE FUNCTION MODEL OF LEARNING
WITH APPLICATIONS IN PSYCHOBIOLOGY"

JOEL P. WIESEN 1

King's College, Wilkes-Barre, Pennsylvania


A reader of a recent study by Warburton and Greeno is likely to form a gross
misconception of the probability of forming very dissimilar groups when the
randomization process is applied with small groups. This is due to a misapplication
of elementary probability theory. The model they presented was,
in part, justified by this erroneous approach. The other theoretical considerations
motivating their analysis are not affected by this error.


Warburton and Greeno (1970) present an 
innovative model and an involved technique 
for analyzing data from some standard learning
paradigms. These are presented as an
alternative to relying on the frequently unequal
groups which are allegedly the unavoidable
outcome when one forms small groups by
the random assignment of subjects (Ss) to
groups. They state that this (alleged) weak- 
ness leads to the commission of Type I errors. 
Their analysis is also presented as an approach 
to some theoretical problems, such as dis- 
tinguishing between learning curves of dif- 
ferent shapes. This paper questions only the 
statements made by Warburton and Greeno 
about the limits of the randomization process. 

Their lack of faith in the equality of groups 
in a randomized groups design is supported by 
some rather shocking probabilities. They 
state that when dealing with small groups (a 
common situation), the random assignment of 
Ss to two groups may too often yield groups 
which differ greatly before the treatment is 
applied. This initial difference, they continue, 
is often not brought to light, since Ss are often 
tested only after completing the experiment 
(i.e., after the independent variable is applied). 
If it were common to form such obviously 
biased groups, it would indeed be wise to con- 
sider randomization only for larger groups, 
and perhaps to look for a method (e.g. a 
model) which would enable one to cope with 
these differences when small groups are un- 
avoidable. (Note, however, that the statisti- 
cal procedures which require randomization 
take possible biases into account; Myers, 1966, 
p. 7.) 

More specifically, they present the following
quite plausible situation: An E has decided
to form two groups of eight Ss. It is assumed
that of the 16 available, 8 are "bright" and 8 are
"dull." To show how fallible randomization
can be, they present the probability of one of
the chosen groups consisting of eight Ss of one
kind, and the probability of one of the chosen
groups having at least seven of one kind. (If
one group formed from the 16 Ss has at least 7
dull Ss, the group consisting of the remaining
Ss must contain at least 7 bright Ss. These
are obviously two quite different groups.) The
probability figures they give are 1/128 and
1/14, respectively.

1 Requests for reprints should be sent to Joel P.
Wiesen, Department of Psychology, King's College,
Wilkes-Barre, Pennsylvania 18702.

How were these figures obtained? It seems
that Warburton and Greeno assumed that the
probability of choosing a bright S [P(B)] and
the probability of choosing a dull S [P(D)]
both equal 1/2 for an infinite population of Ss.
Then the probability of choosing a group of
eight of one kind from the total population
would be 2/256 = 1/128. Using the binomial
expansion (Adler & Roessler, 1964), the probability
of choosing a group with at least seven
alike would be 1/256 + 8/256 + 8/256 + 1/256
(i.e., the probability of selecting a group of
either eight bright, seven bright and one dull,
seven dull and one bright, or eight dull)
= .0703 ≈ 1/14. Thus, Warburton and Greeno
seem to confuse their explicit example with an
application of the randomization procedure to
selection of one group from an infinite
population with 1/2 bright and 1/2 dull animals.
The following is their statement of the problem.


[...] problem of learning experiments has been
[...] differences [...] the experimenter must
remove any bias by the procedure of
randomization. However, with the small samples
generally used in psychobiological research,
randomization can result in arrangements systematic
with respect to the uncontrolled factor. Suppose
[...] of 16 animals, half are "maze bright"
and half [...] dull, then there is a probability
of 1 in 128 that a group of eight will be all of one
kind, but only a probability of 1 in 14 that there will
be seven or more of one kind. In the usual experiment,
of course, the information about the uncontrolled
factor is not available, and so extreme random
arrangements of small populations cannot be avoided
[p. 348].

This states that given a finite population containing
eight bright and eight dull Ss, the
chance is 1 in 14 that two groups will be
formed with at least seven bright in one and
at least seven dull in the other. (Note again that if
there are at least seven of one kind in one group,
there must be at least seven of one kind in the
other group.) Presuming random selection,
these probabilities are grossly incorrect.

We can assess the probabilities of forming
such groups starting with two different assumptions.
First, we may randomly select
from an infinite population of which half are
bright and half are dull animals. Second, we
may choose randomly from a finite population
of 16 animals of which 8 are bright and
8 are dull. Warburton and Greeno seem
to present the latter case. The probabilities
associated with the chance formation of one
group of eight bright Ss and one group of
eight dull Ss are .00003 and .00016 for the
two approaches, respectively. The probabilities
associated with the chance formation
of one group with at least seven bright Ss
and one group with at least seven dull Ss
are .00247 and .00140 for the two approaches,
respectively. These probability values are
presented in Table 1 (with their unsimplified
solutions) along with the values given by
Warburton and Greeno (with their probable
origins). The probability of forming two such
groups is thus not about 7% but, at best, 1/4
of 1%.

TABLE 1
SEVERAL APPROACHES TO THE CALCULATION OF THE PROBABILITY OF FORMING
BIASED GROUPS BY RANDOMIZATION

Approach: Warburton & Greeno, 1970
  Eight bright in one group and eight dull in one group:
    $C_{8,0}P^{8} + C_{8,8}Q^{8}$ (a) = 1/128 (b) = .00781
  At least seven bright in one group and at least seven dull in one group:
    $C_{8,0}P^{8} + C_{8,1}P^{7}Q + C_{8,7}PQ^{7} + C_{8,8}Q^{8}$ (a)
    = 1/14 (actually 9/128) (b) = .07143 (actually .07031)

Approach: Infinite population of half bright and half dull Ss
  Eight bright in one group and eight dull in one group:
    $(C_{8,0}P^{8} \times C_{8,8}Q^{8}) + (C_{8,8}Q^{8} \times C_{8,0}P^{8})$
    = 1/32,768 = .00003
  At least seven bright in one group and at least seven dull in one group:
    $(C_{8,0}P^{8} \times C_{8,8}Q^{8}) + (C_{8,8}Q^{8} \times C_{8,0}P^{8})$
    $+ (C_{8,0}P^{8} \times C_{8,7}PQ^{7}) + (C_{8,7}PQ^{7} \times C_{8,0}P^{8})$
    $+ (C_{8,1}P^{7}Q \times C_{8,8}Q^{8}) + (C_{8,8}Q^{8} \times C_{8,1}P^{7}Q)$
    $+ (C_{8,7}PQ^{7} \times C_{8,1}P^{7}Q) + (C_{8,1}P^{7}Q \times C_{8,7}PQ^{7})$
    = 1/400 (actually 81/32,768) = .00247

Approach: Finite population of eight bright and eight dull Ss
  Eight bright in one group and eight dull in one group:
    [...] = .00016
  At least seven bright in one group and at least seven dull in one group:
    [...] = 1/715 = .00140

(a) Inferred from Warburton and Greeno (1970).
(b) Actual values given in Warburton and Greeno (1970).

It ca[...] and Greeno coul[...]ate a lack of theoretical
support [...]cedure was due to a [...]tion of a tricky
pro[...] the ensuing incorrect probabilities.
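
The infinite-population figures discussed in this comment, and the finite-population figure for two groups each entirely of one kind, can be reproduced with exact fractions. The sketch below is only an illustrative check and is not taken from the original article; the function and variable names are assumptions introduced for the example.

```python
from fractions import Fraction
from math import comb


def binomial_at_least(k_min, n=8, p=Fraction(1, 2)):
    """P(at least k_min of n Ss are of one given kind), infinite population."""
    return sum(comb(n, k) * p ** k * (1 - p) ** (n - k) for k in range(k_min, n + 1))


# One group of eight drawn from an infinite half-bright, half-dull population.
one_group_all_alike = 2 * binomial_at_least(8)           # 2/256 = 1/128
one_group_seven_alike = 2 * binomial_at_least(7)         # 18/256 = 9/128

# Two independent groups of eight from the same infinite population:
# one group extreme in one direction, the other extreme in the opposite direction.
two_groups_all_alike = 2 * binomial_at_least(8) ** 2     # 1/32,768
two_groups_seven_alike = 2 * binomial_at_least(7) ** 2   # 81/32,768

# Finite population of 8 bright and 8 dull Ss split into two groups of eight:
# only 2 of the C(16, 8) possible splits put all of one kind in each group.
finite_all_alike = Fraction(2, comb(16, 8))

if __name__ == "__main__":
    for name, value in [
        ("one group, eight alike", one_group_all_alike),
        ("one group, at least seven alike", one_group_seven_alike),
        ("two groups, eight alike (infinite)", two_groups_all_alike),
        ("two groups, at least seven alike (infinite)", two_groups_seven_alike),
        ("two groups, eight alike (finite)", finite_all_alike),
    ]:
        print(f"{name}: {value} = {float(value):.5f}")
```

The single-group values correspond to the 1/128 and 9/128 (reported as 1/14) attributed to Warburton and Greeno, and the two-group values to the 1/32,768 and 81/32,768 of the infinite-population approach.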
LETTERS TO THE EDITOR


RECOGNITION MEMORY AND SIGNAL
DETECTION ANALYSES


Though it has been several years since
Egan (1958) first applied the theory of signal
detection (TSD) to recognition memory, the
[...] years have experienced a boom in
[...] (see Banks, 1970). The most
to give specific recognition to the decision 
properties of recognition memory and to the 
role of subject-established criteria. Since 
recognition memory is so very prevalent in 
"real life" situations involving memory, this 
would appear to be an important contribution. 

Tulving and Madigan (1970) have claimed 
the application of TSD to recognition memory 
to be not only unimportant but actually 
frivolous. They state that TSD “provides 
good research opportunities for those students 
who like exotic quantitative techniques . . .
[p. 468]," and then go on to dismiss the application
of TSD on the basis of a totally
incorrect assertion, that “to the extent that 
the slopes of ROC curves depart from unity, 
measures of sensitivity and criterion are not 
independent [p. 468]." The various indexes 
of sensitivity discussed by Green and Swets
(1966, pp. 96-98) for ROC curves whose slopes
are not unitary are all criterion free. The 
independence of sensitivity and criterion mea- 
sures is a fundamental tenet of TSD: the only 
way to obtain a lack of independence is to 
make an egregious error in determining the 
measure of sensitivity, such as looking up d’ 
in Elliott's (1964) tables, which are based on 
the unitary slope assumption, when the actual 
slope of the ROC curve departs from unity. 

Tulving and Madigan would prefer simple 
information about hits and false alarms, but 
comparison of performance across tasks requires
some correction for false alarms or
criterion. The correction for false alarms 
suggested by Woodworth and Schlosberg 
(1954, p. 700) has been shown by Egan 
(1958) to be implicitly based on theoretical 
considerations that are greatly at odds with 
data. On the other hand TSD provides a 
criterion-free measure of sensitivity or memory. 
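
The separation of sensitivity from criterion can be illustrated for the unitary-slope case mentioned above. The following Python sketch is an illustration only, assuming an equal-variance Gaussian model (new items distributed N(0, 1), old items N(d', 1), with an "old" response whenever the observation exceeds the criterion); the function names and numerical values are hypothetical, and the indexes for ROC curves of non-unit slope discussed by Green and Swets are not covered.

```python
from statistics import NormalDist

_STD_NORMAL = NormalDist()  # standard normal, mean 0 and sd 1


def rates_from_model(d_prime, criterion):
    """Hit and false-alarm rates under the assumed equal-variance Gaussian model."""
    hit = 1 - _STD_NORMAL.cdf(criterion - d_prime)
    false_alarm = 1 - _STD_NORMAL.cdf(criterion)
    return hit, false_alarm


def d_prime_from_rates(hit, false_alarm):
    """d' = z(hit) - z(false alarm); appropriate only when the ROC has unit slope."""
    return _STD_NORMAL.inv_cdf(hit) - _STD_NORMAL.inv_cdf(false_alarm)


# Same underlying sensitivity, two different subject-established criteria.
lax = rates_from_model(d_prime=1.5, criterion=0.25)
strict = rates_from_model(d_prime=1.5, criterion=1.25)

if __name__ == "__main__":
    for label, (hit, fa) in [("lax", lax), ("strict", strict)]:
        print(label, round(hit, 3), round(fa, 3), round(d_prime_from_rates(hit, fa), 3))
```

Under these assumptions the hit and false-alarm rates both change with the criterion, while the recovered d' is the same for the two criteria, which is why raw hit and false-alarm rates alone do not permit comparison across tasks.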

Since the attack by Tulving and Madigan
is based at least in part on a patently incorrect
assertion about TSD, it seems necessary to
set the record straight. Perhaps by the word 
"exotic," Tulving and Madigan mean merely 
something that they do not understand. 